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Modifications of HIV Env, Gag, and Pol Enhance Immunogenicity 

for Genetic Immunization 

5 Field of the Invention 

The present invention relates to the field of molecular biology. The present 
invention discloses modified HIV Env, Gag, Pol, and Nef proteins, related 
nucleotide sequences, and usage for genetic immunization. 

Background of the Invention 

10 Protective immunity against human immunodeficiency virus- 1 (HTV-1) is 

likely to require recognition of linear and conformation epitopes from multiple HIV 
antigens. Whether these responses can be elicited more effectively by virion-like 
structures or fused CTL epitopes is unknown. 

The immune response to HTV infection in long-term non-progressors (Cao Y 

15 et al., 1995, N Engl J Med, 332:201-8; Pantaleo G et al., 1995, N Engl J Med, 

332:209-16) and HIV -exposed sex workers suggests that specific viral immunity 
may limit infection and the symptoms of disease. No single characteristic yet 
correlates with protective immunity, but studies in non-human of primates suggest 
that both humoral and cellular immunity are required for this response. Depletion of 

20 cytotoxic T cells (CTLs) in chronically-infected macaques enhances viremia (Jin X 
et al.,1999, J Exp Med, 189:991-8; Schmitz JE et al., 1999, Science, 283:857-60). 
In humans, higher CTL responses correlate with lower viral load and stabilization of 
clinical symptoms (Musey L et al., 1997, N Engl J Med, 337:1267-74; Ogg GS et 
al., 1998, Science, 279:2103-6). In animal models, passive transfer of neutralizing 

25 antibodies can also contribute to protection against virus challenge (Burton DR, 
Montefiori DC, 1997, AIDS, 11 Suppl A:S87-S98; Gauduin MC et al., 1997, Nat 
Med, 3:1389-93; Moore JP, Ho DD., 1995, AIDS, 9 Suppl A:S117-S136; Muster T 
et al., 1994, J Virol, 68:4031-4; Muster T et al., 1993, J Virol, 67:6642-7; Poignard 
P etal., 1996, Immunol Today, 17:239-46; Sattentau QJ, 1996, CurrOpin Immunol, 

30 8:540-5; Shibata R et al., 1999, Nat Med, 5:204-10). Neutralizing antibody 
responses can also be developed in HIV-infected individuals (Burton DR, 
Montefiori DC, 1997, AIDS, 11 Suppl A:S87-S98; Poignard P et al., 1996, 
Immunol Today, 17:239-46; Sattentau QJ., 1996, Curr Opin Immunol, 8:540-5) and 
are associated with lower viral loads in long-term non-progressors (Montefiori DC et 

35 al., 1996, J Infect Dis, 173:60-7). Though this neutralizing antibody response is 
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uncommon, it is directed largely against the Env protein of the virus (Robey WG et 

al., 1986, Proc Natl Acad Sci USA, 83:7023-7; Steimer KS et al., 1991, Science, 
254:105-8). 

In early human vaccine trials, gpl20 protein immunogens have yielded 
disappointing results: vaccine-induced antibodies have not been broadly neutralizing 
and have sometimes enhanced infection in vitro (Bolognesi DP, Matthews TJ, 1998, 
Nature, 391:638-9; Connor RI et al., 1998, J Virol, 72:1552-76; Haynes BF, 1996, 
Lancet, 348:933-7; VanCott TC et al., 1995, J Immunol, 155:4100-10). Monomelic 
gpl20 loses oligomer-dependent epitopes and does not include sequences in the 
ectodomain of the gp41 that become exposed during virus entry (Broder CC et al., 
1994, Proc Natl Acad Sci USA, 91:1 1699-703; Moore JP, 1995, Nature, 376:1 15; 
Muster T et al., 1993, J Virol, 67:6642-7). It is assumed that broadly neutralizing 
antibodies bind to native gpl20/gp41 complex on the surface of the virus rather than 
soluble gpl20 (Burton DR, Montefiori DC, 1997, AIDS, 11 SuppI A:S87-S98). 

The development of a cytotoxic T lymphocyte (CTL) response to viruses is 
often crucial to the outcome of infections. Lysis of infected cells prior to the 
production of progeny virions may limit virus burst size (Yang, O et ah, 1996, J. 
Virol., 70:5799-5806), and HIV specific CD8 + cytotoxic T lymphocytes (CTL) have 
been shown to be important in viral clearance and in the control of initial HIV-1 
spread (Borrow, P et al., 1994, J. Virol., 68:6103-6110; Yang, O et al., 1996, J. 
Virol. 70:5799-5806). CTL responses specific to HIV also contribute to reduction in 
viral load during acute and asymptomatic infection (Klein, MR, et al., 1995, J. Exp. 
Med. 181:1365-1372; Moss, PAH et al., 1995, Proc. Natl. Acad. Sci. USA, 92:5773- 
5777) and may be involved in protection against the establishment of persistent HIV 
infections (Rowland-Jones, SL et al., 1993, Lancet, 341:860-861; Rowland-Jones, 
SL et.al., 1995, Nat. Med., 1:59-64). High-frequency CTL responses to HIV-1 
correlated with Low viral load and slow disease progression in chronically infected 
individuals (Musey, L et al., 1997, N. Engl. J. Med., 337:1267-1274; Ogg, GS et al., 
1998, Science, 279:2103-2106.). More compelling evidence of an antiviral effect of 
CD8 cells was demonstrated in controlled studies in macaques, in which CD8 + cells 
were depleted in vivo using a monoclonal antibody. The viral loads in these animals 
increased or decreased as the CDS 4 " cells were depleted or reappeared, respectively 
(Jin, X et al., 1999, J. Exp. Med., 189:991-998; Schmitz, JE, et al., 1999, Science, 
283:857-860). Therefore, induction of a CTL response specific to these proteins 
represents a desirable response in an HIV-1 vaccine. 



02/32943 PCT/US01/25721 

HIV-1 internal structural and enzymatic proteins contain conserved domains 
that preserve their functions and thus exhibit less antigenic diversity that may elicit 
more effective CTL responses (Nixon, DF et al, 1988, Nature, 336:484-487.). 
Efficient and durable CTL responses require endogenous antigen synthesis and 
processing. Current vaccine delivery techniques include immunization with live, 
attenuated viruses, inactivated recombinant virus infection (Letvin, NL, 1998, 
Science, 280:1875-1880) or plasmid DNA expression vectors. A major obstacle in 
the induction of CTL responses with naked DNA or recombinant virus during 
development of an HIV vaccine is that the expression of HIV-1 structural and 
enzymatic genes is tightly regulated by the virus itself. The expression of these 
proteins is heavily dependent upon the existence of the Rev-responsive element 
(RRE) of HIV-1 in recombinant vectors (Cullen, BR, 1992, Microbiol Rev., 
56:375-394; Felber, BK et al, 1989, Proc Natl Acad Sci USA, 86:1495-1499). 
Poor expression is caused by the presence of AT rich inhibitory nucleotide 
sequences (INS) in the gag, gol and eny genes, which inhibit the nuclear export and 
efficient expression of unspliced HIV1 mRNAs. Early studies of DNA vaccination 
against HIV in mice required the inclusion of Rev in their expression vectors (Lu, S 
et al., 1995, Virology, 209:147-154; Okuda, K et al, 1995, AIDS Res. Hum. 
Retroviruses, 11:933-943; Wang, B et al., 1993, Proc. Natl. Acad. Sci. U. S. A 
90:4156-4160), but modification of INS has been shown to facilitate Rev- 
independent expression of HIV-1 Gag (Qiu, J-T et al., 1999, J. Virol., 73:9145- 
9152; zur Megede, J et al, 2000, J Virol 74:2628-2635), allowing detectable 
humoral and CTL responses against this protein (Qiu, J-T et al, 1999, J. Virol, 
73:9145-9152). These modified HIV-1 Gag genes produced viral-like particles of 
the expected density and morphology and induced an immune response to HIV-1 
Gag after DNA immunization in mice (zur Megede, J et al, 2000, J Virol, 74:2628- 
2635). 

Summary of the Invention 

Protective immunity against human immunodeficiency virus-1 (HIV-1) is 
likely to require recognition of linear and conformation epitopes from multiple HIV 
antigens, and whether these responses can be elicited more effectively by virion-like 
structures or fused CTL epitopes was previously unknown. Herein is provided a 
modified HIV Env with deletions in the cleavage site, fusogenic domain, and 
spacing of heptad repeats 1 and 2 to expose the core protein for optimal antigen 
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presentation and recognition. Additionally we provide a Gag-Pol or Gag^Pol-Nef 
fusion protein that is a polyprotein designed to maximize epitope presentation. The 
invention extends the mutation in HIV Env, Gag, Pol, and Nef to any HIV clade or 

■ 

strain and to related proteins of other viruses. Different combinations, different 
orders, and different variations on a theme are envisioned, the theme being to 
optimize presentation of epitopes that generate broad CTJL and antibody responses. 

More particularly, we have investigated the effect of specific mutations in 
human immunodeficiency virus type 1 (HIV-1) envelope (Env) on humoral and 
cellular immune responses after DNA vaccination. Mice were injected with plasmid 
expression vectors encoding HIV Env with modifications of conserved glycosylate 
sites or different COOH-terminal mutations intended to mimic a fusion intermediate. 
Elimination of conserved glycosylation sites did not substantially enhance humoral 
or CTL immunity. In contrast, a modified gpl40 with deletions in the cleavage site, 
fusogenic domain and spacing of heptad repeats 1 and 2 enhanced humoral 
immunity without reducing the efficacy of the CTL response. Because of its ability 
to stimulate the antibody response to native gpl60 without affecting cellular 
immunity, this modified gpl40 or a related derivative is envisioned to be a useful 
/ component of an AIDS vaccine. 

In addition, we have examined the immune response to HIV-1 Gag and Pol 
after plasmid DNA immunization with Rev-independent expression vectors 
encoding various forms of these proteins. Immune responses were analyzed after 
vaccination with four expression vectors, including Gag alone or Gag-Pol, both of 
which gave rise to virion-like particles (VLPs), compared to Pol alone or a Gag-Pol 
fusion protein that did not form VLPs. The Gag-Pol fusion protein induced the most 
broad and potent CTL responses to Gag and Pol in DNA-vaccinated mice, and this 
immunogen also readily elicited an antibody response to HIV-1 Gag and Pol 
determinants. Through its ability to induce broad CTL and antibody responses, this 
Gag-Pol fusion protein or a related derivative is envisioned to be a useful component 
of an AIDS vaccine. 
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Brief Description of the Drawing s 
Figure A. Comparison of GP2 with the structures of viral and cellular 
membrane fusion proteins. 

(A) Recombinant Ebola Zaire GP2, (B) Recombinant Mo-55 from the TM 
subunit of MoMuLv, (C) Low pH-treated HA2 from influenza virus, (D) 
Recombinant, proteolysis-resistant core of HIV-1 gp41, (E) Recombinant SIV gp41, 
NMR structure, (F) Recombinant core coiled segments of the SNARES syntaxin 1- 
A, synaptobrevin-II, and SNAP-25B. Weissenhorn et aL, 1998, Molecular Cell, 2, 
605-616. 

Figure I. Schematic representation of functional domains and mutations in 
HIV-1 Env glycoproteins* 

Full-length envelope polyprotein, gpl60, with the indicated features based on 
the amino acid residues of HXB2 is shown (top). Functional domains include the 
gpl20/gp41 cleavage site (residues 510/511), the fusion domain (512-527), the two 
heptad repeats (546-579 and 628-655), the transmembrane domain (684-705), and 
the cytoplasmic domain (706-856). The mutant forms of the envelope proteins are 
shown below the structure of gpl60. COOH deletions were introduced that 
terminate the envelope protein at positions 752, 704, or 680 to produce gpl50, 
gpl45, or gpl40, respectively. Two internal deletions that removed the cleavage 
site, the fusion domain, and the region between the two heptad repeats were 
introduced into gpl60, gpl50, gpl45, and gpl40. A further deletion in the COOH- 
terminal region at position 592 removed the second heptad repeat, the 
transmembrane domain, and the interspace region to produce gpl28ACFI. To 
disrupt potential glycosylation sites, asparagine (N) residues at eleven positions (88, 
156, 160, 197, 230, 234, 241, 262, 276, 289, and 295) were replaced with aspartic 
acid (D) residues in both gpl60 and gpl50. Versions of both gpl60 and gplSO were 
created with a total of 17 mutated glycosylation sites by including six additional N 
to D substitutions at positions 332, 339, 356, 386, 392, and 448. 
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Figure II. Comparison of the expression of the HIV-1 gpl 60 with codon- 
optimized gpl60. 

A. Expression of plasmids encoding Rev-dependent and Rev-independent 
codon-modified gpl60. Upper panel: expression of Rev-dependent viral gpl60 
(left) and codon-modified gpl60 (right) in transfected 293 cells. Lower panel: 
comparable expression of p-actin in these transfected cells. 

i 

B-D. Expression of mutant CXCR4- and CCR5-tropic HIV Env 
glycoproteins with COOH-terminal truncations (panels A and B, respectively). 
CXCK.4-tropic envelope proteins containing mutant glycosylation sites and mutant 
functional domains are shown (panels C and D, respectively). The indicated 
proteins were detected by immunoblotting as above. Cell lysates produced by 
transfection with vector containing no insert were used as controls (first lane in each 
panel). 



Figure in. Cytotoxicity of full-length gpl60 is eliminated by deletion of the 
COOH-terminal cytoplasmic domain. 

Cell rounding and detachment was not observed in control-transfected 293 
cells (A), in contrast to full-length gpl60 (B) and to a lesser extent in cells 
transfected with gpl 50 (C), in contrast to gpl45 (D) or gp 140 (E). 

Figure IV. Expression of soluble gpl40ACFI HIV-1 envelope variant 

Immunoprecipttation and Western blot analysis of supernatants from the 
indicated transfected cells. 

Figure V. Antibody response against HIV-1 envelope proteins in DNA 
immunized mice. 

A. Comparison of the antibody response in mice immunized with gpl40 
(ACF1) or other Env plasmid expression vectors. Sera were collected 2 weeks after 
the last immunization and used to immunoprecipitate codon-altered gpl 60 from 
lysates of transfected 293 cells. The quantitation of the immunoprecipitated gpl 60 
was done as described in Figure 5B. The average of the normalized data has been 
presented as a bar diagram. 

B. Antibody responses in mice immunized with different mutant Env 
expression vectors. Antisera from immunized mice were diluted in IP buffer and 1 

-6- 



02/32943 PCTAJSOl/25721 

ul of each diluted serum was used lo immune-precipitate codon-altered HIV- 1 gpl60 
. from lysates of transfected 293 cells as described in Figure 3A. The gels were 
scanned and the intensity of the gpl 60 band was determined by densitometry using 
the program Image Quant and presented relative to the intensity of gpl 60 
immunoprecipitated with positive control sera (rabbit anti-gpl60), which was used 
to normalize data between experiments. These data are presented graphically to 
facilitate comparison among groups. 

C. Antibody responses in mice immunized with gpl40 or gpl40 (ACFI) 
were determined by immunoprecipitation and Western blotting. Animals received 
two booster doses (100 ug) of the same plasmid, two weeks apart. Sera (1 ul) 
collected 2 weeks after the last immunization was used to immunoprecipitate codon- 
optimized HIV-1 gpl60 from lysates of transfected 293 cells containing 400 ug of 
total protein. Each lane corresponds to the sera from an animal immunized with 
either the control vector (lanes 1 and 2), CXCR4-tropic gpl40 (lanes 3-6), or 
plasmid that expresses gpl40 with the indicated mutant functional domains (lanes 7- 
10). A mouse monoclonal antibody to gpl60 (HIV-1 V3 Monoclonal (IJIB-V3-13), 
NIH AIDS Reagent Program) was used as a positive control (lane 1 1). 

Figure VI. CTL response against HIV-1 envelope proteins in DNA immunized 
mice. 

The CTL response to CXCR4-tropic Env and indicated deletion mutants is 
shown (A). The CTL responses to CXCR4-tropic envelope with glycosylate site 
and ACFI mutations are shown (B and C, respectively). Dependence of CTL 
activity on CD8 cells was shown by magnetic bead depletion using the indicated 
representative immunogens (D). Spleen cells were isolated from immunized mice 
two weeks after the final immunization and stimulated in vitro with irradiated cells 
expressing gp!60 with addition of hIL2 (5 U/ml) at day 4. The cytolytic activity of 
the restimulated spleen cells was tested after 7 days against V3 peptide-pulsed 
BC10ME cells. Similar findings were observed with target cells that stably express 
full-length Env. 

Figure VII. Schematic representation of HIV-1 Gag-Pol expression constructs. 

The protein sequences of Gag (amino acids 1-432) from HXB2 (GenBank 
accession number K03455) and Pol (amino acids 3-1003) from NL4-3 (GenBank 
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accession number M19921) were used to create a synthetic version of hGag-Pol 
using codons found in human cells. hGag-PolAFSAPr was made by modification of 
the frame shift site (FS) and inactivation of protease. For hPol, 432 amino acids 
were deleted from the NH 2 -terminal region of hGag-Pol and addition of an ATG 
codon. hGag was made by deletion of 925 amino acids from the COOH-termina! 
region of hGag-Pol. hGag-Pol, hGag-PolAFSAPr, hPol and hGag are expressed 
from the pNGVL-3 vector backbone. 



Figure VIII. HIV-1 Gag-Pol expression in transfected 293T cells and stably 
10 transfected CT26 and BC10 ME cells. 

Cell lysates from 293T cells transfected with pCMV AR8.2 containing viral 
Gag-Pol (vGag-Pol) (Naldirii, L et al., 1996, Science, 272:263-267), pNGVL-hGag, 
hPol, hGag-PolAFSAPr and hGag-Pol were separated by 4-15% gradient SDS- 
PAGE, transferred to nitrocellulose filters, and analyzed by immunoblotting with 
15 (A) human anti HIV-l-IgG, (B) monoclonal anti-p24, and (C) rabbit anti-RT. (D) 

Cell lysates from CT26 and BC10ME cells stably transduced with either hGag or 
hPol were analyzed with human anti HIV4-IgG. 

Figure IX. Transmission electron microscopy of HIV-1 immature virus-like 
20 particles (VLP) produced by transfected 293T. 

Cells were transfected with pNGVL-hGag 48 hours prior to harvesting and 
fixing (magnification 25,000X). 

> 

Figure X- Gag or Pol specific CTL response mediated by CD8 positive cells in 
25 immunized mice. 

Two weeks after mice were immunized with a control vector, hGag, hPol, 
hGag-PolAFSAPr, and hGag-Pol, splenic cells were harvested and sensitized with 
naive mouse splenic cells pulsed with Gag or Pol peptides. One week later, effector 
cells were tested for cytolytic activity in a 5-h 5, Cr release assay using 5l Cr-Iabeled 
30 BC10ME target cells that were pulsed for 2 hours with either (A) HIV-1 Gag 
peptides, or (B) HIV-1 Pol peptides. (C) CD4+ or CD8+ lymphocytes were 
depleted from splenic cells of immunized mice with anti-mouse- CD4+ or CD8+ 
Dynal beads according to the manufacturer's instructions. 
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Figure XI. Gag or Pol specific CTL response mediated by CD8 positive cells in 
immunized mice using stable expressing cell lines as target cells. 

Two weeks after immunization in mice, splenic cells were harvested and 
sensitized with naive mouse splenic cells pulsed with Gag or Pol peptides. One 
5 week later, effector cells were tested for cytolytic activity in a 5-h 5l Cr release assay 
using 51 Cr-Iabeled BC10ME target cells expressing either (A) HIV-1 Gag or (B) Pol 
protein. 

Figure XII. HIV-1 p24 antibody ELISA assays, HIV-1 immunoblotting and 
10 immunoprecipitation Western blotting. 

(A) An HIV-1 p24 antibody ELISA assay was performed by coating 96-well 
plates with 50 ^1 of purified recombinant HIV-ln IB p24 antigen at a concentration of 
2 jig/ml in PBS buffer, pH 7.4. 

(B) HIV-1 immunoblotting of strips containing HIV-1 proteins were 
1 5 incubated with pooled mouse sera at a dilution of 1 :25. Bands were visualized using 

the EGL western blotting detection reagent. 

(C) Immunoprecipitation and Western blotting of hPol gene-transfected 
293T cell lysates three days after transfection with RIPA buffer. The pooled mouse 
serum was diluted with IP buffer. After adding 10 jug of the cell lysate containing 

20 HIV-1 Pol protein, the reactions were incubated overnight on a rotator at 4°C. The 
next day, 250 ^1 of Protein G and A Sepharose beads (10%V/V in IP buffer) were 
added, and the reactions were incubated on a rotator for 2 hours at 4°C. The 
reactions were washed 4x with IP buffer, re-suspended with 30 nl of IX sample 
buffer, and then loaded onto SDS-PAGE. The reactions were transferred to an 

25 Immobilon P membrane, and then incubated with anti HIV-l-IgG. Bands were 
visualized using the ECL Western blotting detection reagent 
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Detailed Description of the Invention 

The present invention provides modifications of HIV Env, Gag, Pol, and Nef 
that enhance immunogenicity for genetic immunization. Both HIV and SIV are 
genetically related members of the lentivirus genus of the Retroviridae family. 
Lentivirus isolates from humans are grouped into one of two types, designated HIV- 
* 1 and H1V-2. A classification scheme recognizes nine subtypes (clades) of H1V-1 
(A through I) and five subtypes of HIV-2 (A through E). A compendium of HIV 
and SIV sequence information is found in a database prepared by Myers et al., Los 
Alamos* NM: Los Alamos National Laboratory. 
Nucleic Acid Molecules 

As indicated herein, nucleic acid molecules of the present invention may be 
in the form of RNA or in the form of DNA obtained by cloning or produced 
synthetically. The DNA may be double-stranded or single-stranded. Single-stranded 
DNA or RNA may be the coding strand, also known as the sense strand, or it may be 
the non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment. For example, 
recombinant DNA molecules contained in a vector are considered isolated for the 
purposes of the present invention. Further examples of isolated DNA molecules 
include recombinant DNA molecules maintained in heterologous host cells or 
purified (partially or substantially) DNA molecules in solution. Isolated RNA 
molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the 
present invention. Isolated nucleic acid molecules according to the present invention 
further include such molecules produced synthetically. 

Nucleic acid molecules of the present invention include DNA molecules 
comprising an open reading frame (ORF) of a wild-type HIV gene; and DNA 
molecules which comprise a sequence substantially different from those described 
above but which, due to the degeneracy of the genetic code, still encode an ORF of a 
wild-type HIV polypeptide. Of course, the genetic code is well known in the art. 
Degenerate variants optimized for human codon usage are preferred. 

In another aspect, the invention provides a nucleic acid molecule comprising 
a polynucleotide which hybridizes under stringent hybridization conditions to a 
portion of the polynucleotide in a nucleic acid molecule of the invention described 
above. By "stringent hybridization conditions" is intended overnight incubation at 42 
degree C in a solution comprising: 50% formamide, 5 times SSC (750 mM NaCl, 75 

-16- . 
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mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5 times Denhardt's 
solution, 10% dextran sulfate, and 20 jig/ml denatured, sheared salmon sperm DNA, 
followed by washing the filters in 0.1 times SSC at about 65 degree C. 

By a polynucleotide which hybridizes to a "portion" of a polynucleotide is 
5 intended a polynucleotide (either DNA or RNA) hybridizing to at least about 15 
nucleotides (nt), and more preferably at least about 20 nt, still more preferably at 
least about 30 nt, and even more preferably about 30-70 nt of the reference 
polynucleotide. 

By a portion of a polynucleotide of "at least 20 nt in length," for example, is 
10 intended 20 or more contiguous nucleotides from the nucleotide sequence of the 
reference polynucleotide. Of course, a polynucleotide which hybridizes only to a 
complementary stretch of T (or U) resides, would not be included in a 
polynucleotide of the invention used to hybridize to a portion of a nucleic acid of the 
invention, since such a polynucleotide would hybridize to any nucleic acid molecule 
15 containing a poly T (or U) stretch or the complement thereof (e.g., practically any 
double-stranded DNA clone). 

As indicated herein, nucleic acid molecules of the present invention which 
encode an HIV polypeptide may include, but are not limited to those encoding the 
amino acid sequence of the full-length polypeptide, by itself, the coding sequence 

20 for the full-length polypeptide and additional sequences, such as those encoding a 
leader or secretory sequence, such as a pre-, or pro- or prepro-protein sequence, the 
coding sequence of the full-length polypeptide, with or without the aforementioned 
additional coding sequences, together with additional, non-coding sequences, 
including for example, but not limited to introns and non-coding 5' and 3' sequences, 

25 such as the transcribed, non-translated sequences that play a role in transcription, 

mRNA processing, including splicing and polyadenylation signals, for example, 
ribosome binding and stability of mRNA; and additional coding sequence which 
codes for additional amino acids, such as those which provide additional 
functionalities. 

30 The present invention further relates to variants of the nucleic acid molecules 

of the present invention, which encode portions, analogs or derivatives of the HIV 
protein. Variants may occur naturally, such as a natural allelic variant. By an "allelic 
variant" is intended one of several alternate forms of a gene occupying a given locus 
on a genome of an organism. Genes II, Lewin, B., ed., John Wiley & Sons, New 
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York (1985). Non-naturally occurring variants may be produced using art-known 
mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions 
or additions, which may involve one or more nucleotides. The variants may be 
altered in coding regions, non-coding regions, or both. Alterations in the coding 
regions may produce conservative or non-conservative amino acid substitutions, 
deletions or additions. Especially preferred among these are silent substitutions, 
additions and deletions, which do not alter the properties and activities of the HIV 
polypeptide or portions thereof. Also especially preferred in this regard are 
conservative substitutions. 

Further embodiments of the invention include nucleic acid molecules 
comprising a polynucleotide having a nucleotide sequence at least 95% identical, 
and more preferably at least 96%, 97%, 98% or 99% identical to a nucleotide 
sequence encoding a polypeptide having the amino acid sequence of a wild-type 
HIV polypeptide or a nucleotide sequence complementary thereto. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence encoding a HIV polypeptide is 
intended that the nucleotide sequence of the polynucleotide is identical to the 
reference sequence except that the polynucleotide sequence may include up to five 
point mutations per each 100 nucleotides of the reference nucleotide sequence 
encoding the HIV polypeptide. In other words, to obtain a polynucleotide having a 
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 
5% of the nucleotides in the reference sequence may be deleted or substituted with 
another nucleotide, or a number of nucleotides up to 5% of the total nucleotides in 
the reference sequence may be inserted into the reference sequence. These mutations 
of the reference sequence may occur at the 5' or 3' terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, interspersed 
either individually among nucleotides in the reference sequence or in one or more 
contiguous groups within the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at least 
95%, 96%, 97%, 98% or 99% identical to the reference nucleotide sequence can be 
determined conventionally using known computer programs such as the Bestfit 
program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics 
Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 
53711). Bestfit uses the local homology algorithm of Smith and Waterman, 
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Advances in Applied Mathematics 2: 482-489 (1981), to find the best segment of 
homology between two sequences. When using Bestfit or any other sequence 
alignment program to determine whether a particular sequence is, for instance, 95% 
identical to a reference sequence according to the present invention, the parameters 
are set, of course, such that the percentage of identity is calculated over the full 
length of the reference nucleotide sequence and that gaps in homology of up to 5% 
of the total number of nucleotides in the reference sequence are allowed. 

The present application is directed to nucleic acid molecules at least 95%, 
96%, 97%, 98% or 99% identical to the nucleic acid sequences shown herein in the 
Sequence Listing which encode a polypeptide having HIV polypeptide activity. By 
"a polypeptide having HIV activity" is intended polypeptides exhibiting HIV 
activity in a particular biological assay. For example, Env, Gag, and Pol protein 
activity can be measured for changes in immunological character by an appropriate 
immunological assay. 

Of course, due to the degeneracy of the genetic code, one of ordinary skill in 
the art will immediately recognize that a large number of the nucleic acid molecules 
having a sequence at least 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid 
sequence shown herein in the Sequence Listing will encode a polypeptide "having 
HIV polypeptide activity." In fact, since degenerate variants of these nucleotide 
sequences all encode the same polypeptide, this will be clear to the skilled artisan 
even without performing the above described comparison assay. It will be further 
recognized in the art that, for such nucleic acid molecules that are not degenerate 
variants, a reasonable number will also encode a polypeptide having HIV 
polypeptide activity. This is because the skilled artisan is fully aware of amino acid 
substitutions that are either less likely or not likely to significantly effect protein 
function (e.g., replacing one aliphatic amino acid with a second aliphatic amino 
acid). 

For example, guidance concerning how to make phenotypically silent amino 
acid substitutions is provided in Bowie, J. U. et al., "Deciphering the Message in 
Protein Sequences: Tolerance to Amino Acid Substitutions," Science 247:1306-1310 
(1990), wherein the authors indicate that proteins are surprisingly tolerant of amino 

* > 

acid substitutions. 
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Polypeptides and Fragments 

The invention further provides a HIV polypeptide having the amino acid 
sequence encoded by an open reading frame (ORF) of a wild-type HIV gene, or a 
peptide or polypeptide comprising a portion thereof (e.g., gpl20). 

It will be recognized in the art that some amino acid sequences of the HIV 
polypeptides can be varied without significant effect of the structure or function of 
the protein. If such differences in sequence are contemplated, it should be 
remembered that there will be critical areas on the protein which determine activity. 

Thus, the invention further includes variations of the HIV polypeptide which 
show substantial HIV polypeptide activity or which include regions of HTV protein 
such as the protein portions discussed below. Such mutants include deletions, 
insertions, inversions, repeats, and type substitutions. As indicated, guidance 
concerning which amino acid changes are Likely to be phenotypically silent can be 
found in Bowie, J. U., et al., "Deciphering the Message in Protein Sequences: 
Tolerance to Amino Acid Substitutions," Science 247:1306-1310 (1990). 

Thus, the fragment, derivative or analog of the polypeptide of the invention 
may be (i) one in which one or more of the amino acid residues are substituted with 
a conserved or non-conserved amino acid residue (preferably a conserved amino 
acid residue) and such substituted amino acid residue may or may not be one 
encoded by the genetic code, or (ii) one in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which additional amino acids are 
fused to the mature polypeptide, such as an IgG Fc fusion region peptide or leader or 
secretory sequence or a sequence which is employed for purification of the mature 
polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are 
deemed to be within the scope of those skilled in the art from the teachings herein. 

As indicated, changes are preferably of a minor nature, such as conservative 
amino acid substitutions that do not significantly affect the folding or activity of the 
protein (see Table A). 
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Table A 

Conservative Amino Acid Substitutions 



10 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


lonizable: Acidic 


Aspartic Acid 




Glutamic Acid 


Ionizable: Basic 


Arginine 




Histidine 




Lysine 


Nonionizable Polar 


Asparagine 




Glutamine 




Selenpcystine 




Serine 




Threonine 


Nonpolar (Hydrophobic) 


Alanine 




Glycine 




Isoleucine 




Leucine 




Proline 




Valine 


Sulfur Containing 


Cysteine 




Methionine 



Of course, the number of amino acid substitutions a skilled artisan would 
make depends on many factors, including those described above. Generally 
speaking, the number of amino acid substitutions for any given HIV polypeptide 
will not be more than 50, 40, 30, 20, 10, 5 or 3. 

Amino acids in the HIV polypeptides of the present invention that are 
essential for function can be identified by methods known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, 
Science 244:1081-1085 (1989)). The latter procedure introduces single alanine 
mutations at every residue in the molecule. The resulting mutant molecules are then 
tested for biological activity such as changes in immunological character. 
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The polypeptides of the present invention are conveniently provided in an 

isolated form. By "isolated polypeptide" is intended a polypeptide removed from its 
native environment. Thus, a polypeptide produced and/or contained within a 
recombinant host cell is considered isolated for purposes of the present invention. 

Also intended as an "isolated polypeptide" are polypeptides that have been 
purified, partially or substantially, from a recombinant host cell or a native source. 
For example, a recombinant^ produced version of the HIV polypeptide can be 
substantially purified by the one-step method described in Smith and Johnson, Gene 
67:31-40(1988). 

The polypeptides of the present invention include a polypeptide comprising a 
polypeptide shown herein in the Sequence Listing; as well as polypeptides which are 
at least 95% identical, and more preferably at least 96%, 97%, 98% or 99% identical 
to those described above and also include portions of such polypeptides with at least 
30 amino acids and more preferably at least 50 amino acids. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a reference amino acid sequence of an HIV polypeptide is intended 
that the amino acid sequence of the polypeptide is identical to the reference 
sequence except that the polypeptide sequence may include up to five amino acid 
alterations per each 100 amino acids of the reference amino acid of the HIV 
polypeptide. In other words, to obtain a polypeptide having an amino acid sequence 
at least 95% identical to a reference amino acid sequence, up to 5% of the amino 
acid residues in the reference sequence may be deleted or substituted with another 

* 

amino acid, or a number of amino acids up to 5% of the total amino acid residues in 
the reference sequence may be inserted into the reference sequence. These 
alterations of the reference sequence may occur at the amino or carboxy terminal 
positions of the reference amino acid sequence or anywhere between those terminal 
positions, interspersed either individually among residues in the reference sequence 
or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequence shown 
herein in the Sequence Listing can be determined conventionally using known 
computer programs such the Bestfit program (Wisconsin Sequence Analysis 
Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 
575 Science Drive, Madison, Wis. 53711). When using Bestfit or any other 
sequence alignment program to determine whether a particular sequence is, for 
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instance, 95% identical to a reference sequence according to the present invention, 
the parameters are set, of course, such that the percentage of identity is calculated 
over the full length of the reference amino acid sequence and that gaps in homology 
of up to 5% of the total number of amino acid residues in the reference sequence are 
5 allowed. 

The polypeptides of the invention may be produced by any conventional 
means. Houghten, R. A. (1985) General method for the rapid solid-phase synthesis 
of large numbers of peptides: specificity of antigen-antibody interaction at the level 
of individual amino acids. Proc. Natl. Acad. Sci. USA 82:5131-5135. This 

10 "Simultaneous Multiple Peptide Synthesis (SMPS)" process is further described in 

US. Pat. No. 4,631,21 1 to Houghten et al. (1986). 

The present invention also relates to vectors which include the nucleic acid 
molecules of the present invention, host cells which are genetically engineered with 
the recombinant vectors, and the production of HIV polypeptides or fragments 

15 thereof by recombinant techniques. 

The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 

20 and then transduced into host cells. 

The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 

25 constructs will further contain sites for transcription initiation, termination and, in 

the transcribed region, a ribosome binding site for translation. The coding portion of 
the mature transcripts expressed by the constructs will preferably include a 
translation initiating at the beginning and a termination codon (UAA, UGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 

30 As indicated, the expression vectors will preferably include at least one 

selectable marker. Such markers include dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture and tetracycline or ampicillin resistance genes 
for culturing in E. coli and other bacteria. Representative examples of appropriate 
hosts include, but are not limited to, bacterial cells, such as E. coli, Streptomyces 

35 and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect cells such 
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as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and 
Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions 
for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and 
pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, 
pNH8A, pNH16a, pNH18A, pNH46A, available from Stratagene; and ptrc99a, 
PKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. Among preferred 
eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl and pSG available from 
Stratagene; and pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other 
suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid- 
mediated transfection, electroporation, transduction, infection or other methods. 
Such methods are described in many standard laboratory manuals, such as Davis et 
al., Basic Methods In Molecular Biology (1986). 

The HIV polypeptide can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography 
( M HPLC ,, ) is employed for purification. Polypeptides of the present invention 
include naturally purified products, products of chemical synthetic procedures, and 
products produced by recombinant techniques from a prokaryotic or eukaryotic host, 
including, for example, bacterial, yeast, higher plant, insect and mammalian cells. 
Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non- 
glycosylated. In addition, polypeptides of the invention may also include an initial 
modified methionine residue, in some cases as a result of host-mediated processes. 
Pharmaceutical Formulations, Dosages, and Modes of Administration 

The compounds of the invention may be administered using techniques well 
known to those in the art. Preferably, compounds are formulated and administered 
by genetic immunization. Techniques for formulation and administration may be 
found in "Remington's Pharmaceutical Sciences", 18 th ed., 1990, Mack Publishing 
Co., Easton, PA. Suitable routes may include parenteral delivery, such as 
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intramuscular, intradermal, subcutaneous, intramedullary injections, as well as, 

intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or 
intraocular injections, just to name a few. For injection, the compounds of the 
invention may be formulated in aqueous solutions, preferably in physiologically 
5 compatible buffers such as Hanks' solution, Ringer's solution, or physiological 
saline buffer. 

* » 

In instances wherein intracellular administration of the compounds of the 
invention is preferred, techniques well known to those of ordinary skill in the art 
may be utilized. For example, such compounds may be encapsulated into 

10 liposomes, then administered as' described above. Liposomes are spherical lipid 
bilayers with aqueous interiors. All molecules present in an aqueous solution at the 
time of liposome formation are incorporated into the aqueous interior. The 
liposomal contents are both protected from the external microenvironment and, 
because liposomes fuse with cell membranes, are effectively delivered into the cell 

15 cytoplasm. 

Nucleotide sequences of the invention which are to be intracellular^ 
administered may be expressed in cells of interest, using techniques well known to 
those of skill in the art. For example, expression vectors derived from viruses such 
as retroviruses, adenoviruses, adeno-associated viruses, herpes viruses, vaccinia 

20 viruses, polio viruses, or sindbis or other RNA viruses, or from plasmids may be 

used for delivery and expression of such nucleotide sequences into the targeted cell 
population. Methods for the construction of such expression vectors are well 
known. See, for example, Sambroolc et al., 1989, Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY, and Ausubel et al., 

25 1989, Current Protocols In Molecular Biology, Greene Publishing Associates and 

Wiley Interscience, NY. 

■ 

The invention extends to the use of a plasmid for primary immunization 
(priming) of a host and the subsequent use of a recombinant virus, such as a 
retrovirus, adenovirus, adeno-associated virus, herpes virus, vaccinia virus, polio 

30 virus, or sindbis or other RNA virus, for boosting said host, and vice versa. For 
example, the host may be immunized (primed) with a plasmid by DNA 
immunization and receive a boost with the corresponding viral construct, and vice 
versa. Alternatively, the host may be immunized (primed) with a plasmid by DNA 
immunization and receive a boost with not the corresponding viral construct but a 

35 different viral construct, and vice versa. 
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With respect to HIV Env, Gag, and Pol, protein sequences of the invention 
may be used as therapeutics or prophylatics (as subunit vaccines) in the treatment of 
AIDS or HIV infection. A therapeutically effective dose refers to that amount of the 
compound sufficient to result in amelioration of symptoms or a prolongation of 
5 survival in a patient. Toxicity and therapeutic efficacy of such compounds can be 
determined by standard pharmaceutical procedures in cell cultures or experimental 
animals, e.g., for determining the LD50 (the dose lethal to 50% of the population) 
and the ED50 (the dose therapeutically effective in 50% of the population). The 
dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
10 expressed as the ratio LD50/ED50. Compounds which exhibit large therapeutic 
indices are preferred. The data obtained from cell culture assays and animal studies 
can be used in formulating a range of dosage for use in humans. The dosage of such 
compounds lies preferably within a range of circulating concentrations that includes 
the ED50 with little or no toxicity. The dosage may vary within this range 
1 5 depending upon the dosage form employed and the route of administration utilized. 
For any compound used in the method of the invention, the therapeutically effective 
dose can be estimated initially from cell culture assays. A dose may be formulated 
in animal models to achieve a circulating plasma concentration range that includes 
the IC50 (e.g., the concentration of the test compound which achieves a half- 
20 maximal inhibition of viral infection relative to the amount of the event in the 
absence of the test compound) as determined in cell culture. Such information can 
be used to more accurately determine useful doses in humans. Levels in plasma may 
be measured, for example, by high performance liquid chromatography (HPLC). 

The compounds of the invention may, further, serve the role of a 
25 prophylactic vaccine, wherein the host produces antibodies and/or CTL responses 
against HIV Env, Gag and Pol, which responses then preferably serve to neutralize 
HIV viruses by, for example, inhibiting further HIV infection. Administration of the 
compounds of the invention as a prophylactic vaccine, therefore, would comprise 
administering to a host a concentration of compounds effective in raising an immune 
30 response which is sufficient to elicit antibody and/or CTL reponses to HIV Env, 
Gag, and Pol, and/or neutralize HIV, by, for example, inhibiting HIV ability to 
infect cells. The exact concentration will depend upon the specific compound to be 
administered, but may be determined by using standard techniques for assaying the 
development of an immune response which are well known to those of ordinary skill 
35 in the art. 
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The compounds may be formulated with a suitable adjuvant in order to 
enhance the immunological response. Such adjuvants may include, but are not 
limited to mineral gels such as aluminum hydroxide; surface active substances such 
as lysolecithin, piuronic polyols, polyanions; other peptides; oil emulsions; and 
5 potentially useful human adjuvants such as BCG and Corynebacterium parvum. 

Adjuvants suitable for co-administration in accordance with the present 
invention should be ones that are potentially safe, well tolerated and effective in 
people including QS-21, Detox-PC, MPL-SE, MoGM-CSF, TiterMax-G, CRL- 
1005, GERBU, TERamide, PSC97B, Adjumer, PG-026, GSK-1, GcMAF, B- 

10 alethine, MPC-026, Adjuvax, CpG ODN, Betafectin, Alum, and MF59 (see Kim et 
al., 2000, Vaccine, 18: 597 and references therein). 

Other contemplated adjuvants that may be administered include lectins, 
growth factors, cytokines and lymphokines such as alpha-interferon, gamma- 
Jnterferon, platelet derived growth factor (PDGF), gCSF, gMCSF, TNF, epidermal 

1 5 growth factor (EGF), IL-l, IL-2, IL-4, IL-6, IL-8, IL-1 0 and IL-12. 

For all such treatments described above, the exact formulation, route of 
administration and dosage can be chosen by the individual physician in view of the 
patient's condition. (See e.g., Fingl et al., 1975, in "The Pharmacological Basis of 
Therapeutics", Ch. 1 p. 1). 

20 It should be noted that the attending physician would know how to and when 

to terminate, interrupt, or adjust administration due to toxicity, or to organ 
dysfunctions. Conversely, the attending physician would also know to adjust 
treatment to higher levels if the clinical response were not adequate (precluding 
toxicity). The magnitude of an administered dose in the management of the viral 

25 infection of interest will vary with the severity of the condition to be treated and the 
route of administration. The dose and perhaps prime-boost regimen, will also vary 
according to the age, weight, and response of the individual patient. A program 
comparable to that discussed above may be used in veterinary medicine. 

The pharmacologically active compounds of this invention can be processed 

30 in accordance with conventional methods of galenic pharmacy to produce medicinal 
agents for administration to patients, e.g., mammals including humans. 

The compounds of this invention can be employed in admixture with 
conventional excipients, i.e., pharmaceutically acceptable organic or inorganic 
carrier substances suitable for parenteral, enteral (e.g., oral) or topical application 

35 which do not deleteriously react with the active compounds. Suitable 
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pharmaceutically acceptable carriers include but are not limited to water, salt 
solutions, alcohols, gum arabic, vegetable oils, benzyl alcohols, polyethylene 
glycols, gelatine, carbohydrates such as lactose, amylose or starch, magnesium 
stearate, talc, silicic acid, viscous paraffin, perfume oil, fatty acid monoglycerides 
and diglycerides, pentaerythritol fatty acid esters, hydroxy methylcellulose, 
polyvinyl pyrrolidone, etc. The pharmaceutical preparations can be sterilized and if 
desired mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, 
wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, cotoring, 
flavoring and/or aromatic substances and the like which do not deleteriously react 
with the active compounds. They can also be combined where desired with other 
active agents, e.g., vitamins. 

For parenteral application, which includes intramuscular, intradermal, 
subcutaneous, intranasal, intracapsular, intraspinal, intrasternal, and intravenous 
injection, particularly suitable are injectable, sterile solutions, preferably oily or 
aqueous solutions, as well as suspensions, emulsions, or implants, including 
suppositories. Formulations for injection may be presented in unit dosage form, e.g., 
in ampoules or in multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions, or emulsions in oily or 
aqueous vehicles, and may contain formulatory agents such as suspending, 
stabilizing and/or dispersing agents. Alternatively, the active ingredient may be in 
powder form for constitution with a suitable vehicle, e.g., sterile pyrogen-free water, 
before use. 

For enteral application, particularly suitable are tablets, dragees, liquids, 
drops, suppositories, or capsules. The pharmaceutical compositions may be prepared 
by conventional means with pharmaceutically acceptable excipients such as binding 
agents (e.g., pregelatinised maize starch, polyvinylpyrrolidone or hydroxypropyl 
methylcellulose); fillers (e.g., lactose, microcrystalline cellulose or calcium 
hydrogen phosphate); lubricants (e.g., magnesium stearate, talc or silica); 
disintegrants (e.g., potato starch or sodium starch glycolate); or wetting agents (e.g., 
sodium lauryl sulphate). The tablets may be coated by methods well known in the 
art. Liquid preparations for oral administration may take the form of, for example, 
solutions, syrups or suspensions, or they may be presented as a dry product for 
constitution with water or other suitable vehicle before use. Such liquid preparations 
may be prepared by conventional means with pharmaceutically acceptable additives 
such as suspending agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated 
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edible fats); emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., 
almond oil, oily esters, ethyl alcohol or fractionated vegetable oils); and 

4 

preservatives (e.g., methyl or propyl-p-hydroxybenzoates or sorbic acid). The 
preparations may also contain buffer salts, flavoring, coloring and sweetening agents 
5 as appropriate. A syrup, elixir, or the like can be used wherein a sweetened vehicle 
is employed. 

Sustained or directed release compositions can be formulated, e.g., liposomes 
or those wherein the active compound is protected with differentially degradable 
coatings, e.g., by microencapsulation, multiple coatings, etc. It is also possible to 

10 freeze dry the new compounds and use the lyophilizates obtained, for example, for 
the preparation of products for injection. 

For administration by inhalation, the compounds for use according to the 
present invention are conveniently delivered in the form of an aerosol spray 
presentation from pressurized packs or a nebulizer, with the use of a suitable 

15 propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a 
pressurized aerosol the dosage unit may be determined by providing a valve to 
deliver a metered amount. Capsules and cartridges of e.g. gelatin for use in an 
inhaler or insufflator may be formulated containing a powder mix of the compound 

20 and a suitable powder base such as lactose or starch. 

For topical application, there are employed as non-sprayable forms, viscous 
to semi-solid or solid forms comprising a carrier compatible with topical application 
and having a dynamic viscosity preferably greater than water. Suitable formulations 
include but are not limited to solutions, suspensions, emulsions, creams, ointments, 

25 powders, liniments, salves, aerosols, etc., which are, if desired, sterilized or mixed 
with auxiliary agents, e.g., preservatives, stabilizers, wetting agents, buffers or salts 
for influencing osmotic pressure, etc. For topical application, also suitable are 

i 

sprayable aerosol preparations wherein the active ingredient, preferably in 
combination with a solid or liquid inert carrier material, is packaged in a squeeze 
30 bottle or in admixture with a pressurized volatile, normally gaseous propellant, e.g., 
a freon. 

The compositions may, if desired, be presented in a pack or dispenser device 
which may contain one or more unit dosage forms containing the active ingredient. 
The pack may for example comprise metal or plastic foil, such as a blister pack. The 
35 pack or dispenser device may be accompanied by instructions for administration. 
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Genetic Immunization 

Genetic immunization according lo the present invention elicits an effective 
immune response without the use of infective agents or infective vectors. 
Vaccination techniques which usually do produce a CTL response do so through the 
5 use of an infective agent. A complete, broad based immune response is not generally 
exhibited in individuals immunized with killed, inactivated or subunit vaccines. The 
present invention achieves the full complement of immune responses in a safe 
manner without the risks and problems associated with vaccinations that use 
infectious agents. 

10 According to the present invention, DNA or RNA that encodes a target 

protein is introduced into the cells of an individual where it is expressed, thus 
producing the target protein. The DNA or RNA is linked to regulatory elements 
necessary for expression in the cells of the individual. Regulatory elements for DNA 
include a promoter and a polyadenylation signal. In addition, other elements, such as 
15 a Kozak region, may also be included in the genetic construct. 

The genetic constructs of genetic vaccines comprise a nucleotide sequence 
that encodes a target protein operably linked to regulatory elements needed for gene 
expression. Accordingly, incorporation of the DNA or RNA molecule into a living 
cell results in the expression of the DNA or RNA encoding the target protein and 
20 thus, production of the target protein. 

When taken up by a cell, the genetic construct which includes the nucleotide 
sequence encoding the target protein operably linked to the regulatory elements may 
remain present in the cell as a functioning extrachromosomal molecule or it may 
integrate into the cell's chromosomal DNA. DNA may be introduced into cells 
25 where it remains as separate genetic material in the form of a plasmid. Alternatively, 
linear DNA which can integrate into the chromosome may . be introduced into the 
cell. When introducing DNA into the cell, reagents which promote DNA integration 
into chromosomes may be added. DNA sequences which are useful to promote 
integration may also be included in the DNA molecule. Since integration into the 
30 chromosomal DNA necessarily requires manipulation of the chromosome, it is 
preferred to maintain the DNA construct as a replicating or non-replicating 
extrachromosomal molecule. This reduces the risk of damaging the cell by splicing 
into the chromosome without affecting the effectiveness of the vaccine. 
Alternatively, RNA may be administered lo the cell. It is also contemplated to 
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provide the genetic construct as a linear minichromosome including a centromere, 
telomeres and an origin of replication. 

The necessary elements of a genetic construct of a genetic vaccine . include a 
nucleotide sequence that encodes a target protein and the regulatory elements 
5 necessary for expression of that sequence in the cells of the vaccinated individual. 

The regulatory elements are operably linked to the DNA sequence that encodes the 
target protein to enable expression. 

The molecule that encodes a target protein is a protein-encoding molecule 
which is translated into protein. Such molecules include DNA or RNA which 
10 comprise a nucleotide sequence that encodes the target protein. These molecules 
may be cDNA, genomic DNA, synthesized DNA or a hybrid thereof or an RNA 
molecule such as mRNA. Accordingly, as used herein, the terms "DNA construct", 
"genetic construct" and "nucleotide sequence" are meant to refer to both DNA and 
RNA molecules. 

15 Th e regulatory elements necessary for gene expression of a DNA molecule 

include: a promoter, an initiation codon, a stop codon, and a polyadenylation signal. 
In addition, enhancers are often required for gene expression. It is necessary that 
these elements be operable in the vaccinated individual. Moreover, it is necessary 
that these elements be operably linked to the nucleotide sequence that encodes the 

20 target protein such that the nucleotide sequence can be expressed in the cells of a 

vaccinated individual and thus the target protein can be produced. 

Initiation codons and stop codons are generally considered to be part of a 
nucleotide sequence that encodes the target protein. However, it is necessary that 
these elements are functional in the vaccinated individual. 

25 Similarly, promoters and polyadenylation signals used must be functional 

within the cells of the vaccinated individual. 

Examples of promoters useful to practice the present invention, especially in 
the production of a genetic vaccine for humans, include but are not limited to 
promoters from Simian Virus 40 (SV40), Mouse Mammary Tumor Virus (MMTV) 
30 promoter, Human Immunodeficiency Virus (HIV) such as the HIV Long Terminal 
Repeat (LTR) promoter, Moloney virus, ALV, Cytomegalovirus (CMV) such as the 
CMV immediate early promoter, Epstein Barr Virus (EBV), Rous Sarcoma Virus 
(RSV) as well as promoters from human genes such as human Actin, human 
Myosin, human Hemoglobin, human muscle creatine and human metalothionein. 
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Examples of polyadenylation signals useful to practice the present invention, 
especially in the production of a genetic vaccine for humans, include but are not 
limited to SV40 polyadenylation signals and LTR polyadenylation signals. In 
particular, the SV40 polyadenylation signal which is in pCEP4 plasmid (Invitrogen, 
5 San Diego Calif.), referred to as the SV40 polyadenylation signal, can be used. 

In addition to the regulatory elements required for DNA expression, other 
elements may also be included in the DNA molecule. Such additional elements 
include enhancers. The enhancer may be selected from the group including but not 
limited to: human Actin, human Myosin, human Hemoglobin, human muscle 
10 creatine and viral enhancers such as those from CMV, RSV and EBV. 

Genetic constructs can be provided with mammalian origin of replication in 
order to maintain the construct extrachromosomally and produce multiple copies of 
the construct in the cell. Plasmids pCEP4 and pREP4 from Invitrogen (San Diego, 
Calif.) contain the Epstein Barr virus origin of replication and nuclear antigen 
15 EBNA-1 coding region which produces high copy episomal replication without 

integration. 

An additional element may be added which serves as a target for cell 
destruction if it is desirable to eliminate cells receiving the genetic construct for any 
reason. A herpes thymidine kinase (tk) gene in an expressible form can be included 

20 in the genetic construct. When the construct is introduced into the cell, tk will be 

produced. The drug gangcyclovir can be administered to the individual and that drug 
will cause the selective killing of any cell producing tk. Thus, a system can be 
provided which allows for the selective destruction of vaccinated cells. 

In order to be a functional genetic construct, the regulatory elements must be 

25 operably linked to the nucleotide sequence that encodes the target protein. 

Accordingly, it is necessary for the initiation and termination codons to be in frame 
with the coding sequence. 

Open reading frames (ORFs) encoding the protein of interest and another or 
other proteins of interest may be introduced into the cell on the same vector or on 

30 different vectors. ORFs on a vector may be controlled by separate promoters or by a 

single promoter. In the latter arrangement, which gives rise to a polycistronic 
message, the ORFs will be separated by translational stop and start signals. The 
presence of an internal ribosome entry site (IRES) site between these ORFs permits 
the production of the expression product originating from the second ORF of 
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interest, or third, etc. by internal initiation of the translation of the bicistronic or 
polycistronic mRNA. 

According to the invention, the genetic vaccine may be administered directly 
into the individual to be immunized or ex vivo into removed cells of the individual 
which are reimplanted after administration. By either route, the genetic material is 
introduced into cells which are present in the body of the individual. Routes of 
administration include, but are not limited to, intramuscular, intraperitoneal, 
intradermal, subcutaneous, intravenous, intraarterially, intraoccularly and oral as 
well as transdermal^ or by inhalation or suppository. Preferred routes of 
administration include intramuscular, intraperitoneal, intradermal and subcutaneous 
injection. Genetic constructs may be administered by means including, but not 
limited to, traditional syringes, needleless injection devices, or microprojectile 
bombardment gene guns. Alternatively, the genetic vaccine may be introduced by 
various means into cells that are removed from the individual. Such means include, 
for example, ex vivo transfection, electroporation, microinjection and 
microprojectile bombardment. After the genetic construct is taken up by the cells, 
they are reimplanted into the individual. It is contemplated that otherwise non- 
immunogenic cells that have genetic constructs incorporated therein can be 
implanted into the individual even if the vaccinated cells were originally taken from 
another individual. 

The genetic vaccines according to the present invention comprise about 1 
nanogram to about 1000 micrograms of DNA. In some preferred embodiments, the 

* 

vaccines contain about 10 nanograms to about 800 micrograms of DNA. In some 
preferred embodiments, the vaccines contain about 0.1 to about 500 micrograms of 
DNA. In some preferred embodiments, the vaccines contain about 1 to about 350 
micrograms of DNA. In some preferred embodiments, the vaccines contain about 25 
to about 250 micrograms of DNA. In some preferred embodiments, the vaccines 
contain about 100 micrograms DNA. 

The genetic vaccines according to the present invention are formulated 
according to the mode of administration to be used. One having ordinary skill in the 
art can readily formulate a genetic vaccine that comprises a genetic construct. In 
cases where intramuscular injection is the chosen mode of administration, an 
isotonic formulation is preferably used. Generally, additives for isotonicity can 
include sodium chloride, dextrose, mannitol, sorbitol and lactose. In some cases, 
isotonic solutions such as phosphate buffered saline are preferred. Stabilizers include 
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gelatin and albumin. In some embodiments, a vasoconstriction agent is added to the 
formulation. The pharmaceutical preparations according to the present invention are 
provided sterile and pyrogen free. 

Genetic constructs may optionally be formulated with one or more response 
5 enhancing agents such as: compounds which enhance transfection, i.e. transfecting 
agents; compounds which stimulate cell division, i.e. replication agents; compounds 
which stimulate immune cell migration to the site of administration, i.e. 
inflammatory agents; compounds which enhance an immune response, i.e. adjuvants 
or compounds having two or more of these activities. 

10 In one embodiment, bupivacaine, a well known and commercially available 

pharmaceutical compound, is administered prior to, simultaneously with or 
subsequent to the genetic construct. Bupivacaine and the genetic construct may be 
formulated in the same composition. Bupivacaine is particularly useful as a cell 
stimulating agent in view of its many properties and activities when administered to 

15 tissue. Bupivacaine promotes and facilitates the uptake of genetic material by the 
cell. As such, it is a transfecting agent. Administration of genetic constructs in 
conjunction with bupivacaine facilitates entry of the genetic constructs into cells. 
Bupivacaine is believed to disrupt or otherwise render the cell membrane more 
permeable. Cell division and replication is stimulated by bupivacaine. Accordingly, 

20 bupivacaine acts as a replicating agent. Administration of bupivacaine also irritates 
and damages the tissue. As such, it acts as an inflammatory agent which elicits 
migration and chemotaxis of immune cells to the site of administration. In addition 
to the cells normally present at the site of administration, the cells of the immune 
system which migrate to the site in response to the inflammatory agent can come 

25 into contact with the administered genetic material and the bupivacaine. 
Bupivacaine, acting as a transfection agent, is available to promote uptake of genetic 
material by such cells of the immune system as well. 

In addition to bupivacaine, mepivacaine, Hdocaine, procains, carbocaine, 
methyl bupivacaine, and other similarly acting compounds may be used as response 

30 enhancing agents. Such agents acts a cell stimulating agents which promote the 

* 

uptake of genetic constructs into the cell and stimulate cell replication as well as 
initiate an inflammatory response at the site of administration. 

Other contemplated response enhancing agents which may function as 
transfecting agents and/or replicating agents and/or inflammatory agents and which 
35 may be administered include lectins, growth factors, cytokines and lymphokines 
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such as alpha-interferon, gamma-interferon, platelet derived growth factor (PDGF), 
gCSF, gMCSF, TNF, epidermal growth factor (EGF), IL-1, IL-2, IL-4, IL-6, IL-8, 
IL-10 and IL-1 2 as well as collagenase, fibroblast growth factor, estrogen, 
dexamethasone, saponins, surface active agents such as immune-stimulating 
5 complexes (ISCOMS), Freund's incomplete adjuvant, LPS analog including 
monophosphoryl Lipid A (MPL), muramyl peptides, quinone analogs and vesicles 
such as squalene and squalane, hyaluronic acid and hyaluronidase may also be used 
administered in conjunction with the genetic construct In some embodiments, 
combinations of these agents are co-administered in conjunction with the genetic 

10 construct. In other embodiments, genes encoding these agents are included in the 
same or different genetic constructs) for co-expression of the agents. 

With respect to HTV Env, Gag, and Pol nucleotide sequences of the 
invention, particularly through genetic immunization, may be used as therapeutics or 
prophylatics in the treatment of AIDS or HIV infection. A therapeutically effective 

15 dose refers to that amount of the compound sufficient to result in amelioration of 
symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical 
procedures in cell cultures or experimental animals, e.g., for determining the LD50 
(the dose lethal to 50% of the population) and the ED50 (the dose therapeutically 

20 effective in 50% of the population). The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as the ratio LD50/ED50. 
Compounds which exhibit large therapeutic indices are preferred. The data obtained 
from cell culture assays and animal studies can be used in formulating a range of 
dosage for use in humans. The dosage of such compounds lies preferably within a 

25 range of circulating concentrations that includes the ED50 with little or no toxicity. 

The dosage may vary within this range depending upon the dosage form employed 
and the route of administration utilized. For any compound used in the method of 
the invention, the therapeutically effective dose can be estimated initially from cell 
culture assays. A dose may be formulated in animal models to achieve a circulating 

30 plasma concentration range that includes the (e.g., the concentration of the test 
compound which achieves a half-maximal inhibition of viral infection relative to the 
amount of the event in the absence of the test compound) as determined in cell 
culture. Such information can be used to more accurately determine useful doses in 
humans. Levels in plasma may be measured, for example, by high performance 

35 liquid chromatography (HPLC). 
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The compounds (for genetic immunization) of the invention may, further, 
serve the role of a prophylactic vaccine, wherein the host produces antibodies and/or 
CTL responses against HIV Env, Gag, and Pol which responses then preferably 
serve to neutralize HIV viruses by, for example, inhibiting further HIV infection. 
5 Administration of the compounds of the invention as a prophylactic vaccine, 
therefore, would comprise administering to a host a concentration of compounds 
effective in raising an immune response which is sufficient to elicit antibody and/or 
CTL reponses to HIV Env, Gag, and Pol and/or neutralize HIV, by, for example, 
inhibiting HIV ability to infect cells. The exact concentration will depend upon the 
10 specific compound to be administered, but may be determined by using standard 
techniques for assaying the development of an immune response which are well 
known to those of ordinary skill in the art. 

Env 

To improve the immune response to native gpl60 and to expose the core 

15 protein for optimal antigen presentation and recognition, we have analyzed the 
immune response to modified forms of the protein. The role of conserved N-linked 
glycosylation sites has been studied, and analogues of fusion intermediates have 
been developed. Expression vectors with deletions in the cleavage site (C), the 
fusion peptide (F), and the interspace 0) between the two heptad repeats were 

20 termed ACFI. Plasmid DNA vaccination has been a useful technology for the 
development and analysis of immunogens. This method of vaccination allows 
appropriate post-translational modification, proper intracellular trafficking, and 
antigen presentation. Direct injection of naked DNA either intramuscularly or 
intradermally in rodents induces immune responses, and the ability to easily modify 

25 plasmid expression vectors to express different forms of HIV envelope proteins 
enables rapid and systematic testing of vaccine immunogens. In this disclosure, we 
have analyzed the immune response to modified Env candidates expressed in 
plasmids with modified codons to improve gene expression. Both antibody and 
CTL responses were analyzed after injection of plasmid DNA into muscle. A 

30 modi fied gpl 40 DNA with improved ability to elicit antibody and CTL responses to 
HIV Env has now been identified that is envisioned as a prototype immunogen that 
can elicit broadly neutralizing antibody responses to HIV. 
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Exposing the Core Protein of Viral Membrane Fusion Proteins 

♦ 

Described herein are modified HIV envelope proteins that improve the 
immune response to native gpl60 and expose the core protein for optimal antigen 
presentation and recognition. Weissenhorn et al., Molecular Cell, 2, 605-616, 1998 
5 proposes a core protein as a model for a fusion intermediate of viral glycoproteins, 

where the glycoproteins are characterized by a central triple stranded coiled coil 
followed by a disulfide-bonded loop that reverses the chain direction and connects to 
an a helix packed antiparallel to the core helices, as, for example, in the case of 
Ebola Zaire GP2, Murine Moloney Leukemia virus (MuMoLv) 55-residue segment 

10 of the TM subunit (Mo-55), low-pH-treated influenza HA2, protease resistant core 
of HTV gp41, and SIV gp41 (Figure A). Thus, the strategy for improving the 
immune response by exposing the protease resistant core of HIV gp41 extends to 
other viral membrane fusion proteins that are characterized by a central triple 
stranded coiled coil followed by a disulfide-bonded loop that reverses the chain 

15 direction and connects to an a helix packed antiparallel to the core helices. 

The present approach involves a series of internal mutations designed to 
replace the cleavage site (C), the fusion domain (F), and the interspace (I) between 
the two heptad repeats all on a backbone of COOH-terminal truncations to expose 
the core protein of the viral membrane fusion protein Env, based on modified gpl40 

20 as a prototype immunogen. By replacement is meant deletions, insertions, and/or 
substitutions of amino acid residues. In one embodiment, deletions are meant (i.e., 
amino acids are deleted to create the ACFI mutations). 

In this embodiment, the AC mutation is intended to eliminate proteolysis by 
deleting the gpl20/gp41 cleavage site that links the envelope covalently to the 

25 ectodomain by 100% 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 
89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81,%, 80%, 79%, 78%, 77%, 76%, 
75%, 74%, 73,%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 
. 61%, 60%, 59%, 58%, 57%, 56%, 55%, 54%, 53%, 52%, 51%, 50%, 49%, 48%, 
47%, 46%, 45%, 44%, 43%, 42, 41%, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 

30 32%, 31%, 30%, 29%, 28%, 27%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 

18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 
2%, or 1%. 

In this embodiment, the AF mutation is intended to solubilize the molecule 
by deleting the fusion domain by 100%, 99%, 98%, 97%, 96%, 95%, 94%, 93%, 
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92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81,%, 80%, 79%, 
78%, 77%, 76%, 75%, 74%, 73,%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 
64%, 63%, 62%, 61%, 60%, 59%, 58%, 57%, 56%, 55%, 54%, 53%, 52%, 51%, 
50%, 49%, 48%, 47%, 46%, 45%, 44%, 43%, 42, 41 %, 40%, 39%, 38%, 37%, 36%, 
5 35%, 34%, 33%, 32%, 31%, 30%, 29%, 28%, 27%, 26%, 25%, 24%, 23%, 22%, 

21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%,11%, 10%, 9%, 8%, 7%, 
6%, 5%, 4%, 3 %, 2%, or 1 %. 

In this embodiment, the Al mutation is intended to stabilize oligomer 
formation by deleting the interspace between the two heptad repeats by 100%, 99%, 

10 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 

84%, 83%, 82%, 81,%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73,%, 72%, 71%, 
70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, 60%, 59%, 58%, 57%, 
56%, 55%, 54%, 53%, 52%, 51%, 50%, 49%, 48%, 47%, 46%, 45%, 44%, 43%, 42, 
41%, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31%, 30%, 29%, 28%, 

15 27%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 

13%, 12%,1 1%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1%. 

In this embodiment, the COOH-terminal truncation is intended to reduce 
toxicity by deleting the cytoplasmic domain by 100%, 99%, 98%, 97%, 96%, 95%, 
94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81,%, 

20 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73,%, 72%, 71%, 70%, 69%, 68%, 67%, 

66%, 65%, 64%, 63%, 62%, 61%, 60%, 59%, 58%, 57%, 56%, 55%, 54%, 53%, 
52%, 51%, 50%, 49%, 48%, 47%, 46%, 45%, 44%, 43%, 42, 41%, 40%, 39%, 38%, 
37%, 36%, 35%, 34%, 33%, 32%, 31%, 30%, 29%, 28%, 27%, 26%, 25%, 24%, 
23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%,11%, 10%, 9%, 

25 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1 %. 

In this embodiment, optionally, the COOH-terminal truncation is extended 
so as to solubilize the molecule by deleting the transmembrane domain by 100% 
99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 
85%, 84%, 83%, 82%, 81,%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73,%, 72%, 
30 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, 60%, 59%, 58%, 
57%, 56%, 55%, 54%, 53%, 52%, 51%, 50%, 49%, 48%, 47%, 46%, 45%, 44%, 
43%, 42, 41%, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31%, 30%, 29%, 
28%, 27%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 
14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1%. 
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Amino acid substitutions may encompass those of a conserved or non- 
conserved nature. Presumably, a non-conserved substitution of a domain would act 
like a deletion of the domain. Conserved amino acid substitutions constitute 
switching one or more amino acids with amino acids of similar charge, size, and/or 
hydrophobicity characteristics. Non-conserved amino acid substitutions constitute 
switching one or more amino acids with amino acids of dissimilar charge, size, 
and/or hydrophobicity characteristics. The families of amino acids include the basic 
amino acids (lysine, arginine, histidine); the acidic amino acids (aspartic acid, 
glutamic acid); the non-polar amino acids (alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan); the uncharged polar amino acids 
(glycine, asparagine, glutamine, cysteine, serine, threonine, tyrosine); and the 
aromatic amino acids (phenylalanine, tryptophan, and tyrosine). One or more 
substitutions may be introduced to achieve the AC mutation intended to eliminate 
proteolysis by acting like a deletion of the gpl20/gp41 cleavage site to link the 
envelope covalently to the ectodomain, the AF mutation intended to solubilize the 
molecule by acting like a deletion of the fusion domain, the AI mutation intended to 
stabilize oligomer formation by acting like a deletion of the interspace between the 
two heptad repeats, the COOH-terminal truncation intended to reduce toxicity by 
acting like a deletion of the cytoplasmic domain, and, optionally, the COOH- 
terminal truncation extended so as to solubilize the molecule by acting like a 
deletion of the transmembrane domain. 

Amino acid insertions may constitute single amino acid residues or stretches 
of residues. The insertions may be made at the carboxy or amino terminal end of a 
domain, as well as at a position internal to the domain. Such insertions will 
generally range from 2 to 15 amino acids in length. One or more insertions may be 
introduced to achieve the AC mutation intended to eliminate proteolysis by acting 
like a deletion of the gpl20/gp41 cleavage site to link the envelope covalently to the 
ectodomain, the AF mutation intended to solubilize the molecule by acting like a 
deletion of the fusion domain, the Al mutation intended to stabilize oligomer 
formation by acting like a deletion the interspace between the two heptad repeats, 
the COOH-terminal truncation intended to reduce toxicity by acting like a deletion 
of the cytoplasmic domain, and, optionally, the COOH-terminal truncation extended 
so as to solubilize the molecule by acting like a deletion of the transmembrane 
domain. 
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The nucleic acids of the present invention are optionally DNA, RNA, or 
mRNA. Most typically, the nucleic acids are provided by recombinantly making a 
DNA, which is expressed in a cell as RNA and/or as mRNA. Given the strategy for 
making the nucleic acids of the present invention, one of skill can construct a variety 
5 of clones containing functionally equivalent nucleic acids. Cloning methodologies to 
accomplish these ends, and sequencing methods to verify the sequence of nucleic 
acids are well known in the art. Examples of appropriate cloning and sequencing 
techniques, and instructions sufficient to direct persons of skill through many 
cloning exercises are found in Berger and Kimmel, Guide to Molecular Cloning 
10 Techniques, Methods in Enzymology, volume 152, Academic Press, Inc., San 
Diego, Calif. (Berger); Sambrook, et al. (1989) Molecular Cloning~A Laboratory 
Manual (2nd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Press, N.Y., (Sambrook); and Current Protocols in Molecular Biology, F. M. 
Ausubel, et al, eds., Current Protocols, a joint venture between Greene Publishing 

15 Associates, Inc. and John Wiley & Sons, Inc., (1994 Supplement) (Ausubel). 

Product information from manufacturers of biological reagents and experimental 
equipment also provide information useful in known biological methods. Such 
manufacturers include the SIGMA chemical company (Saint Louis, Mo.), R&D 
systems (Minneapolis, Minn.), Pharmacia LKB Biotechnology (Piscataway, N.J.), 

20 CLONTECH Laboratories, Inc. (Palo Alto, Calif), Chem Genes Corp., Aldrich 
Chemical Company (Milwaukee, Wis.), Glen Research, Inc., GIBCO BRL Life 
Technologies, Inc. (Gaithersberg, Md.), Fluka Chemica-Biochemika Analytika 
(Fluka Chemie AG, Buchs, Switzerland), Invitrogen, San Diego, Calif, and Applied 
Biosystems (Foster City, Calif), as well as many other commercial sources known 

25 to one of skill. 

The nucleic acid compositions of this invention, whether RNA, cDNA, 
mRNA, genomic DNA, or a hybrid of the various combinations, are isolated from 
biological sources or synthesized in vitro. The nucleic acids of the present invention 
are present in transformed or transfected whole cells, in transformed or transfected 
30 cell lysates, or in a partially purified or substantially pure form. 

In vitro amplification techniques suitable for amplifying sequences to 

< 

provide a nucleic acid or for subsequent analysis, sequencing or subcloning are 
known. Examples of techniques sufficient to direct persons of skill through such in 
vitro amplification methods, including the polymerase chain reaction (PCR) the 
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ligase chain reaction (LCR), Qp-replicase amplification and other RNA polymerase 
mediated techniques (e.g., NASBA) are found in Berger, Sambrook, and Ausubel, as 
well as Mullis, et al, (1987) U.S. Pat. No. 4,683,202; PCR Protocols A Guide to 
Methods and Applications (Innis, et al. eds) Academic Press Inc. San Diego, Calif. 
5 (1990) (Innis); Arnheim & Levinson (Oct. t, 1990) C&EN 36-47; The Journal Of 

NIH Research (1991) 3:81-94; Kwoh, et al, Proc. Natl. Acad. Sci. USA, 86:1173 
(1989); Guatelli, et al., Proc. Natl. Acad. Sci. USA, 87:1874 (1990); Lomell, et al., J. 
Clin. Chem., 35:1826 (1989); Landegren, et al., Science, 241:1077-1080 (1988); 
Van Brunt, Biotechnology, 8:291-294 (1990); Wu and Wallace, Gene, 4:560 (1989); 
10 Barringer, et aL, Gene, 89:117 (1990), and Sooknanan and Malek, Biotechnology, 
13:563-564 (1995). Improved methods of cloning in vitro amplified nucleic acids 
are described in Wallace, et al., U.S. Pat No. 5,426,039. Improved methods of 
amplifying large nucleic acids (up to 40 kb) are summarized in Cheng, et al., Nature, 
369:684-685 (1994) and the references therein. One of skill will appreciate that 
15 essentially any RNA can be converted into a double stranded DNA suitable for 
restriction digestion, PCR expansion and sequencing using reverse transcriptase and 
a polymerase: See, Ausubel, Sambrook, Innis, and Berger, all supra. 

One of skill will recognize many ways of generating alterations in a given 
nucleic acid construct. Such well-known methods include site-directed mutagenesis, 
20 PCR amplification using degenerate oligonucleotides, exposure of cells containing 

the nucleic acid to mutagenic agents or radiation, chemical synthesis of a desired 
oligonucleotide (e.g., in conjunction with ligation and/or cloning to generate large 
nucleic acids) and other well-known techniques. See, Giliman and Smith, Gene 
8:81-97 (1979), Roberts, et al., Nature, 328:731-734 (1987) and Sambrook, Innis, 
25 Ausubel, Berger, and Mullis (all supra). 

Most modifications to nucleic acids are evaluated by routine screening 
techniques in suitable assays for the desired characteristic. For instance, changes in 
the immunological character of encoded polypeptides can be detected by an 
appropriate immunological assay. For instance, changes in the cellular 
30 immunological character of the polypeptide can be detected by an appropriate 
antibody or CTL assay. Modifications of other properties such as nucleic acid 
hybridization to a complementary nucleic acid, redox or thermal stability of encoded 
proteins, hydrophobicity, susceptibility to proteolysis, or the tendency to aggregate 
are all assayed according to standard techniques. 
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A wide variety of formats and labels are available and appropriate for 
detection of polypeptide sequences. These include analytic biochemical methods 
such as spectrophotometry, radiography, electrophoresis, capillary electrophoresis, 
high performance liquid chromatography (HPLC), thin layer chromatography 
(TLC), hyperdiffusion chromatography, and the like, and various immunological 
methods such as fluid or gel precipitin reactions, immunodiffusion (single or 
double), Immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked 
immunosorbent assays (ELISAs), western blot assays, immunofluorescent assays, 
and the like. Several commercially available ELISA assays for the detection of 
retroviral components, including Env domains, are available, allowing one of skill to 
detect Env in biological samples. 

Similarly, the detection of the nucleic acids of the present invention proceeds 
by well known methods such as Southern analysis, northern analysis, gel 
electrophoresis, PCR, radiolabeling and scintillation counting, and affinity 
chromatography. Many assay formats are appropriate, including those reviewed in 
Tijssen (1993) Laboratory Techniques in biochemistry and molecular biology- 
hybridization with nucleic acid probes parts I and It, Elsevier, New York and Choo 
(ed) (1994) Methods In Molecular Biology Volume 33--In Situ Hybridization 
Protocols, Humana Press Inc., New Jersey (see also, other books in the Methods in 
Molecular Biology series); see especially, Chapter 21 of Choo (id.) "Detection of 
Virus Nucleic Acids by Radioactive and Nonisotopic in Situ Hybridization". Finally, 
PCR is also routinely used to detect nucleic acids in biological samples (see, Innis, 
supra, for a general description of PCR techniques). 

In one preferred embodiment, antibodies are used to detect polypeptide 
sequences. Methods of producing polyclonal and monoclonal antibodies are known 
to those of skill in the art, and many anti-HIV antibodies are available. See, e.g., 
Coligan (1991) Current Protocols in Immunology, Wiley/Greene, NY; and Harlow 
and Lane (1989) Antibodies: A Laboratory Manual, Cold Spring Harbor Press, NY; 
Stites, et al. (eds.) Basic and Clinical Immunology (4th ed.), Lange Medical 
Publications, Los Altos, Calif., and references cited therein; Goding (1986) 
Monoclonal Antibodies: Principles and Practice (2d ed.), Academic Press, New 
York, N.Y.; and Kohler and Milstein, Nature, 256:495-497 (1975). Other suitable 
techniques for antibody preparation include selection of libraries of recombinant 
antibodies in phage or similar vectors. See, Huse, et al., Science, 246:1275-1281 
(1989); and Ward, et al., Nature, 341:544-546 (1989). Specific monoclonal and 
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polyclonal antibodies and antisera will usually bind with a KD of at least about 0.1 
mM, more usually at least about 1 fiM, preferably at least about 0.1 jiM or better, 
and most typically and preferably, 0.01 ^iM or better. 
Development of HIV Env Vectors 

To develop Env glycoprotein variants that might effectively induce humoral 
and cellular immunity, a series ofplasmid expression vectors were generated (Fig. 
I). HIV Bnv is encoded by nucleic acid sequences that contain RNA structures that 
limit gene expression (Andre S et al, 1998, J Virol, 72:1497-503; Qiu JT et al., 1999, 
J Virol, 73:9145-52; Rana TM et ah, 1999, Arch Biochem Biophys, 365:175-85; 
Roebuck KA, Saifuddin M, 1999, Gene Expr, 8:67-84; Romano G, et al., 1999, J 
Cell Biochem, 75:357-68; Schneider R et al., 1997, J Virol, 71:4892-903). These 
vectors were therefore synthesized using codons found in human genes that allow 
these structures to be eliminated without affecting the amino acid sequence. Full- 
length HIV Env (gpl60) was highly expressed in the absence of HTV accessory 
proteins at levels >10-fold higher than Rev-dependent viral gp 160 in transfected 
293 cell by Western blot analysis (Fig. HA), and the relevant mutant proteins were 
detected at the expected apparent molecular weights (Fig. UC-D). As might be 
anticipated, gpl60 expressed from the synthetic gene was not efficiently processed 
in transfected 293 cells, presumably because over-expression of gpl60 saturates the 
cellular proteases responsible for cleavage. (Binley JM et al., 2000, J. Virol, 74:627- 
643.) Synthetic HIV gpl60 induced toxicity in transfected cells, with cell rounding 
and detachment evident within 48 hours (Fig. IIIA vs B). This cytotoxicity was 
reduced by elimination of the COOH-terminal cytoplasmic domain. Env protein 
that terminated at amino acid 752 (gpl50) was less cytotoxic than gpl60, while the 
shorter proteins (gpl45 and gpl40) produced little or no effect (Fig. IUC, D, and E, 
respectively). 

To alter Env immunogenicity, two different approaches were explored. First 
the effects of glycosylation on cellular and humoral immunity were evaluated by 
analysis of mutants in which conserved N-linked glycosylation sites were eliminated 
by site-directed mutagenesis. Two sets of mutations were introduced into both 
gp160 and gpl50 (Fig. I). The first set included eleven potential sites (Aglyl 1), and 
the second set included an additional 6 sites downstream (Aglyl 7). Expression 
studies showed that the glycosylation mutants were efficiently expressed, and the 
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glycosylation mutant protein was appropriately reduced in size compared to wild 
type gpl60 or gpl50, consistent with reduced N-linked glycosylation (Fig IIC). 

The second approach involved a series of internal deletions designed to 
stabilize and expose functional domains of the protein that might be present in an 
extended helical structure prior to the formation of the six-member coiled-coil 
structure in the hairpin intermediate (Chan DC et al., 1997, Cell, 89:263-73; 
Weissenhom W et al., 1997, Nature, 387:426-30). To generate this putative pre- 
hairpin structure, the cleavage site was removed to prevent the proteolytic 
processing of the envelope and stabilize the protein by linking it covalently to the 
gp41 extracellular and/or transmembrane domain. To reduce toxicity and enhance 
stability, the fusion peptide domain was deleted. The heptad repeats in the envelope 
protein are important tertiary structure domains involved in the ability of the 
envelope protein to form trimers (Lu M, Kim PS, 1997, J Biomol Struct Dyn, 
J 5:465-71). The sequence between the heptad repeats was removed to stabilize the 
formation of trimers and eliminate formation of the hairpin intermediate. These 
ACFI deletions were introduced into full-length gpl60 and COOH-terminal 
truncation mutants. Though cells trasfected with vectors encoding gpUOACFI, 
gpl45ACFI and gpl60 readily expressed these proteins (Fig. HD), only gpl40ACFI, 
which lacks the transmembrane domain, was readily detected in the supernatant 
(Fig. IV), indicating that it can give rise to soluble antigen. 
Immunogenicity of Env Mutants After DNA Vaccination 

The ability of these Env proteins to elicit an immune response was 
determined in mice by injection with these plasmid DNA expression vectors. 
Antibody responses were monitored by the ability of antisera from injected mice to 
immunoprecipitate wild type gp!60 from cell lysates by SDS-PAGE and Western 
blotting (Fig. V). In some cases, antibody reactivity was also confirmed by 
immunofluorescence. To quantitate the antibody response, immunoprecipitation 
followed by Western blotting with different dilutions of immunized mouse sera was 
tested. The intensity of the gpl60 band was determined by densitometry and 
standardized relative to a positive control sera used to normalize data between 
experiments. The approximately linear dose response of gpl60 intensity with serum 
dilution allowed quantification of the anti-gpl60 antibody response in mouse sera. 
Data from immunized mice showed that none of the wild type Env proteins, neither 
the gp!60, gplSO, gp!45, nor gpl40 COOH-terminal truncations generated 
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consistently high antibody responses (Fig. VA). Of these proteins, gpl40 was 
somewhat more effective than gpl60 in generating antibody responses but remained 
low and inconsistent. Immunization with vectors designed to express glycosylation 
deficient envelope proteins also did not improve the humoral response. In contrast, 
the ACFI mutants in the Env truncation vectors substantially increased anti-gpl60 
antibody response (Fig. VA; gpl45, gpl40, and gpl28). The longer ACFI Env 
proteins (gpl60 and gpl50) did not show comparable enhancement of this response. 
The gp140 (ACFI) provided more consistent and a greater increase in antibody 
response than gpl28ACFI (Fig VB). In all cases, these vectors that encoded 
gpHOACFI, but not wild type gpl40, induced antibodies that were reactive to native 
gpl60(Fig.VC). 

To determine whether these modifications of Env adversely affected CTL 
responses, spleen cells from immunized mice were tested for their ability to lyse 
relevant target cells. All mice immunized with codon-altered Env vectors, including 
COOH-terminal deletion mutations (Fig. VIA) or glycosylation mutants (Fig. VIB), 
elicited strong CTL responses directed to cell lines pulsed with HIV Env peptides. 
These findings were also confirmed using stably transfected cells that expressed 
Env. Importantly, gpl40 (ACFI), which elicited increased antibody responses 
relative to the comparable wild typ Env, readily induced CTL responses to native 
Env, as did other ACFI mutants (Fig. VIC). Addition of anti-CD8 antibody inhibited 
cytolytic activity, as did depletion using magnetic beads coupled with anti-CD8 
antibody, thus confirming a cytotoxic T cell response to these immunogens after 
genetic immunization (Fig. VID). This response was detectable for at least six 
months after immunization. 
Glycosylation and ACFI HIV Env Mutants 

To develop DNA vaccine candidates for HIV, we developed a series of 
synthetic genes designed to express HIV Env mutants in human cells. In the 
absence of HIV regulatory proteins, these codon-altered envelope protein genes 
expressed well in human cells. Like other DNA vaccines, immunization with these 
vectors elicited strong CTL responses in mice, and antibody responses were not 
robust in mice immunized with wild type Env expression vectors. Mutations in 
highly conserved N-linked glycosylation sites did not significantly alter humoral or 
cellular immune response to native Env. In contrast, a mutant Env with deletions in 
the cleavage site, fusion domain, and a region between the heptad repeats elicited a 
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more potent humoral immune response and retained its ability to stimulate Env- 
specific CTL. 

Recent reports suggest that gpl60 forms trimers in vivo and the domain 
required for trimer formation resides in the ectodomain of the gp4l (Yang X et al., 
2000, J Virol, 74:5716-25). Such trimeric forms of HIV envelope protein are likely 
to present different epitopes to the immune system compared to monomeric gpl20. 
In addition to the linear epitopes in the envelope, this trimeric structure is likely to 
expose conformational epitopes important for B cell triggering of a relevant, 
antibody response. In this regard, g P 140 (ACFI), which induced the greatest 
antibody response, is released in a soluble form (Fig. IVB). In contrast, wild type 
Env did not elicit high titer antibody responses. The toxicity of Env in mammalian 
cells has been seen and could limit both the amount and duration of envelope protein 
expression /'// vivo that would affect immunogenicity. The envelope is also heavily 
glycosylated, and removal of partial or complete gpl20 glycosylation sites has 
resulted in higher titers of strain-specific neutralizing antibody responses to mutant 
SIVs in monkeys (Binley JM et al., 1998, AIDS Res Hum Retroviruses, 14 :191-8; 
Reitter JN et al., 1998, Nat Med, 4:679-84). Though it seemed reasonable that 
deglycosylation would reveal epitopes otherwise masked in the native protein, we 
did not observe enhanced immune reactivity by DNA vaccination using different 
glycosylation site mutants, both in gp 160 and gpl50. This difference with the 
previous study is likely due to the fact that DNA vaccination rather than viral 
infection was utilized for immunization. Though glycosylation mutants are unlikely 
to prove helpful with this former method of immunization, we envision that 
modification of glycosylation sites will be effective with other vectors or adjuvants. 

HIV-1 Env is proteolytically cleaved by a cellular convertase into gpl20 and 
gp41 (Eckert DM et al., 1999, Cell, 99:103-15). The gp41 subunit is composed of 
cytoplasmic, transmembrane, and ectodomain segments. The role of the ectodomain 
of the envelope in membrane fusion, particularly its hydrophobic glycine-rich fusion 
peptide, is well established. Two regions with heptad coiled-coil repeats in the 
ectodomain of gp41 are involved in viral fusion (Eckert DM et al., 1999, Cell, 
99:103-15;- Weissenhorn W et al., 1996, EMBO J, 15:1507-14). Upon fusion, these 
two alpha helices, connected via a disulfide-stabilized loop (Gallaher WR e t al., 
1989, AIDS Res Hum Retroviruses, 5:431-40; Kent KA, Robinson J, 1996, AIDS, 
10 Suppl A:S107-S1 14; Sattentau QJ et al., 1995, Virology, 206:713-7), presumably 
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undergo a transient conformational change to a fusion active state. These changes 
allow the formation of a six-member helical hairpin intermediate structure that 
presumably exposes the fusion peptide at the NH 2 -terminus of gp41, allowing fusion 
to the target cell membrane (Binley J, Moore JP,[published erratum appears in 
5 Nature 1997 Sep 11;389(6647):131], 1997, Nature, 387 :346-8; LaCasse R et al, 
1999, Science, 283:357-62). The ACF1 mutation was intended to eliminate cleavage 
of gpl40, remove the unstable hydrophobic region and stabilize oligomer formation. 
Though detailed structural data is not yet available on this protein, these mutations 
apparently stabilize Env in a conformation that elicits both humoral and cellular 

10 immune responses. For example, the neutralizing epitope in the ectodomain of gp41 
(Muster T et al., 1994, J Virol, 68:4031-4) is present in the series of deletions and 
truncations of the envelope and gpl40 (ACFI) is reactive with the 2F5 neutralizing 
monoclonal antibody that binds to this epitope. Importantly, these immunogens also 
induced CTL responses to Env, Though gpl28 (ACFI) induced slightly more potent 

1 5 CTL activity, gpl40 (ACFI) was better able to elicit such responses, both to peptide- 
pulsed cells and stably transduced target cells. Thus the enhanced humoral immune 
response introduced by this vaccine candidate did not appear to diminish the CTL 
response. Taken together, these results indicate that gpl40ACFl serves as an 
improved immunogen that can more effectively elicit an antibody response against 

20 the envelope by DNA vaccination while preserving its ability to induce a CTL 
response. 

Gag and Pol 

In this disclosure, we have prepared synthetic HIV-1 B clade Gag and Pol 
expression vectors that are based on human (h) codon usage. These vectors encode 

25 hGag-Pol and its derivatives, hGag, hPol and an hGag-Pol fusion protein. The 
synthetic Gag-Pol genes show little nucleotide homology to HIV-1 but are the same 
in protein sequence. The modified Gag-Pol genes were subcloned into a eukaryotic 
plasmid expression vector for expression and DNA immunization studies. Synthetic 
Gag-Pol genes allowed high level Rev-independent expression of HIV-1 Gag-Pol 

30 precursor proteins in human and mouse cell lines and induced significant cellular 
and humoral responses in mice. The Gag-Pol fusion protein induced the broadest 
responses to Gag and Pol determinants and thus is envisioned as a prototype 
immunogen that maximizes epitope presentation. 

* 
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Eliminating The Frame Shift Site To Create Viral Polyproteins 

Described herein are HIV Gag-Pol fusion proteins encoded by a continuous 
open reading frame so to improve the immune response to native Gag and Pol. In 
some viruses, translational frame shifting is exploited during protein synthesis. 
5 Specific sequences in the RNA are required for the frame shifting. The viral RNA 

sequences cause ribosomal slippage so that viral proteins are produced in non- 
equivalent ratios. For example, during translation in HIV, the ribosomes shift 
reading frames to synthesize Gag precursor protein and the gag-pol fusion protein in 
a 20:1 ratio. The strategy here is to maximize epitope presentation by transcribing 
10 an immunogen from a continuous open reading frame by eliminating the frame shift 

site. Thus, the strategy for improving the immune response by the use of a HIV 
Gag-Pol fusion protein encoded by a continuous open reading frame extends to other 
viral proteins that are produced in non-equivalent ratios by virtue of translational 
frame shifting. 

15 The present invention involves HIV Gag-Pol fusion proteins encoded by a 

single continuous open reading frame due to mutation of the frame shift site. The 
frame shift site is mutated by deletions, insertions, and/or substitutions of 
nucleotides to create a single continuous open reading frame. In one embodiment, 
deletions are meant (i.e., nucleotides are deleted to create the same open reading 

20 frame). 

The frame shift site is a mutated frame shift to create a single continuous 
open reading frame. For example, a set of similar retroviral gag-pol frame shift sites 
are optionally made for a given fusion protein, for example, by synthesizing 
different gag-pol frame shift regions and cloning the sequences appropriately, or by 

25 site-directed mutagenesis of a given frame shift clone. The efficacy of the frame 

shift sites are assessed by measuring the production of the fusion protein. The 
sequence that shows the highest level of expression is a "optimized" frame shift 
mutation for the set assessed. Alternatively, where a particular level of expression is 
desired, a frame shift site from a particular set of possible frame shift sites which is 

30 closest to the desired activity level is considered to be "optimized." 

Although a full length Gag sequence is preferred for use in the fusion protein 
of the present invention, Gag is optionally deleted of subsequences without negating 
a polyepitope response. For example, regions of the matrix protein (pi 7), regions of 
the capsid protein (p24), regions of p2, regions of the nucleocapsid protein (p7), 

35 regions of pi, and regions of p6 can be deleted while preserving the polyepitope 
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response. Alternatively, regions of the matrix protein (pi 7), regions of the capsid 
protein (p24), regions of p2, regions of the nucleocapsid protein (p7), regions of pi, 
and regions of p6 can be substituted while preserving the polyepitope response . 
Alternatively, regions of the matrix protein (pi 7), regions of the capsid protein 
(p24), regions of p2, regions of the nucleocapsid protein (p7), regions of pi, and 
regions of p6 can be interrupted by insertions while preserving the polyepitope 
response. Optionally, regions of the matrix protein (pi 7), regions of the capsid 
protein (p24), regions of p2, regions of the nucleocapsid protein (p7), regions of pi, 
and regions of p6 can be mutated to inactivate these proteins. 

Likewise, a fiill length Pol sequence is preferred for use in the fusion protein 
of the present invention, and Pol is optionally deleted of subsequences without 
negating a polyepitope response. For example, regions of the protease protein, 
regions of the reverse transcriptase protein, and regions of the integrase protein can 
be deleted while preserving the polyepitope response. Alternatively, regions of the 
protease protein, regions of the reverse transcriptase protein, and regions of the 
integrase protein can be substituted while preserving the polyepitope response. 
Alternatively, regions of the protease protein, regions of the reverse transcriptase 
protein, and regions of the integrase protein can be interrupted by insertions while 
preserving the polyepitope response. Optionally, regions of the protease protein, 
regions of the reverse transcriptase protein, and regions of the integrase protein can 
be mutated to inactivate these enzymes. 

In other embodiments, the Gag-Pol fusion proteins are co-expressed with 
other proteins, either as fusion proteins or separately, preferably with Env sequence 
proteins, such as the modified HIV Env ACFI proteins described herein. 

In another aspect, the invention involves chimeric nucleic acid molecules. 
The chimeric nucleic acid molecules of the invention typically have a retroviral gag 
nucleic acid sequence and a retroviral pol nucleic acid sequence in the same open 
reading frame due to mutation of the frame shift site. The continuous open reading 
frame encodes a fusion protein such as those described above. 

The chimeric nucleic acids of the present invention are optionally DNA, 
RNA, or mRNA. Most typically, the chimeric nucleic acids are provided by 
recombinantly making a DNA, which is expressed in a cell as RNA and/or as 
mRNA. Given the strategy for making the chimeric nucleic acids of the present 
invention, one of skill can construct a variety of clones containing functionally 
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equivalent nucleic acids. Cloning methodologies to accomplish these ends, and 
sequencing methods to verify the sequence of nucleic acids are well known in the 
art. Examples of appropriate cloning and sequencing techniques, and instructions 
sufficient to direct persons of skill through many cloning exercises are found in 
Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in 
Enzymology, volume 152, Academic Press, Inc., San Diego, Calif. (Berger); 
Sambrook, et al. (1989) Molecular Cloning-A Laboratory Manual (2nd ed.) Vol. 1- 
3, Cold Spring Harbor Laboratory, Cold Spring Harbor Press, N.Y., (Sambrook); 
and Current Protocols in Molecular Biology, F. M. Ausubel, et ah, eds. 3 Current 
Protocols, a joint venture between Greene Publishing Associates, Inc. and John 
Wiley & Sons, Inc., (1994 Supplement) (Ausubel). Product information from 
manufacturers of biological reagents and experimental equipment also provide 
information useful in known biological methods. Such manufacturers include the 
SIGMA chemical company (Saint Louis, Mo.), R&D systems (Minneapolis, Minn.), 
Pharmacia LKB Biotechnology (Piscataway, N.J.), CLONTECH Laboratories, Inc. 
(Palo Alto, Calif.), Chem Genes Corp., Aldrich Chemical Company (Milwaukee, 
Wis.), Glen Research, Inc., GIBCO BRL Life Technologies, Inc. (Gaithersberg, 
Md.), Fluka Chemica-Biochemika Analytika (Fluka Chemie AG, Buchs, 
Switzerland), Invitrogen, San Diego, Calif., and Applied Biosystems (Foster City, 
Calif.), as well as many other commercial sources known to one of skill. 

The chimeric nucleic acid compositions of this invention, whether RNA, 
cDNA, mRNA, genomic DNA, or a hybrid of the various combinations, are isolated 
from biological sources or synthesized in vitro. The chimeric nucleic acids of the 
present invention are present in transformed or transfected whole cells, in 
transformed or transfected cell lysates, or in a partially purified or substantially pure 
form. 

In vitro amplification techniques suitable for amplifying sequences to 
provide a nucleic acid or for subsequent analysis, sequencing or subcloning are 
known. Examples of techniques sufficient to direct persons of skill through such in 
vitro amplification methods, including the polymerase chain reaction (PCR) the 
ligase chain reaction (LCR), Qp-replicase amplification and other RNA polymerase 
mediated techniques (e.g., NASBA) are found in Berger, Sambrook, and Ausubel, as 
well as Mullis, et al., (1987) U.S. Pat. No. 4,683,202; PCR Protocols A Guide to 
Methods and Applications (Innis, et al. eds) Academic Press Inc. San Diego, Calif. 
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(1990) (Innis); Amheirn & Levinson (Oct. 1, 1990) C&EN 36-47; The Journal Of 
N1H Research (1991) 3:81-94; Kwoh, et al., Proc. Natl. Acad. Sci. USA, 86:1173 
(1989); Guatelli, et al., Proc. Natl. Acad. Sci. USA, 87:1874 (1990); Lomell, et al., J. 
Clin. Chem., 35:1826 (1989); Landegren, et al., Science, 241:1077-1080 (1988); 
5 Van Brunt, Biotechnology, 8:291-294 (1990); Wu and Wallace, Gene, 4:560 (1989); 
Barringer, et al., Gene, 89:117 (1990), and Sooknanan and Malek, Biotechnology, 
13:563-564 (1995). Improved methods of cloning in vitro amplified nucleic acids 
are described in Wallace, et al., U.S. Pat. No. 5,426,039. Improved methods of 
amplifying large nucleic acids (up to 40 kb) are summarized in Cheng, et al., Nature, 
10 369:684-685 (1994) and the references therein. One of skill will appreciate that 
essentially any RNA can be converted into a double stranded DNA suitable for 
restriction digestion, PCR expansion and sequencing using reverse transcriptase and 
a polymerase. See, Ausubel, Sambrook, Innis, and Berger, all supra. 

One of skill will recognize many ways of generating alterations in a given 
15 nucleic acid construct. Such well-known methods include site-directed mutagenesis, 
PCR amplification using degenerate oligonucleotides, exposure of cells containing 
the nucleic acid to mutagenic agents or radiation, chemical synthesis of a desired 
oligonucleotide (e.g., in conjunction with ligation and/or cloning to generate large 
nucleic acids) and other well-known techniques. See, Giliman and Smith, Gene 
20 8:81-97 (1979), Roberts, et al., Nature, 328:731-734 (1987) and Sambrook, Innis, 

Ausubel, Berger, and Mullis (all supra). 

Most modifications to nucleic acids are evaluated by routine screening 
techniques in suitable assays for the desired characteristic. For instance, changes in 
the immunological character of encoded polypeptides can be detected by an 
25 appropriate immunological assay. For instance, changes in the cellular 

immunological character of the polypeptide can be detected by an appropriate 
antibody or CTL assay. Modifications of other properties such as nucleic acid 
hybridization to a complementary nucleic acid, redox or thermal stability of encoded 
proteins, hydrophobicity, susceptibility to proteolysis, or the tendency to aggregate 
30 are all assayed according to standard techniques. 

A wide variety of formats and labels are available and appropriate for 
detection of fusion protein sequences. These include analytic biochemical methods 
such as spectrophotometry, radiography, electrophoresis, capillary electrophoresis, 
high performance liquid chromatography (HPLC), thin layer chromatography 
35 (TLC), hyperdiffusion chromatography, and the like, and various immunological 
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methods such as fluid or gel precipitin reactions, immunodiffusion (single or 
double), Immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked 
immunosorbent assays (ELISAs), western blot assays, immunofluorescent assays, 
and the like. Several commercially available ELISA assays for the detection of 
5 retroviral components, including Env domains, are available, allowing one of skill to 
detect Env in biological samples. 

Similarly, the detection of the chimeric nucleic acids of the present invention 
proceeds by well known methods such as Southern analysis, northern analysis, gel 
electrophoresis, PCR, radiolabeling and scintillation counting, and affinity 

10 chromatography. Many assay formats are appropriate, including those reviewed in 
Tijssen (1993) Laboratory Techniques in biochemistry and molecular biology- 
hybridization with nucleic acid probes parts I and II, Elsevier, New York and Choo 
(ed) (1994) Methods In Molecular Biology Volume 33~In Situ Hybridization 
Protocols, Humana Press Inc., New Jersey (see also, other books in the Methods in 

15 Molecular Biology series); see especially, Chapter 21 of Choo (id.) "Detection of 
Virus Nucleic Acids by Radioactive and Nonisotopic in Situ Hybridization". Finally, 
PCR is also routinely used to detect nucleic acids in biological samples (see, Innis, 
supra, for a general description of PCR techniques). 

In one preferred embodiment, antibodies' are used to detect polypeptide 

20 . sequences. Methods of.producing polyclonal and monoclonal antibodies are known 
to those of skill in the art, and many anti-HIV antibodies are available. See, e.g., 
Coligan (1991) Current Protocols in Immunology, Wiley/Greene, NY; and Harlow 
and Lane (1989) Antibodies: A Laboratory Manual, Cold Spring Harbor Press, NY; 
Stites, et al. (eds.) Basic and Clinical Immunology (4th ed.), Lange Medical 

25 Publications, Los Altos, Calif., and references cited therein; Goding (1986) 
Monoclonal Antibodies: Principles and Practice (2d ed.), Academic Press, New 
York, N.Y.; and Kohler and Milstein, Nature, 256:495-497 (1975). Other suitable 
techniques for antibody preparation include selection of libraries of recombinant 
antibodies in phage or similar vectors. See, Huse, et al., Science, 246:1275-1281 

30 (1989); and Ward, et al., Nature, 341:544-546 (1989). Specific monoclonal and 
polyclonal antibodies and antisera will usually bind with a KD of at least about 0.1 
mM, more usually at least about 1 ^M, preferably at least about 0.1 |iM or better, 
and most typically and preferably, 0.01 |aM or better. 
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Expression of Synthetic HIV-1 Gag and Pol Genes 

Four synthetic HIV-1 Gag-and/or Pol expression vectors, hGag-Pol, hGag- 
PolAFsAPr, hPol and hGag genes were prepared (Fig. VII). To confirm expression, 
the synthetic or viral Gag-Pol genes were transiently transfected into 293T cells, a 
5 human kidney-derived cell line. When cell lysates were analyzed by 

immunoblotting with human anti-HIV-1 IgG (Fig. VLUA), monoclonal anti-p24 
(Fig. VIUB), and rabbit anti-RT (Fig. VIIIC), Gag p55, Pol pi 10 and Gag-Pol pl60 
precursor proteins were detected in hGag, hPol, and hGag-Pol fusion plasmids 
transfected 293T cells, as was expected. Mature virion proteins, p24 and RTp66, 

10 were detected in the hGag-Pol gene transfected cells (Figs. VHIA, B and C). This 
might be a result of the activation of protease inside cells which was itself a result of 
the high-level expression of Gag and Gag-Pol protein (Karacostas, V et al, 1993, 
Virology, 193:661-671). The expression of Gag precursor proteins from codon- 
altered vectors was £ 10-fold higher than viral Gag-Pol (Fig. VIII), determined by 

15 quantitative phosphorimaging. The level of accumulated Gag-Pol fusion protein 

was 100-fold higher in cells transfected with hGag-Pol compared to viral Gag-Pol. 
Virus-like particles were released from the hGag gene transfected cells (Fig. IX), 
detected by transmission electron microscopy. Though such particles were observed 
at a lower frequency with hGag-Pol, no particles were seen in cells transfected with 

20 hGag-PoIAFsAPr or hPol vectors. Stable expression of HIV-1 Gag and Pol proteins 
from codon-optimized genes in mouse CT26 and BH10ME cells was also observed 
(Fig. VHID). 

Induction of HIV-1 Gag and Pol CTL Responses in Mice by DNA Vaccination 

To evaluate the cellular immune response to HIV-1 Gag and Pol proteins, 
25 Balb/C female mice were injected intramuscularly with the eukaryotic expression 

vector plasmids containing the codon-optimized genes. Two weeks after the final 
vaccination, splenocytes were harvested from the immunized mice and sensitized 
with either Gag or Pol pep tide-pulsed naive mouse splenocytes. One week later, 
CTL responses were analyzed using a 5-hour chromium release assay. 
30 CTL responses specific to HIV-1 Gag and/or Pol were first analyzed using 

Gag or Pol peptide-pulsed BC10ME cells, or mouse fibrosarcoma cell lines derived 
from B/C-N cells (Collins, JL et al., 1981, J Exp. Med., 153:89-106). Immunization 
with hGag, hGag-PolAFSAPr or hGag-Pol genes induced comparably strong CTL 
responses specific to Gag (Fig. XA); however, after immunization with hPol, hGag- 
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PolAFSAPr or hGag-Pol genes, only the fusion protein, hGag-PolAFSAPr, and hPol 
to a lesser extent, elicited a marked CTL response to Pol (Fig. XB). To confirm that 
the specific killing in the CTL assays was induced by CD8 + cytotoxic T 
lymphocytes, CD4 4 " or CD8 + cells were depleted from sensitized splenocytes by 
5 Dynal beads (Dynal, Inc., Lake Success, NY). Depletion of CD8 + cells abolished 

the specific lysis in the hGag-Pol AFSAPr gene-immunized mice, while depletion of 
CD4 4 had little effect on lysis (Fig. XC), suggesting that CD8 + lymphocytes were 
responsible for specific cytotoxicity. 

The responses were further analyzed and confirmed with the hGag or hPol 

10 gene transduced syngeneic CT26 and BC10ME cell lines. Responses to Gag in the 
mice immunized with the hGag, hGag-PolAFSAPr or hGag-Pol genes were similar 
when peptide-pulsed cells were used as targets in the CTL assay (Fig. XIA). Mice 
immunized with the hPol gene generated a specific response to HIV-1 Pol on 
BC10ME cell lines stably expressing Pol as target cells (Fig. XIB). The same 

15 results have been observed with CT26 cell lines. These stably transfected cell lines 
were therefore more sensitive as target cells than peptide-pulsed cells in the Pol CTL 
assays. 

Antibody Response in the Immunized Mice 

Sera from mice immunized with different plasmids was analyzed with a p24 
20 ELISA. hGag immunized mice demonstrated the highest p24 antibody titers (Fig. 
XITA). Unexpectedly, hGag-Pol virus-like particles elicited the lowest levels of p24 
antibody. Similar results were observed by Western blotting with pooled sera (Fig. 
XIIB). The HIV-1 Pol specific antibodies were not detected by a commercially 
available Western blotting kit (Fig. XIIB), but antibodies to Pol were detected in 
25 mice immunized with hPol and hGag-PolAFSAPr with a more sensitive method, 
IP/Western blotting (Fig. XUC). Presumably, this assay is more sensitive and better 
able to detect native conformational epitopes. Though such antibodies were found 
in mice immunized with Pol and Gag-Pol fusion proteins in this assay, minimal 
response was detected in the mice immunized with hGag-Pol. Though both 
30 immunogens elicited similar Gag responses, the Gag-Pol fusion protein was 
therefore more effective in the stimulation of CTL and antibody responses to Pol. 
HIV Gag-Pol Fusion Proteins 

In this disclosure, HIV-1 B-clade Gag and Pol genes were modified to 
increase Rev-independent expression of HIV-1 Gag-Pol proteins. This modification 
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allowed synthesis of HIV Gag and Pol, as well as fusion proteins at levels 10- to 
100-fold higher than the corresponding Rev-dependent viral gene in the absence of 
Rev and RRE elements. These viral proteins were recognized by standard 
polyclonal and monoclonal antibodies (Fig. VIII), and immature VLP were 
produced and released from 293T cells transfected with hGag (Fig. IX). 

The immune response induced by Rev-independent Gag-Pol was directed 
against both Gag and Pol determinants. The hGag, hGag-Pol fusion and hGag-Pol 
all induced strong CTL responses specific for Gag in mice immunized with plasmid 
DNA, but a significant Pol response was elicited only in the mice immunized with 
the hGag-PolAFSAPr or Pol alone. Because immunization with hGag-Pol gene 
failed to induce detectable cellular or humoral responses to HIV-1 Pol protein, these 
findings indicate that the Gag-Pol fusion protein induces a broader range of 
responses and allows delivery of an immunogen with a larger number of epitopes in 
a single continuous open reading frame. During viral replication, viral gag - pol 
produces Gag precursor protein and the gag-pol fusion protein by frame shifting in a 
20:1 ratio (Wilson, W et ah, 1988, Cell, 55:1159-1169). The deletion of a frame 
shift site in hGag-PolAFSAPr results in production of only the Gag-Pol fusion 
protein. Expression of Gag-Pol proteins alone in human cells is not adequate to 
form releasable viral particles because HIV-1 viral assembly requires Gag precursor 
proteins (Park, J and CD Morrow, 1992, J Virol, 66:6304-6313; Smith, AJ et aL, 
1993, J Virol, 67:2266-2275). The ability of hGag-PolAFSAPr to elicit strong Gag 
and Pol specific CTL responses in mice may be explained by high level expression 
of Gag-Pol fusion protein and its retention on the inside of cells, which could create 
a new condition not existing during normal viral replication and could provide 
sufficient proteins for antigen presentation. Moreover, the mutation in viral protease 
prevents viral protein from intracellular activation and reduced cellular toxicity. 
Overexpression of this polyprotein is also likely to affect its intracellular 
localization/transport and is envisioned as improving antigen presentation. 

The Pol gene of HIV-1 is the precursor protein for viral protease, reverse 
transcriptase and integrase, which are crucial to viral replication (Kohl, NE et al., 
1988, Proc. Natl. Acad. Sci. USA, 85:4686-4690.). Retroviral extracellular 
maturation, resulting from self-activation of protease after release produces mature 
RT and IN which have important functions in reverse transcription and integration 
respectively, for HIV-1, HIV-2, and S1V replication. The catalytic cores of these 
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enzymes have relatively conserved domains in order to preserve their functions and 
thus are envisioned as inducing cross-clade CTL responses. As early as 1988, CTLs 
specific for HIV-1 RT were found in blood samples from HIV-1 infected individuals 
(Hosmalin, A et ah, 1990, Proc. Natl. Acad. Sci. USA, 87:2344-2348; Walker, BD 
et al., 1988, Science, 240:64-66.). Relatively strong Gag-specific CTL responses 
have been shown in numerous non-human primate and human studies, using DNA 
vaccines or a live recombinant vector containing viral Gag-Pol constructs (Evans, 
TG, 1999, J Infect Dis, 180:290-298; Ferrari, G, 1997, Blood, 90:2406-2416; Gorse, 
GJ et al, 1999, Vaccine, 18:835-849; Seth, A, 1998, Proc, Natl. Acad. Sci. USA, 
95:101 12-10116; Seth, A et al., 2000, J Virol, 74:2502-2509), but fewer Pol-specific 
CTL responses have been reported. The detection of significant CTL responses 
specific to Pol in our disclosure may be attributed in part to establishment of stable 
Pol expressing cell lines, in which codon alteration and inactivation of FS and PR in 
the Pol gene allow high level expression of the Pol protein without cellular toxicity. 
Though it remains possible that the hGag-Pol, or a combination of hGag and hPol, 
may exert similar effects with appropriate adjuvants or with different prime-boost 
regimens, the Rev-independent Gag-Pol fusion protein stimulates HIV-1 Gag and 
Pol specific CTL responses in mice and is envisioned as proving useful in an AIDS 
vaccine. 



PLASMID DESCRIPTIONS 

Env Plasmids 

VRC2100 

pVR1012x/s R5gpl39-Nef (delta) MHC (delta) CD4/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. The envelope 
protein gene from pX4gpl60/h (Nabel lab #1272) was ligated in frame with the 
mutant Nef gene from pNefDMHCDCD4/h (Nabel lab #1278) to produce 
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pX4gpl39~NefDMHCDCD4/h. The envelope-Nef fusion protein expressed from 
pX4gpl39-NefDMHCDCD4/h contains the first 668 amino acids from the HIV 
envelope glycoprotein (gpl39) ftjsed to the entire mutant Nef protein. The truncated 
envelope polyprotein (gpl 39) contains the entire SU protein and a portion of the TM 
5 protein including the fusion domain, but lacking the transmembrane domain and 
regions important for oligomer formation. The protein sequence of the Nef protein 
from HIV-1 PV22 (GenBank accession number K02083) was used to create a 
synthetic version of the Nef gene (NefTh) using codons optimized for expression in 
human cells. To disrupt the ability of Nef to limit both MHC class I and CD4 
10 expression, point mutations were introduced into the Nef gene from pNetfh (Nabel 
lab #1275). The resulting amino acids substitutions in pNefDMHCDCD4/h are: 
P69A, P72A, P75A, P78A, D174A and D175A. X4gpl39-NefDMHCDCD4/h is 
expressed from the pVR1012x/s (Nabel lab #1267) vector backbone. 

15 VRC2200 

pVR1012x/s R5gpl57-Nef (delta) MHC (delta) CD4/h 

The protein sequence of the envelope polyprotein (gpl 60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 

20 gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, bu t the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

25 envelope polyprotein amino acids 275 to 361 from X4gpl60/h (Nabel lab number 

1272) were replaced with the corresponding region from the BaL strain of HIV-1 
(GeneBank accession number M68893, again using human preferred codons). The 
envelope-Nef fusion protein expressed from pR5gpl57-Neffh contains the first 820 
amino acids from the HIV envelope glycoprotein (gpl 57) fused to the entire mutant 

30 Nef protein. The gene for gpl 57 was ligated in frame with the full-length mutant 
Nef gene from pNefDMHCDCD4/h (Nabel lab #1278) to produce pR5gpl57- 
NefDMHCDCD4/h. The protein sequence of the Nef protein from HIV-1 PV22 
(GenBank accession number K02083) was used to create a synthetic version of the 
Nef gene (Nef/h) using codons optimized for expression in human cells. To disrupt 

35 the ability of Nef to limit both MHC class I and CD4 expression, point mutations 
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were introduced into the Nef gene from pNeiTh (Nabel lab #1275). The resulting 
amino acids substitutions in pNe£DMHCDCD4/h are: P69A, P72A, P75A, P78A, 
D174A and D175A. R5gpl 57-NefDMHCDCD4/h is expressed from the 
pVR1012x/s (Nabel lab #1267) vector backbone. 

5 

VRC230Q 

pVR1012x/s X4 gpl39-Nef deltaMHC deltaCD4/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 

10 GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. The envelope 

15 protein gene from pX4gpl60/h (Nabel lab #1272) was ligated in frame with the 

mutant Nef gene from pNefDMHC/h (Nabel lab #1276) to produce pX4gpl39- 
NefDMHC/h. The envelope-Nef fusion protein expressed from pX4gpl39- 
NefDMHC/h contains the first 668 amino acids from the HIV envelope glycoprotein 
(gpl39) fused to.the entire mutant Nef protein. The truncated envelope polyprotein 

20 (gpl39) contains the entire SU protein and a portion of the TM protein including the 
fusion domain, but lacking the transmembrane domain and regions important for 
oligomer formation. The protein sequence of the Nef protein from HIV-1 PV22 
(GenBank accession number K02083) was used to create a synthetic version of the 
Nef gene (Nef/h) using codons optimized for expression in human cells. To disrupt 

25 the ability of Nef to limit MHC class I expression, point mutations were introduced 
into the Nef gene from pNef/h (Nabel lab #1275). The resulting amino acids 
substitutions in pNefDMHC/h are: P69A, P72A, P75A, and P78A. X4gpl39- 
NeEDMHC/h is expressed from the pVR1012x/s (Nabel lab #1267) vector backbone 

30* VRC2302 

pVR1012x/s X4gpl30-Nef/h 

For the X4gpl30/h (VRC2703) portion, the protein sequence of the envelope 
polyprotein (gpl60) from HXB2 (X44ropic, GenBank accession number K03455) 
35 was used to create a synthetic version of the gene (X4gpl60/h) using codons 
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optimized for expression in human cells. The nucleotide sequence X4gpl 60/h shows 
little homology to the HXB2 gene, but the protein encoded is the same with the 
following amino acid substitutions: F53L, N94D, K192S, I215N, A224T, A346D, 
and P470L. The full-lenth X4-tropic version of the envelope protein from 
5 pX4gpl60/h (VRC3300) was terminated after the codon for amino acid 602. The 
truncated envelope polyprotein contains the entire SU protein and a portion of the 
TM protein including the fusion domain, but lacking the transmembrane domain. 
Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. 
Regions important for oligomer formation may be partially functional. This 
10 X4gpl30/h (VRC2703) gene was fused to Nef gene. The Nef protein from HF/-1 
PV22 (GenBank accession number K02083) was used to create the viral Nef gene 
(pVR1012-Nef)(Nabel Lab #1093. The nucleotide sequence is homologous to the 
viral gene, and the protein encoded is the same. The expression vector backbone is 
pVR10l2x/s(VRC2000). 

15 

VRC2400 

pVR1012x/s X4gpl57-Ne£DMHCDCD4/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 

20 GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N 3 A224T, A346D, P470L, T723I, and S745T. The envelope- 

25 Nef fusion protein expressed from pX4gpl57-NefDMHCDCD4/h contains the first 
820 amino acids from the HIV envelope glycoprotein (gp!57) fused to the entire 
mutant Nef protein. The truncated envelope polyprotein (gpl57) contains the entire 
SU protein and a portion of the TM protein including the fusion domain, the 
transmembrane domain and regions important for oligomer formation. Heptad(H) 1, 

30 Heptad 2 and their Interspace(IS) are required for oligomerization. The gene for 
gpl57 was ligated in frame with the full-length mutant Nef gene from 
P NefDMHCDCD4/h (VRC3600) to produce pX4gpl 57-NeflDMHCDCD4/h. The 
protein sequence of the Nef protein from HIV-1 PV22 (GenBank accession number 
K02083) was used to create a synthetic version of the Nef gene (Nef/h) using codons 

35 optimized for expression in human cells. To disrupt the ability of Nef to limit both 
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MHC class I and CD4 expression, point mutations were introduced into the Nef 
gene from pNef/h (VRC3500). The resulting amino acids substitutions in 
pNe£DMHCDCD4/h are: P69A, P72A, P75A, P78A, D174A and D175A. 
X4gpl60-NefDMHCDCD4/h is expressed from the pVR1012x/s (VRC2000) vector 
5 backbone. 

VRC2700 

pVR10l2x/sX4g P 140/h 



10 The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 

GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 

15 N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. The full-tenth 

X4-tropic version of the envelope protein from pX4gp!60/h (VRC3300) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the transmembrane domain. Regions important for oligomer 
20 formation may be partially functional. Heptad(H) 1, Heptad 2 and their 

Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s (VRC2000). 



VRC2701 



25 pVR1012x/s X4gpl 40(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

30 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 

protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, and P470L. The full-lenth X4-tropic version 
of the envelope protein from pX4gp160/h (VRC3300) was terminated after the 
codon for amino acid 680. The truncated envelope polyprotein contains the entire 

35 SU protein and a portion of the TM protein including the fusion domain, but lacking 
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the transmembrane domain. Regions important for oligomer formation may be 
partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 503- 
536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
5 amino acids 593-620, have been deleted. The expression vector backbone is 
pVR1012x/s (VRC2000). 

VRC2702 

pVR1012x/sX4gpl28(del F/CL)/h 

10 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 

15 protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, and P470L. The full-lenth X4-tropic version 
of the envelope protein from pX4gp!60/h (VRC3300) was terminated after the 
codon for amino acid 592. The truncated envelope polyprotein contains the entire 
SU protein and a portion of the TM protein including the fusion domain, but lacking 

20 the transmembrane domain. Regions important for oligomer formation may be 
partially functional. Heptad(H) 1, Heptad 2 and their Tnterspace(IS) are required for 
oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 503- 
536, have been deleted. The expression vector backbone is pVR1012x/s (VRC2000). 

25 VRC27Q6 

pVR1012x/sX4gpl45/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 

30 gene (X4gpl60/h) using codons optimized for expression in human cells. The 

nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, and P470L. The full-lenth X4-tropic version 
of the envelope protein from pX4gpl60/h (VRC3300) was terminated after the 

35 codon for amino acid 704. The truncated envelope polyprotein contains the entire 
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SU protein and a portion of the TM protein including the fusion domain, but lacking 
the transmembrane domain. Regions important for oligomer formation may be 
partially functional. Heptad(H) 1, Heptad 2 and their Interspace(rS) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 

5 

VRC2707 

pVR1012x/s X4gpl45(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
10 GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, and P470L. The full-lenth X4-tropic version 
15 of the envelope protein from pX4gpl60/h (VRC3300) was terminated after the 
codon for amino acid 704. The truncated envelope polyprotein contains the entire 
SU protein and a portion of the TM protein including the fusion domain, but lacking 
the transmembrane domain. Regions important for oligomer formation may be 
partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
20 oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 503- 
536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. The expression vector backbone is 
pVR1012x/s(VRC2000). 

25 VRC2800 

pVR1012x/sR5gpl40/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 

30 gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HI.V-1 

35 envelope polyprotein amino acids 275 to 361 from X4gpI60/h (VRC3300) were 

-62- 



PCT/US01/25721 

replaced with the corresponding region from the BaL strain of H1V-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-lenth R5- 
tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the transmembrane domain. Regions important for oligomer 
formation may be partially functional. The expression vector backbone is 
pVR1012x/s (VRC2000). 
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VRC2801 

pVR1012x/s R5gpl40(dei F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
5 GenBank accession number K03455) was used to create a synthetic version of the 

gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 

10 R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-lenth R5- 
tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 

1 5 terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the transmembrane domain. Heptad(H) 1, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The Fusion and Cleavage (F/CL) 
domains, from amino acids 503-536, have been deleted. The Interspace (IS) between 

20 Heptad (H) I and 2, from amino acids 593-620, have been deleted. Regions 
important for oligomer formation may be partially functional. The expression vector 
backbone is pVR1012x/s (VRC2000). 

VRC2804 

25 pVR1012x/s R5gpl45/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

30 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 lo 361 from X4gpl60/h (VRC3300) were replaced with 

35 the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
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number M68893, again using human preferred codons). The full-tenth R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
5 domain, the transmembrane domain, and regions important for oligomer formation. 

Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
expression vector backbone is pVR1012x/s (VRC2000). 

VRC28Q5 

10 pVRl 012x/s R5gpl45(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

15 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 

20 the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-lenth R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 

25 domain, the transmembrane domain, and regions important for oligomer formation. 

Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
Fusion and Cleavage (F/CL) domains, from amino acids 503-536, have been 
deleted. The Interspace (IS) between Heptad (H) 1 and 2, from amino acids 593-620, 
have been deleted. The expression vector backbone is pVRl0l2x/s (VRC2000). 

30 
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VRC 2810 

pVR1012 x/s R5gp140delCl(delCFI)/h 



Constant domain 1 was deleted from gpl40de1CFI from amino acid 33-127 and was 
5 replaced with a Nhel site. 



VRC 28 J 1 

* 

pVR1012 x/s R5gpl40delC2 (delCFI)/h 

10 Constant domain 2 was deleted from gpl40delCFI from amino acid 199-293, and 
was replaced with an Nhel site 

VRC2812 

pVR 1012 x/s R5gpl40delC3 (delCFI)/h 

15 

Constant domain 3 was deleted from gpl40delCFI from amino acid 333-380, and 
was replaced with an Nhel site. 

VRC2813 

20 pVR1012x/sR5gpl40delC4(delCFr)/h 

Constant domain 4 was deleted from gpl40delCFI from amino acide 419-458, and 
was replaced with an Nhel site. 



25 VRC2814 

PVR1012 x/s R5gpl40delC5 (delCFI)/h 

Constant domain 5 was deleted from gpl40delCFl from amino acid 472 -498 and 
was rep laced wi th an Nhel site. 

30 

VRC 2820 

pVR1012x/s R5gpl40(dCFI)/dVl 



35 



The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
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gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
5 R5-tropic version of the envelope protein (R5gpl60/h), the region encoding H1V-1 

envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 

10 terminated after the codon for amino acid 680. The truncated envelope polyprotein 

contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the VI loop (a.a.129 to 154) and transmembrane domain. 
Heptad(H) I, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
Fusion and Cleavage (F/CL) domains, from amino acids 503-536, have been 

15 deleted. The Interspace (IS) between Heptad (H) 1 and 2, from amino acids 593-620, 
have been deleted. Regions important for oligomer formation may be partially 
functional. The expression vector backbone is pVR1012x/s (VRC2000). 

VRC 2821 

20 pVR1012x/s R5gpl40(dCFI)/dV2 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

25 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 

protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, IC192S, 121 5N, A224T, A346D, P470L, T7231, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 

30 replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 

35 domain, but lacking the V2 loop (a.a.160 to 193) and transmembrane domain. 
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Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
Fusion and Cleavage (F/CL) domains, from amino acids 503-536, have been 
deleted. The Interspace (IS) between Heptad (H) 1 and 2, from amino acids 593-620, 
have been deleted. Regions important for oligomer formation may be partially 
5 functional. The expression vector backbone is pVRl 012x/s (VRC2000). 



VRC 2822 

> ' — — — — — — 

pVR1012x/s R5gp140(dCFI)/dV3 

10 The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 

GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 

15 N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

■ 

envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 

20 R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V3 loop (a.a.299 to 327) and transmembrane domain. 
Heptad (H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. 

25 The Fusion and Cleavage (F/CL) domains, from amino acids 503-536, have been 

deleted. The Interspace (IS) between Heptad (H) 1 and 2, from amino acids 593-620, 
have been deleted. Regions important for oligomer formation may be partially 
functional. The expression vector backbone is pVR1012x/s (VRC2000). 

30 VRC 2823 

pVRl 01 2x/s R5gp 1 40(dCFI)/dV4 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
35 gene (X4gpl60/h) using codons optimized for expression in human cells. The 
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nucleotide sequence X4gpt60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
5 envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 

replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 

10 contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V4 loop (a.a.386 to 413) and transmembrane domain. 
Heptad (H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. 
The Fusion and Cleavage (F/CL) domains, from amino acids 503-536, have been 
deleted. The Interspace (IS) between Heptad (H) 1 and 2, from amino acids 593-620, 

15 have been deleted. Regions important for oligomer formation may be partially 
functional. The expression vector backbone is pVRJOl 2x/s (VRC2000). 

VRC 2824 

pVR1012x/s R5gpl40(dCFI)/dV12 

20 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 

25 protein encoded is the same with the following amino acid substitutions: F53L, 

N94D, IC192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 

30 accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gp!60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the VI, V2 loops (a.a.129 to 154,and a.a.160 to 193) and 

35 transmembrane domain. Heptad(H) 1, Heptad 2 and their Inlerspace(IS) are required 
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for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 
503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pVR1012x/s 
(VRC2000). 



VRC 2825 

pVR1012x/s R5gpl40(dCFI)/dV13 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the VI, V3 loops (a.a.129 to 154,and a.a.299 to 327) and 
transmembrane domain. Heptad (H) 1, Heptad 2 and their Interspace (IS) are 
required for oligomerization, The Fusion and Cleavage (F/CL) domains, from amino 
acids 503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, 
from amino acids 593-620, have been deleted. Regions important for oligomer 
formation- may be partially functional. The expression vector backbone is 
pVR1012x/s(VRC2000). 



VRC 2826 

pVRl 01 2x/s R5gpl 40(dCFI)/d V 1 4 



The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
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gene (X4gpl60/h) using codons optimized for. expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, KL192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 
5 R5-lropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 

10 terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the VI, V4 loops (a.a.129 to 154,and a.a.386 to 413) and 
transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 
for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 

15 503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pVR1012x/s 
(VRC2000). 

• * 

20 VRC2827 

■ 

pVR!012x/s R5gpl40(dCFI)/dV23 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 

25 gene (X4gpl60/h) using codons optimized for expression in human cells. The 

nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

30 envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HUV-1 (GeneBank 
accession number M68893, again using human preferred codons). The frill-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 

35 contains the entire SU protein and a portion of the TM protein including the fusion 
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domain, but lacking the V2, V3 loops (a.a.160 to 193,and a.a.299 to 413) and 
transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 
for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 
503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
5 amino acids 593-620, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pVR1012x/s 
(VRC2000). 

VRC 2828 

10 pVR1012x/s R5gpl40(dCFI)/dV24 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

15 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 

protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 

20 replaced with the corresponding region from the BaL strain of KLV-1 (GeneBank 
accession number M68893, again using human preferred codons). The fiill-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 

25 domain, but lacking the V2, V4 loops (a.a.160 to 193,and a.a.386 to 413) and 

transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 
for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 
503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. Regions important for oligomer formation 

30 may be partially functional. The expression vector backbone is pVR1012x/s 
(VRC2000). 
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VRC 2829 

pVR1012x/s R5gpl40(dCFI)/dV34 



The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
5 GenBank accession number K03455) was used to create a synthetic version of the 

gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I2I5N, A224T, A346D, P470L, T723I, and S745T. To produce an 

10 R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl6(Vh (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 

15 terminated after the codon for amino acid 680. The truncated envelope polyprotein 

contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V3, V4 loops (a.a.299 to 327,and a.a.386 to 413) and 
transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(lS) are required 
for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 

20 503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pVR1012x/s 
(VRC2000). 

25 VRC 2830 

■ ' ■ 

pVR1012x/s R5gpl40(dCFI)/dV123 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4- tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 

30 gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

35 envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
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replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gp160/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V1.V2.V3 loops (a.a. 129 to 154, a.a.I60 to 193,and a.a.299 
to 327) and transmembrane domain. Heptad(H) 1, Heptad 2 and their lnterspace(IS) 
are required for oligomerization. The Fusion and Cleavage (F/CL) domains, from 
amino acids 503-536, have been deleted. The Interspace (IS) between Heptad (H) I 
and 2, from amino acids 593-620, have been deleted. Regions important for 
oligomer formation may be partially functional. The expression vector backbone is 
pVR1012x/s(VRC2000). 



VRC 2831 

pVR1012x/s R5gpl40(dCFI)/dV124 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V1,V2,V4 loops (a.a. 129 to 154, a.a. 160 to 193,and a.a.386 
to 413) and transmembrane domain. Heptad (H) 1, Heptad 2 and their Interspace 
(IS) are required for oligomerization. The Fusion and Cleavage (F/CL) domains, 
from amino acids 503-536, have been deleted. The Interspace (IS) between Heptad 
(H) 1 and 2, from amino acids 593-620, have been deleted. Regions important for 
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The expression vector backbone is 



VRC 2832 

5 pVR1012x/s R5gpl40(dCFl)/dVl34 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

10 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 

15 replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gp!60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 

20 domain, but lacking the V1,V3,V4 loops (a.a. 129 to 154, a.a.299 to 327,and a.a.386 

to 413) and transmembrane domain. Heptad(H) 1, 'Hep tad 2 and their Interspace(IS) 
are required for oligomerization. The Fusion and Cleavage (F/CL) domains, from 
amino acids 503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 
and 2, from amino acids 593-620, have been deleted. Regions important for 

25 oligomer formation may be partially functional. The expression vector backbone is 

pVR1012xys(VRC2000). 

VRC 2833 

pVR1012x/s R5gpl40(dCFI)/dV234 

30 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
35 protein encoded is the same with the following amino acid substitutions: F53L, 
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N94D, K192S, 1215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
5 accession number M68893, again using human preferred codons). The full-length 

R5-tropic version of the envelope protein gene from pR5gp!60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V2,V3,V4 loops (a.a. 160 to 193, a.a.299 to 327,and a.a.386 

10 to 413) and transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace^) 
are required for oligomerization. The Fusion and Cleavage (F/CL) domains, from 
amino acids 503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 
and 2, from amino acids 593-620, have been deleted. Regions important for 
oligomer formation may be partially functional. The expression vector backbone is 

15 pVRl0l2x/s (VRC2000). 

VRC 2834 

. pVR10l2x/sR5gpl40(dCFI)/dV1234 

20 The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 

25 N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 

30 R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V1,V2,V3,V4 loops (a.a. 129 to 154, a.a. 160 to 193, 
a.a.299 to 327,and a.a386 to 413) and transmembrane domain. Heptad(H) 1, Heptad 

35 2 and their Interspace(lS) are required for oligomerization. The Fusion and Cleavage 
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(F/CL) domains, from amino acids 503-536, have been deleted. The Interspace (IS) 
between Heptad (H) 1 and 2, from amino acids 593-620, have been deleted. Regions 
important for oligomer formation may be partially functional. The expression vector 
backbone is pVR1012x/s (VRC2000). 

VRC 2835 

pAdApt R5gp 1 40(dCFI)/d V 1 

The protein sequence of the envelope polyprotein (gp!60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the VI loop (a.a.129 to 154) and transmembrane domain. 
Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
Fusion and Cleavage (F/CL) domains, from amino acids 503-536, have been 
deleted. The Interspace (IS) between Heptad (H) I and 2, from amino acids 593-620, 
have been deleted. Regions important for oligomer formation may be partially 
functional. The expression vector backbone is pAdApt. 

VRC 2836 
30 pAdApt R5gpl 40(dCFI)/d V2 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
35 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
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protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC33O0) were 
5 replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 

accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 

10 domain, but lacking the V2 loop (a.a.160 to 193) and transmembrane domain. 

Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
Fusion and Cleavage (F/CL) domains, from amino acids 503-536, have been 
deleted. The Interspace (IS) between Heptad (H) 1 and 2, from amino acids 593-620, 
have been deleted. Regions important for oligomer formation may be partially 

15 functional. The expression vector backbone is AdApt. 

VRC 2837 

pAdApt R5gpl40(dCFQ/dV3 

20 The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 

GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 

25 N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 

R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 

30 R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V3 loop (a.a.299 to 327) and transmembrane domain. 
Heptad (H) 1, Heptad 2 and their Interspace(lS) are required for oligomerization. 

35 The Fusion and Cleavage (F/CL) domains, from amino acids 503-536, have been 
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deleted. The Interspace (IS) between Heptad (H) 1 and 2, from amino acids 593-620, 
have been deleted. Regions important for oligomer formation may be partially 
functional. The expression vector backbone is pAdApt 

5 VRC 2838 

p Ad Apt R5gpl40(dCFI)/dV4 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 

10 gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tcopic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

15 envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 

20 contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V4 loop (a.a.386 to 413) and transmembrane domain. 
Heptad (H) 1, Heptad 2 and their lnterspace(IS) are required for oligomerization. 
The Fusion and Cleavage (F/CL) domains, from amino acids 503-536, have been 
deleted. The Interspace (IS) between Heptad (H) 1 and 2, from amino acids 593-620, 

25 have been deleted. Regions important for oligomer formation may be partially 
functional. The expression vector backbone is pAdApt. 

VRC 2839 

pAdApt R5gpl40(dCFI)/dV12 

30 

The protein sequence of the envelope polyprotein (gp!60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
35 protein encoded is the same with the following amino acid substitutions: F53L, 
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N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h),> the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC330O) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
5 accession number M68893, again using human preferred codons). The full-length 

R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the VI, V2 loops (a.a.129 to I54,and a.a.160 to 193) and 
10 transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 

for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 
503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pAdApt. 

15 

VRC 2840 

pAdApt R5gpl40(dCFI)/dV13 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 

20 GenBank accession number K03455) was used to create a synthetic version of the 

gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 

25 R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 

30 terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the VI, V3 loops (a.a.129 to 154,and a.a.299 to 327) and 
transmembrane domain. Heptad (H) 1, Heptad 2 and their Interspace (IS) are 
required for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino 

35 acids 503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, 
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from amino acids 593-620, have been deleted. Regions important for oligomer 
formation may be partially functional. The expression vector backbone is pAdApt. 

VRC 2841 

5 p Ad Apt R5gpl40(dCFI)/dV14 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was' used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

10 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 

15 replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 

accession number M68893, again using human preferred codons). The full-length 
R5-tropic 'version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 

20 domain, but lacking the VI, V4 loops (a.a.129 to 154,and a.a.386 to 413) and 
transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 
for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 
503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. Regions important for oligomer formation 

25 may be partially functional. The expression vector backbone is pAdApt. 

VRC 2842 

pAdApt R5gpl40(dCFl)/dV23 

30 The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the . 
protein encoded is the same with the following amino acid substitutions: F53L, 

35 N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an . 
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R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
5 R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 

terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TNI protein including the fusion 
domain, but lacking the V2, V3 loops (a. a. 160 to 193,and a.a.299 to 413) and 

* 

transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 
10 for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 
503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pAdApt. 

15 VRC 2843 

pAdApt R5gpl40(dCFI)/dV24 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 

20 gene (X4gpl60/h) using codons optimized for expression in human cells. The 

nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
M94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

25 envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 

replaced with the corresponding region from the BaL strain of HIV-l (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 

30 contains the entire SU protein and a portion of the TM protein including the fusion 

domain, but lacking the V2, V4 loops (a.a.160 to 193 s and a.a.386 to 413) and 
transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 
for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 
503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
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amino acids 593-620, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pAdApt. 



VRC 2844 

pAdApt R5gpl40(dCFI)/dV34 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HTV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV- 1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V3, V4 loops (a.a.299 to 327,and a.a.386 to 413) and 
transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace^) are required 
for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 
503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pAdApt. 

VRC 2845 

p Ad Apt R5gpl40(dCFl)/dV123 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: P53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723f, and S745T. To produce an 
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R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GencBank 
accession number M68893, again using human preferred codons). The full-length 
5 R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V1,V2,V3 loops (a.a. 129 to 154, a.a.160 to 193,and a.a.299 
to 327) and transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) 
10 are required for oligomerization. The Fusion and Cleavage (F/CL) domains, from 
amino acids 503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 
and 2, from amino acids 593-620, have been deleted. Regions important for 
oligomer formation may be partially functional. The expression vector backbone is 
pAdApt 

15 

VRC 2846 

pAdApt R5gp 140(dCFl)/d V 1 24 

The protein sequence of the envelope polyprotein (gp 1 60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723T, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V1,V2,V4 loops (a.a. 129 to 154, a.a.160 to 193,and a.a.386 
to 413) and transmembrane domain. Heptad (H) 1, Heptad 2 and their Interspace 
(IS) are required for oligomerization. The Fusion and Cleavage (F/CL) domains, 
from amino acids 503-536, have been deleted. The Interspace (IS) between Heptad 
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(H) 1 and 2, from amino acids 593-620, have been deleted. Regions important for 
oligomer formation may be partially functional. The expression vector backbone is 
pAdApt. 



5 VRC 2847 

* 

pAdApt R5gpl40(dCFI)/dV 1 34 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 

10 gene (X4gpl60/h) using codons optimized for expression in human cells. The 

nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

15 envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 

replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 

20 contains the entire SU protein and a portion of the TM protein including the fusion 

domain, but lacking the V1,V3,V4 loops (a.a. 129 to 154, a.a.299 to 327,and a.a.386 
to 413) and transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) 
are required for oligomerization. The Fusion and Cleavage (F/CL) domains, from 
amino acids 503-536, have been deleted. The Interspace (IS) between Heptad '(H) 1 

25 and 2, from amino acids 593-620, have been deleted. Regions important for 

oligomer formation may be partially functional. The expression vector backbone is 
pAdApt. 



VRC 2848 
30 pAdApt R5gpl 40(dCFl)/dV234 



The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
35 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
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protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V2,V3,V4 loops (a.a. 160 to 193, a.a.299 to 327,and a.a.386 
to 413) and transmembrane domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) 
are required for oligomerization. The Fusion and Cleavage (F/CL) domains, from 
amino acids 503-536, have been deleted. The Interspace (JS) between Heptad (H) 1 
and 2, from amino acids 593-620, have been deleted. Regions important for 
oligomer formation may be partially functional. The expression vector backbone is 
pAdApt. 

VRC 2849 

pAdApt R5gpl40(dCFI)/dV1234 

The protein sequence of the envelope polyprotein (gp!60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-l 
envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-length 
R5-tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
terminated after the codon for amino acid 680. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the V1,V2,V3,V4 loops (a.a. 129 to 154, a.a. 160 to 193, 
a.a.299 to 327,and a.a.386 to 413) and transmembrane domain. Heptad(H) ], Heptad 
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2 and their Interspace(IS) are required for oligomerization. The Fusion and Cleavage 
(F/CL) domains, from amino acids 503-536, have been deleted. The Interspace (IS) 
between Heptad (H) 1 and 2, from amino acids 593-620, have been deleted. Regions 
important for oligomer formation may be partially functional. The expression vector 
5 backbone is pAdApt. 

VRC2850 

pVR1012 x/s R5gpl45delCl (delCFI)/h 



10 Constant domain 1 was deleted from gpl45delCFI from amino acid 33-127 and was 
replaced with an Nhe I site. 



VRC2851 

pVR1012 x/s R5gpl45delC2 (delCFI)/h 

15 

Constant domain 2 was deleted from gpl45dCFI from amino acid 199-293 and was 
replaced with an Nhe I site. 

VRC2852 

20 pVR10l2 x/s R5gpl45delC3 (delCFI)/h 

Constant domain 3 was deleted from gpl45delCFI from amino acide 333-380 and 
was replaced with an Nhe I site. 

25 VRC2853 

pVRl 012 x/s R5gpl45deIC4 (delCFI)/h 

Constant domain 4 was deleted from gpl45delCFl from amino acid 419-458 and 
was replaced with an Nhe 1 site. 

30 

VRC28S4 

pVR1012 x/s R5gpl45delC5 (delCFl)/h 

* 

Constant domain 5 was deleted from gpl45delCFl from amino acid 472-498 and 
35 was replaced with an Nhe I site. 
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VRC 2860 

pVR1012x/s R5gpl45(dCFI)/h/dVi 

5 The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 

1 0 N94D, Kl 92S, 12 1 5N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 

15 version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 

20 VI loop (a.a. 129-154) and Fusion and Cleavage (F/CL) domains (a.a. 503-536) 
have been deleted. Also, the Interspace (IS) between Heptad (H) 1 and 2 (a.a.593- 
620) have been deleted. The expression vector backbone is pVR1012x/s 
(VRC2000). 

25 VRC 2861 

pVR1012x/s R5gpl45(dCFI)/h/dV2 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 

30 gene (X4gpl60/h)- using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 

35 polyprotein amino acids 275 to 36 1 from X4gp 1 60/h (VRC3300) were replaced with 
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the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad(H) I, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
V2 loop (a.a. 160-193) and Fusion and Cleavage (F/CL) domains (a.a. 503-536) 
have been deleted. Also, the Interspace (IS) between Heptad (H) 1 and 2 (a.a.593- 
620) have been deleted. The expression vector backbone is pVR1012x/s 
(VRC2000). 

VRC 2862 

pVR1012x/s R5gpl45(dCFI)/h/dV3 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 fromX4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gp!45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad (H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. 
The V3 loop (a.a. 299-327) and Fusion and Cleavage (F/CL) domains (a.a. 503-536) 
have been deleted. Also, the Interspace (IS) between Heptad (H) 1 and 2 (a.a.593- 
620) have been deleted. The expression vector backbone is pVRl012x/s 
(VRC2000). 
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VRC 2863 

p VR1 0 1 2x/s R5gp 1 45 (dCFI)/h/d V4 



The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad (H) 1, Heptad 2 and their Interspace (IS) are required for oligomerization. 
The V4 loop (a.a. 386-413) and Fusion and Cleavage (F/CL) domains (a.a. 503-536) 
have been deleted. Also, the Interspace (IS) between Heptad (H) 1 and 2 (a.a.593- 
620) have been deleted. The expression vector backbone is pVR1012x/s 
(VRC2000). 

VRC 2864 

pVR1012x/s R5gpl45(dCFl)/h/dV12 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 1215N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
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number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
5 domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
VI, V2 loops (a.a. L29-154 and 160-193) and Fusion and Cleavage (F/CL) domains 
(a.a. 503-536) have been deleted. Also, the Interspace (IS) between Heptad (H) 1 
and 2 (a.a.593-620) have been deleted. The expression vector backbone is 
10 pVRl 01 2x/s (VRC2000). 

VRC 2865 

pVR1012x/sR5gpl45(dCFI)/h/dV13 

15 The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions:- F53L, 

20 N94D, Kl 92S, 12 1 5N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 

25 version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad (H) 1, Heptad 2 and their Interspace (IS) are required for oligomerization. 

30 The VI, V3 loops (a.a. 129-154 and 299-327) and Fusion and Cleavage (F/CL) 
domains (a.a. 503-536) have been deleted. Also, the Interspace (IS) between Heptad 
(H) 1 and 2 (a.a.593-620) have been deleted. The expression vector backbone is 
pVR1012x/s(VRC2000). 
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VRC 2866 

pVR1012x/s R5gpl45(dCFI)/h/dV14 



PCT/US01/2572I 



The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
5 GenBank accession number IC03455) was used to create a synthetic version of the 

gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, and P470L. To produce an R5-tropic version 

10 of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 

polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession • 
number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 

15 after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 

contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad (H) 1, Heptad 2 and their Interspace (IS) are required for oligomerization. 
The VI, V4 loops (a.a. 129-154 and 386-413) and Fusion and Cleavage (F/CL) 

20 domains (a.a. 503-536) have been deleted. Also, the Interspace (IS) between Heptad 
(H) 1 and 2 (a.a.593-620) have been deleted. The expression vector backbone is 
pVR1012x/s(VRC2000). 

VRC 2867 

25 pVR1012x/s R5gpl45(dCFI)/h/dV23 



The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

30 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 1215N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 

35 the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
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number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
5 domain, the transmembrane domain, and regions important for oligomer formation. 

Heptad(H) 1, Heptad 2 and their Interspace (IS) are required for oligomerization. 
The V2, V3 loops (a.a. 160-193 and 299-327) and Fusion and Cleavage (F/CL) 
domains (a.a. 503-536) have been deleted. Also, the Interspace (IS) between Heptad 
(H) 1 and 2 (a.a.593-620) have been deleted. The expression vector backbone is 
10 pVRl 01 2x/s (VRC2000). 

VRC 2868 

P VRl0l2x/s R5gpl45(dCFI)/h/dV24 

15 The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 

20 N94D, K192S, I215N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 

25 version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 

after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad(H) 1, Heptad 2 and their Interspace (rS) are required for oligomerization. 

30 The V2, V4 loops (a.a. 160-193 and 386-413) and Fusion and Cleavage (F/CL) 

domains (a.a. 503-536) have been deleted. Also, the Interspace (IS) between Heptad 
(H) 1 and 2 (a.a.593-620) have been deleted. The expression vector backbone is 
pVR1012x/s(VRC2000). 
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VRC 2869 

pVRl 01 2x/s R5gp H5(dCFI)/h/dV34 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
5 GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, and P470L. To produce an R5-tropic version 

10 of the envelope protein (R5gpl.60/h), the region encoding HIV-1 envelope 

polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 

15 after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad (H) 1, Heptad 2 and their Interspace (IS) are required for oligomerization. 
The V3, V4 loops (a.a. 299-327 and 386-413) and Fusion and Cleavage (F/CL) 

20 domains (a.a. 503-536) have been deleted. Also, the Interspace (IS) between Heptad 
(H) 1 and 2 (a.a.593-620) have been deleted. The expression vector backbone is 
pVR1.012xys (VRC2000). 

VRC 2870 

25 pVR1012x/s R5gpl45(dCFl)/h/dV134 

The protein sequence of the envelope polyprotein (gp!60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

30 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 1215N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 

35 the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
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number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad (H) I, Heptad 2 and their Interspace (IS) are required for oligomerization. 
The VI, V3, V4 loops (a.a. 129-154, 299-327 and 386-413) and Fusion and 
Cleavage (F/CL) domains (a.a. 503-536) have been deleted. Also, the Interspace 
(IS) between Heptad (H) 1 and 2 (a.a.593-620) have been deleted. The expression 
vector backbone is pVR1012x/s (VRC2000). 

VRC 2871 

pVRlOl 2x/s R5gpl45(dCFl)/h/dV234 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBankaccession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, Kl 92S, 121 5N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad (H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. 
The V2, V3, V4 loops (a.a. 160-193, 299-327 and 386-413) and Fusion and 
Cleavage (F/CL) domains (a.a. 503-536) have been deleted. Also, the Interspace 
(IS) between Heptad (H) 1 and 2 (a.a.593-620) have been deleted. The expression 
vector backbone is pVRl 012x/s (VRC2000). 
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VRC 2872 

pVR1012x/s R5gpl45(dCFI)dvl23/h 



The protein sequence of the envelope polyprotein (gp!60) from HXB2 (X4-tropic, 
5 GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, and P470L. To produce an R5-tropic version 

10 of the envelope protein (R5gpl60/h), the region encoding H1V-1 envelope 

polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 

15 after the codon for amino acid 704. The truncated envelope polyprotein (gpl45) 
contains the entire SU protein and a portion of the TM" protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad (H) 1, Heptad 2 and their Interspace (IS) are required for oligomerization. 
The VI, V2, V4 loops (a.a. 129-154, 160-193 and 386-413) and Fusion and 

20 Cleavage (F/CL) domains (a.a. 503-536) have been deleted. Also, the Interspace 
(IS) between Heptad (H) 1 and 2 (a.a. 593-620) have been deleted. The expression 
vector backbone is pVR1012x/s (VRC2000). 

VRC 2873 

25 pVR1012x/s R5gpl45(dCFl)/h/dV124 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 

30 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 

protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 

35 the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
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number M68893, again using human preferred codons). The full-length R5-tropic 
version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gp145) 
contains the entire SU protein and a portion of the TM protein including the fusion 
5 domain, the transmembrane domain, and regions important for oligomer formation. 

Heptad (H) 1, Heptad 2 and their Interspace (IS) are required for oligomerization. 
The VI, V2, V4 loops (a.a. 129-1.54, 1.60-193 and 386-413) and Fusion and 
Cleavage (F/CL) domains (a.a. 503-536) have been deleted. Also, the Interspace 
(IS) between Heptad (H) 1 and 2 (a.a.593-620) have been deleted. The expression 
10 vector backbone is pVRlOl 2x/s (VRC2000). 

VRC 2874 

pVR1012x/s R5gpl45(dCFI)/h/dV1234 

15 The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 

GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 

20 N94D, Kl 92S, 12 15N, A224T, A346D, and P470L. To produce an R5-tropic version 
of the envelope protein (R5gpl60/h), the region encoding HIV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The full-length R5-tropic 

25 version of the envelope protein gene from pR5gpl60/h (VRC3000) was terminated 
after the codon for amino acid 704. The truncated envelope polyprotein (gp!45) 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, the transmembrane domain, and regions important for oligomer formation. 
Heptad (H) 1, Heptad 2 and their Interspace (IS) are required for oligomerization. 

30 The VI, V2, V3, V4 loops (a.a. 129-154, 160-193, 299-327 and 386-413) and 

Fusion and Cleavage (F/CL) domains (a.a. 503-536) have been deleted. Also, the 
Interspace (IS) between Heptad (H) 1 and 2 (a.a.593-620) have been deleted. The 
expression vector backbone is pVR10l2x/s (VRC2000). 
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VRC2900 

pVR1012x/sR5gpl50/h 



PCT/US01/2572I 



The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
5 GenBank accession number IC03455) was used to create a synthetic version of the 

gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
10 R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 

envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of PIIV-1 (GeneBank 
accession number M68893, again using human preferred codons). The full-lenth R5- 

4 

tropic version of the envelope protein gene from pR5gpl60/h (VRC3000) was 
15 . terminated after the codon for amino acid 752. The truncated envelope polyprotein 
(gpl50) contains the entire SU protein and a portion of the TM protein including the 

■ 

fusion domain, the transmembrane domain, and regions important for oligomer 
formation. The expression vector backbone is pVR1012x/s (VRC2000). 

20 VRC3000 

pVR1012x/sR5gpl60/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
25 gene (X4gp160/h) using codons optimized for expression in human cells. The 

nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 

< 

protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an 
R5-tropic version of the envelope protein (R5gpl60/h), the region encoding HIV-1 
30 envelope polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were 
replaced with the corresponding region from the BaL strain of HIV-1 (GeneBank 
accession number M68893, again using human preferred codons). Full length SU 
and TM proteins are expressed from the pVR1012x/s (VRC2000) vector backbone. 
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VRC3200 

pVR1012x/sX4gpl50/h 

* 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
5 GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. The Ml-lenth 
10 X4-tropic version of the envelope protein gene from pX4gp!60/h (VRC3300) was 

terminated after the codon for amino acid 752. The truncated envelope polyprotein 
(gpl 50) contains the entire SU protein and a portion of the TM protein including the 
fusion domain, the transmembrane domain, and regions, important for oligomer 
formation. The expression vector backbone is pVR1012x/s (VRC2000). 

15 

VRC3201 

p VR1 01 2x/s X4gpl 50(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl 60) from HXB2 (X4-tropic, 
20 GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. The fiill-lenth 
25 X4-tropic version of the envelope protein gene from pX4gpl 60/h (VRC3300) was 

terminated after the codon for amino acid 752. The truncated envelope polyprotein 
(gpl 50) contains the entire SU protein and a portion of the TM protein including the 
fusion domain, the transmembrane domain, and regions important for oligomer 
formation. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
30 oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 503- 
536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. The expression vector backbone is 
pVR1012x/s(VRC2000). 
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VRC3202 

pVRl 012x/s X4gpl 50 Agly/h. 

Eukaryotic vector with humanized codons expressing the HIV envelope 
5 glycoprotein gplSO from HXB2, X4 tropic mutated in the Glycosylation sites. 

VRC3202 pVR1012x/s X4gpl50Dgly/h The protein sequence of the envelope 
polyprotein (gpl60) from HXB2 (X4- tropic, GenBank accession number K03455) 
was used to create a synthetic version of the gene (X4gpl60/h) using codons 
optimized for expression in human cells. The nucleotide sequence X4gpl60/h shows 

10 little homology to the HXB2 gene, but the protein encoded is the same with the 
following amino acid substitutions: F53L, N94D, K192S, 121 5N, A224T, A346D, 
P470L, T723T, and S745T. To disrupt potential glycoslylation sites in the HTV-1 
envelope proteins, point mutations were introduced into the full-length X4-tropic 
version of the envelope protein gene from pX4gpl60/h (VRC3300). The resulting 

15 amino acids substitutions in X4gpl60Dgly/h are: N88D, N156D, N160D, N197E, 

N230D, N234D, N241D, N276D, L288V, N289D, S291T, N295D, N332D, N339D, 
N355D, N386D, and N448D. The full-length X4-tropic version of the envelope 
protein gene from pX4gp!60/h. (VRC3300) was terminated after the codon for 
amino acid 752. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 

20 oligomerization. Full length SU and TM proteins are expressed from the 

pVR1012x/s (VRC2000) vector backbone. 

VRC3203 

pVR1012x/s X4gpl50 AB Agly/h . 

25 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The AB 
designation means the amino acids from 1-307 (Xbal to EcoRl). The nucleotide 
30 sequence X4gpl60/h shows little homology to the HXB2 gene, but the protein 

encoded is the same with the following amino acid substitutions: F53L, N94D, 
K192S, I215N, A224T, A346D, P470L, T723I,and S745T. To disrupt potential 
glycoslylation sites in the HIV-1 envelope proteins, point mutations were introduced 
into the full-lenth X4-tropic version of the envelope protein gene from pX4gpl60/h 
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(VRC3300). The resulting amino acids substitutions in X4gpl60Dgly/h are: N88D, 
N156D, N160D, N197E, N230D, N234D, N241D, N276D, L288V, N289D, S291T, 
and N295D. The full-lenth X4-tropic version of the envelope protein gene from 
pX4gpl60/h (VRC3300) was terminated alter the codon for amino acid 752. 
5 Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. Full 

length SU and TM proteins are expressed from the pVR1012x/s (VRC2000) vector 
backbone. 

VRC33QQ 

10 pVR1012x/s X4gpl60/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
15 nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 

protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. Heptad(H) I, 
Heptad 2 and their Interspace(IS) are required for oligomerization. Full length SU 
and TM proteins are expressed from the pVRl 012x/s (VRC2000) vector backbone. 

20 

VRC3301 

pVR10l2x/s X4gpl60(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
25 GenBank accession number K03455) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, I215N, A224T, A346D, P470L, T723I, and S745T. Full length SU 
30 and TM proteins are Expressed. Heptad(H) 1, Heptad 2 and their Interspace(lS) are 
required for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino 
acids 503-536, have been deleted. The Interspace (rS) between Heptad (H) 1 and 2, 
from amino acids 593-620, have been deleted, The expression vector backbone is 
pVR1012x/s (VRC2000). 

35 
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The protein sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, 
5 GenBank accession number K03455) was used to create a synthetic version of the 

gene (X4gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence X4gpl60/h shows little homology to the HXB2 gene, but the 
protein encoded is the same with the following amino acid substitutions: F53L, 
N94D, K192S, 121 5N, A224T, A346D, P470L, T723I, and S745T. To disrupt 

10 potential glycoslylation sites in the HIV-1 envelope proteins, point mutations were 
introduced into the full-Ienth X4-tropic version of the envelope protein gene from 
pX4gpl60/h (VRC3300). The resulting amino acids substitutions in X4gpl60Dgly/h 
are: N88D, N156D, N160D, N197E,.N230D, N234D, N241D, N276D, L288V, 
N289D, S291T, N295D, N332D, N339D, N356D, N386D, and N448D. Heptad(H) 

15 1, Heptad 2 and their Interspace(TS) are required for oligomerization. Full length SU 
and TM proteins are expressed from the pVR1012x/s (VRC2000) vector backbone. 

VRC3401 

pVR1012x/s X4gpl60AB mut Agly/h . 

20 

The protein sequence of the envelope polyprotein (gp!60) from HXB2 (X4-tropic, 
GenBank accession number K034S5) was used to create a synthetic version of the 
gene (X4gpl60/h) using codons optimized for expression in human cells. The AB 
designation means the amino acids from 1-307 (Xbal to EcoRl). The nucleotide 

25 sequence X4gpl60/h shows little homology to the HXB2 gene, but the protein 

encoded is the same with the following amino acid substitutions: F53L, N94D, 
K192S, I215N, A224T, A346D, P470L, T723r, and S745T. To disrupt potential 
glycoslylation sites in the HIV-1 envelope proteins, point mutations were introduced 
into the full-tenth X4-tropic version of the envelope protein gene from pX4gp!60/h 

30 (VRC3300). The resulting amino acids substitutions in X4gpl60Dgly/h are: N88D, 

N156D, N160D, N197E, N230D, N234D, N241D, N276D, L288V, N289D, S291T, 
and N295D. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. Full length SU and TM proteins are expressed from the 
pVRl 01 2x/s (VRC2000) vector backbone. 

* 
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VRC3500 

pVR1012x/sNeffh 

5 The protein sequence of the Nef protein from HIV-1 PV22 (GenBank accession 

number K02083) was used to create a synthetic version of the Nef gene (NefTh) 
using codons optimized for expression in human cells. The nucleotide sequence 
NefTh shows little homology to the viral gene, but the protein encoded is the same. 
Nef/h is expressed from the pVR1012x/s (VRC2000) vector backbone. 

10 

VRC3600 

pVR1012x/s Ne£DMHCDCD4/h 

The protein sequence of the Nef protein from HIV-1 PV22 (GenBank accession 
15 number K02083) was used to create a synthetic version of the Nef gene (Neffh) 

using codons optimized for expression in human cells. To disrupt the ability of Nef 
to limit both MHC class I and CD4 expression point mutations were introduced into 
the Nef gene from pNeflh (VRC3500). The resulting amino acids substitutions in 
P NefDMHCDCD4/h are: P69A, P72A, P75A, P78A, D174A and . D175A. 
20 pNefDMHCDCD4/h is expressed from the pVRl012x/s (VRC2000) vector 

backbone. 

* 

VRC3700 

pVR1012x/sNefDCD4/h 

25 

The protein sequence of the Nef protein from HIV-1 PV22 (GenBank accession 
number K02083) was used to create a synthetic version o f the Nef gene (Nef/h) 
using codons optimized for expression in human cells. To disrupt the ability of Nef 
to limit CD4 expression, point mutations were introduced into the Nef gene from 

30 pNef/h (VRC3500). The resulting amino acids substitutions in pNefDCD/h are: 
D174A and D175A. pNefDCD4/h is expressed from the pVR1012x/s (VRC2000) 
vector backbone. VRC3700 pVR1012x/s NefDCD4/h The protein sequence of the 
Nef protein from HIV-1 PV22 (GenBank accession number K02083) was used to 
create a synthetic version o f the Nef gene (Nef/h) using codons optimized for 

35 expression in human cells. To disrupt the ability of Nef to limit CD4 expression, 
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point mutations were introduced into the Nef gene from pNetfh (VRC3500). The 
resulting amino acids substitutions in pNefDCD/h are: D174A and D175A. 
pNefDCD4/h is expressed from the pVR1012x/s (VRC2000) vector backbone. 

5 VRC3800 

pVR1012x/s NefDMHC/h 

The protein sequence of the Nef protein from HAM PV22 (GenBank accession 
number K02083) was used to create a synthetic version of the Nef gene (Nef/h) 
10 using codons optimized for expression in human cells. To disrupt the ability of Nef 

to limit MHC class I expression, point mutations were introduced into the Nef gene 
from pNefito (VRC3500). The resulting amino acids substitutions in pNefDMHC/h 
are: P69A, P72A, P75A, and P78A. pNefDMHC/h is expressed from the 
pVR1012x/s (VRC2000) vector backbone. 

15 

VRCS20Q 

pVR1012x/s 89.6Pgpl28(delF/CL)/h 

The protein sequence of the envelope polyprotein (gpl60) from 89.6P (Dual-tropic, 
20 GenBank accession number u89134/LOCUS:SIU89134) was used to create a 

synthetic version of the gene (89.6Pgpl60/h) using codons optimized for expression 
in human cells. The nucleotide sequence 89.6Pgpl60/h shows little homology to the 
89.6P gene, but the protein encoded is the same. The fulMenth 89.6P, dual-tropic 
version of the envelope protein gene from 89.6P gpl60/h (VRC3000) was 
25 terminated after the codon for amino acid 596. The truncated envelope polyprotein 
contains the entire SU protein and a portion of the TM protein including the fusion 
domain, but lacking the transmembrane domain. The Fusion and Cleavage (F/CL) 
domains, from amino acids 508-541, have been deleted. The expression vector 
backbone is pVR1012x/s (VRC2000). 

30 

VRC5201 

pVRlOl 2x/s 89.6Pgpl40(del F/CL del H IS/h 



35 



The protein sequence of the envelope polyprotein (gpl60) from 89.6P (dual-tropic, 
GenBank accession number u89134/locus SIU89134) was used to create a synthetic 
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version of the gene (Dualtropic gpl60/h) using codons optimized for expression in 
human cells. The nucleotide sequence dualtropic gpl60/h shows little homology to 
the 89.6P gene, but the protein encoded is the same. The full-lenth 89.6P, dual-tropic 
version of the envelope protein gene was terminated after the codon for amino acid 
5 683. The truncated envelope polyprotein contains the entire SU protein and a portion 

of the TM protein including the fusion domain, but lacking the transmembrane 
domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 508- 
541, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
10 amino acids 597-625, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pVR1012x/s 
(VRC2000). 

* 

VRC5202 

15 pVR1012x/s89.6Pgpl45(del F/CL del HIS/h 

The protein sequence of the envelope polyprotein (gpl60) from 89.6P (dual-tropic, 
GenBank accession number u89134/locus SIU89134) was used to create a synthetic 
version of the gene (Dualtropic gpl60/h) using codons optimized for expression in 

20 human cells. The nucleotide sequence dualtropic gpl60/h shows little homology to 
the 89.6P gene, but the protein encoded is the same. The full-lenth 89.6P, dual-tropic 
version of the envelope protein gene was terminated after the codon for amino acid 
709. The truncated envelope polyprotein contains the entire SU protein and a portion 
of the TM protein including the fusion domain, but lacking the transmembrane 

25 domain. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 508- 
541, have been deleted. The Interspace (IS) between Heptad (H) I and 2, from 
amino acids 597-625, have been deleted. Regions important for oligomer formation 
may be partially functional. The expression vector backbone is pVR1012x/s 

30 (VRC2000). 
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VRC5203 

pVR1012x/s 89.6Pgpt60/h 



The protein sequence of the envelope polyprotein (gpl60) from 89.6P (Dual-tropic, 
GenBank accession number U89134/LOCUS: SIU89134) was used to create a 
synthetic version of the gene (dual tropic gpl60/h) using codons optimized for 
expression in human cells. The nucleotide sequence 89.6P gp 1.60/h shows little 
homology to the 89.6P gene, but the protein encoded is the same. Full length SU and 
TM proteins are expressed from the pVR1012x/s (VRC2000) vector backbone 

VRCS300 

pVR1012x/s R5(clade C)gpl40(del F/CL del H IS)/h 



The protein sequence of the envelope polyprotein (gpl60) from 92br025 (R5-tropic, 

15 GenBank accession number U52953) was used to create a synthetic version of the 
gene (R5gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence R5gpl60/h shows little homology to the 92br025 gene, but the 
protein encoded is the same. The full-length R5-tropic version of the envelope 
protein was synthesized by Operon under the name: kongene. The Xbal (18nt up- 

20 stream from ATG) to Bglll (1376nt down-stream from ATG) fragment which 
contains poly linker at the 5' end, Kozak sequence and ATG was cloned into the 
Xbal to BgUI sites of VRC2701 pVRl0l2x/s X4gpl40(del F/CL del H IS)/h 
backbone. Therefore, the gene is R5 (clade C) gpl60/h up to the Bglll site (1376nt 
from ATG) and the rest of the gene after BgUI site is VRC2701 pVR1012x/s 

25 X4gpl40(del F/CL del H lS)/h. The truncated envelope polyprotein contains the 

entire SU protein and a portion of the TM protein including the fusion domain, but 
lacking the transmembrane domain. Regions important for oligomer formation may 
be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 
for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 

30 503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. The expression vector backbone is 
pVR1012x/s (VRC2000). The full-lenth X4-tropic version of the envelope protein 
from pX4gpl60/h (VRC3300) was terminated after the codon for amino acid 680. 



-106- 



WO 02/32943 

VRC5301 
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The prolein sequence of the envelope polyprotein (gp!60) from 92br025 (R5-tropic, 
5 GenBank accessionnumber U52953) was used to create a synthetic version of the 
gene (R5gpl60/h) using codons optimized for expression in human cells. The 
nucleotide sequence R5gpl60/h shows little homology to the 92br025 gene, but the 
protein encoded is the same. The full-length R5-tropic version of the envelope 
protein was synthesized by Operon under the name: kongene. The Xbal (18nt up- 

10 stream from ATG) to BglU (1376nt down-stream from ATG) fragment which 
contains polylinker at the 5' end, Kozak sequence and ATG was cloned into the 
Xbal to BglU sites of VRC2701 pVR1012x/s X4gpl40(del F/CL del H IS)/h 
backbone. Therefore, the gene is R5 (clade C) gpl60/h up to the BgUI site (1376nt 
from ATG) and the rest of the gene after BglU site is VRC2707 pVR1012x/s 

15 X4gpl45(del F/CL del H IS)/h. The truncated envelope polyprotein contains the 

entire SU protein and a portion of the TM protein including the fusion domain, but 
lacking the transmembrane domain. Regions important for oligomer formation may 
be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(lS) are required 
for oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 

20 503-536, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 593-620, have been deleted. The expression vector backbone is 
pVR1012x/s (VRC2000). The full-lenth X4-tropic version of the envelope protein 
from pX4gp!60/h (VHC3300) was terminated after the codon for amino acid 704. 

25 

VRC5303 

pVR1012x/s R5gpl45CladeC(Brazil) delCFI/h 

< 

Authentic Clade C unlike 5301 which is a hybrid between C and B. 

30 

VRC 5304 

pVR1012x/s R5(clade A)gpl40(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl60) from 92rw020 (RS-tropic, 
35 GenBank accessionnumber U51283) was used to create a synthetic version of the 
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gene (Clade-A gpl45delCFI) using codons optimized for expression in human cells. 
The nucleotide sequence R5gpl45delCFI shows little homology to the 92rw020 
gene, but the protein encoded is the same. The Xbal (18nt up-stream from ATG) to 
BamHl (1837 nt down-stream from ATG) fragment which contains polylinker at the 
5* end, Kozak sequence and ATG was cloned into the Xbal to BamHl sites of 
pVR!012x/s backbone. The truncated envelope polyprotein contains the entire SU 
protein and the TM domain, but lacking the the Fusion domain and Cytoplasmic 
domain. Regions important for oligomer formation may be partially functional. 
Heptad(H) l,Heptad 2 and their Interspace(IS) are required for oligomerization. The 
Fusion and Cleavage (F/CL) domains, from amino acids 486-519, have been 
deleted. The Interspace (IS) between Heptad (H) I and 2, from amino acids 576-604, 
have been deleted. The expression vector backbone is pVR1012x/s (VRC2000). 

VRC 5305 

pVR1012x/s R5(clade A)gpl45(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl60) from 92rw020 (R5-tropic, 
GenBank accessionnumber U51283) was used to create a synthetic version of the 
gene (Clade-A gp145delCFI) using codons optimized for expression in human cells. 
The nucleotide sequence R5gpl45delCFI shows little homology to the 92rw020 
gene, but the protein encoded is the same. The Xbal (I8nt up-stream from ATG) to 
BamHl (1912 nt down-stream from ATG) fragment which contains polylinker at the 
5' end, Kozak sequence and ATG was cloned into the Xbal to BamHl sites of 
pVR1012x/s backbone. The truncated envelope polyprotein contains the entire SU 

■ 

protein and the TM domain, but lacking the the Fusion domain and Cytoplasmic 
domain. Regions important for oligomer formation may be partially functional. 
Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
Fusion and Cleavage (F/CL) domains, from amino acids 486-519, have been 
deleted. The Interspace (IS) between Heptad (H) 1 and 2, from amino acids 576-604, 
have been deleted. The expression vector backbone is pVR1012x/s (VRC2000). 
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VRC 5306 

pVRl 01 2x/s R5(clade E)gpl40(del F/CL del H IS)/h 



The protein sequence of the envelope polyprotein (gpl40delCFI) from 93th966.8 
5 (R5-tropic, GenBank accessionnumber U08456) was used to create a synthetic 

version of the gene (Clade-C gpl40delCFl) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 
the gene 93th966.8 3 but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1856 nt down-stream from ATG) fragment which contains 

10 polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein 
contains the entire SU protein and the TM domain, but lacking the the Fusion 
domain and Cytoplasmic domain. Regions important for oligomer formation may be 
partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 

15 oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 497- 
530, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 588-613, have been deleted. The expression vector backbone is 
pVR1012x/s (VRC2000). 

■ 

20 VRC 5307 

p VR10 1 2x/s R5(clade E)gp 1 45(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 93th966.8 
(R5-tropic, GenBank accessionnumber U08456) was used to create a synthetic 

25 version of the gene (Clade-C gpHSdelCFQ using codons optimized for expression 

in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 93th966.8, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1928 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 

30 BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein 
contains the entire SU protein and the TM. domain, but lacking the the Fusion 
domain and Cytoplasmic domain. Regions important for oligomer formation may be 
partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 497- 

35 530, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 

-109- 



> zy ™ PCT/US01/25721 

amino acids 588-613, have been deleted. The expression vector backbone is 
pVR1012x/s (VRC2000). 



VRC 5308 

pVR1012x/s R5(clade C South African)gp140(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl40delCFl) from 97ZA012 
(R5-tropic, GenBank accessionnumber AJF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1833 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein 
contains the entire SU protein and the TM domain, but lacking the the Fusion 
domain and Cytoplasmic domain. Regions important for oligomer formation may be 
partially functional. Heptad(H) 1, Heptad 2 and their lnterspace(IS) are required for 
oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 487- 
520, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 577-605, have been deleted. The expression vector backbone is 
pVR1012x/s(VRC2000). 

VRC 5309 

pVRl 0l2x/s R5(clade C South African)gpl45(del F/CL del H IS)/h 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
(R5-tropic, GenBank accessionnumber AF286227) was used to create a synthetic 
version of the gene (Clade-C gp!45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein 
contains the entire SU protein and the TM domain, but lacking the the Fusion 
domain and Cytoplasmic domain. Regions important for oligomer formation may be 
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partially functional. Heptad(H) 1, Heptad 2 and their lnterspace(IS) are required for 
oligomerization. The Fusion and Cleavage (F/CL) domains, from amino acids 487- 
520, have been deleted. The Interspace (IS) between Heptad (H) 1 and 2, from 
amino acids 577-605, have been deleted. The expression vector backbone is 

S pVR1012x/s(VRC2000). 



VRC 5350 

pVRG1012(x/s)-gpl40(dCFl)(BraziIC)/dVi 

10 The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl40delCFl) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl40(delCFl) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 

15 (18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the VI loop(a.a.!33-148) of the SU protein. It also 
lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the Interspace (IS) 

20 between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, and 
cytoplasmic domain in the TM protein. Regions important for oligomer formation 
may be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are 
required for oligomerization. The expression vector backbone is pVR1012x/s 
(VRC2000). 

25 

VRC 5351 

pVRCl 012(x/s)-gp 1 40(dCFI)(Brazil C)/d V 1 2 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
30 tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl40delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gp140(delCFl) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
35 which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
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the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V1,V2 loops (a.a.133-191) of the SU protein. It 
also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the Interspace 
(IS) between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, and 
5 cytoplasmic domain in the TM protein. Regions important for oligomer formation 

may be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are 
required for oligomerization. The expression vector backbone is pVR1012x/s 
(VRC2000). 

10 VRC 5352 

pVRC1012(x/s)-gpl40(dCFI)(Brazil C)/dV123 

The protein sequence of the envelope polyprotein (gp!60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 

15 of the gene (Brazil-C gpl40delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl40(deICFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(I8nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains poly linker at the 5' end, Kozak sequence and ATG was cloned into 

20 the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V1,V2,V3 loops (a.a.130-191, 330-358) of the 
SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), 
the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), transmembrane 
domain, and cytoplasmic domain in the TM protein. Regions important for 

25 oligomer formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s (VRC2000). 

VRC 5353 

30 pVRC1012(x/s)-gpl40(dCFI)(Brazil C)/dV1234 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl40delCFI) using codons optimized for expression in 
• 35 human cells. The nucleotide sequence of Brazil-C gpl40(delCFI) shows little 
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homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5* end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
5 polyprotein contains deletion in the VI ,V2,V3,V4 loops (a.a. 130-1 91, 330-358, 384- 
408) of the SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 
496-529), the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), 
transmembrane domain, and cytoplasmic domain in the TM protein. Regions 
important for oligomer formation may be partially functional. Heptad(H) 1, Heptad 
10 2 and their Interspace(IS) are required for oligomerization. The expression vector 
backbone is pVR1012x/s (VRC2000). 

VRC 5354 

pVRCl 01 2(x/s)-gp 1 40(dCFI)(Brazil C)/dVl 24 

15 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl40deICFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl40(delCFI) shows little 

20 homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V1,V2,V4 loops (a.a. 130-191, 384-408) of the 

25 SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), 

the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), transmembrane 
. domain, and cytoplasmic domain in the TM protein. Regions important for 
oligomer formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The expression vector backbone is 

30 pVR1012x/s(VRC2000). 
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The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpUOdelCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl40(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 

« 

polyprotein contains deletion in the V1.V3 loops (a.a.133-148, 330-358) of the SU 
protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the 
Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, 
and cytoplasmic domain in the TM protein. Regions important for oligomer 
formation may be partially functional, Heptad(H) 1, Heptad 2 and their 
Interspaced) are required for oligomerization. The expression vector backbone is 
pVR1012x/s (VRC2000). 

VRC 5356 

pVRC1012(xys)-gpl40(dCFI)(Bra2il C)/dVi34 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpUOdelCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl40(delCFl) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V1,V3,V4 loops (a.a.130-148, 330-358, 384- 
408) of the SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 
496-529), the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), 
transmembrane domain, and cytoplasmic domain in the TM protein. Regions 
important for oligomer formation may be partially functional. Heptad(H) 1, Heptad 
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2 and their Interspace(IS) are required for oligomerization. The expression vector 
backbone is pVRl012x/s (VRC2000). 



VRC 5357 

5 pVRC1012(x/s)-gpl40(dCFI)(Brazil C)/dV14 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpUOdelCFI) using codons optimized for expression in 

10 human cells. The nucleotide sequence of Brazil-C gpl40(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(I8nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 

15 polyprotein contains deletion in the V1,V4 loops (a. a. 130-148, 384-408) of the SU 
protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the 
Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, 
and cytoplasmic domain in the TM protein. Regions important for oligomer 
formation may be partially functional. Heptad(H) I, Heptad 2 and their 

20 Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR10l2x/s (VRC2000). 

VRC 5358 

pVRC1012(x/s)-gp140(dCFI)(Brazil C)/dV2 

25 

The protein sequence of the envelope polyprotein (gp!60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl40deICFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl40(delCFl) shows little 
30 homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 

■ * 

the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V2 loop(a.a.l54-191) ofthe SU protein. Jt also 
35 lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the Interspace (IS) 
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between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, and 
cytoplasmic domain in the TM protein. Regions important for oligomer formation 
may be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are 
required for oligomerization. The expression vector backbone is pVR1012x/s 
(VRC2000). 



VRC 5359 

p VRC1 0 1 2(x/s)-gp 1 40(dCFI)(Brazil C)/dV23 

The protein sequence of the envelope polyprotein (gp!60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl40delCFI) using codons optimized for expression in 
human cells. The 'nucleotide sequence of Brazil-C gpI40(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(I8nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR!012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V2,V3 loops (a.a. 154-191, 330-358) of the SU 
protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the 
Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, 
and cytoplasmic domain in the TM protein. Regions important for oligomer 
formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s(VRC2000). 

VRC 5360 

pVRC1012(x/s)-gpl40(dCFI)(Brazil C)/dV234 



The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl40delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl 40(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
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the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V2,V3,V4 loops (a.a. 154-191, 330-358, 384- 
408) of the SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 
496-529), the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), 
5 transmembrane domain, and cytoplasmic domain in the TM protein. Regions 

important for oligomer formation may be partially functional. Heptad(H) 1, Heptad 
2 and their Interspace(IS) are required for oligomerization. The expression vector 
backbone is pVR1012x/s (VRC2000). 

10 VRC 5361 

pVRC10l2(x/s)-gpl40(dCFI)(Brazil C)/dV24 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 

15 of the gene (Brazil-C gpl40delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpi40(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 

20 the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V2,V4 loops (a.a. 154-191, 384-408) of the SU 
protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the 
Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, 
and cytoplasmic domain in the TM protein. Regions important for oligomer 

25 formation may be partially functional. Heptad(H) I, Heptad 2 and their 

Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s (VRC2000). 

VRC 5362 

30 pVRC1012(x/s)-gpl40(dCFI)(BraziI C)/dV3 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gp!40delCFJ) using codons optimized for expression in 
35 human cells. The nucleotide sequence of Brazil-C gpl40(delCFI) shows little 
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homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains poly linker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
5 polyprotein contains deletion in the V3 loop(a.a.330-358) of the SU protein. It also 

lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the Interspace (IS) 
between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, and 
cytoplasmic domain in the TM protein. Regions important for oligomer formation 
may be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are 
10 required for oligomerization. The expression vector backbone is pVR1012x/s 

(VRC2000). 

VRC 5363 

pVRCl012(x/s>gpl40(dCFO(BrazilC)/dV34 

15 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl40delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl40(delCF!) shows little 

20 homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V3,V4 loops (a.a.330-358, 384-408) of the SU 

25 protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the 

Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, 
and cytoplasmic domain in the TM protein. Regions important for oligomer 
formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The expression vector backbone is 

30 pVR1012x/s (VRC2000). 
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VRC 5364 

pVRClOl 2(x/s)-gpl40(dCFl)(Brazil C)/dV4 



PCT/US01/25721 



The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
5 tropic, GenBank accession number U52953) was used to create a synthetic version 

of the gene (Brazil-C gpUOdelCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpHO(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 

10 which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR10l2x/s backbone. The truncated envelope 
polyprotein contains deletion in the V4 loop(a.a.384-408) of the SU protein. It also 
lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the Interspace (IS) 
between Heptad (H) 1 and 2(a.a. 586-612), transmembrane domain, and 

15 cytoplasmic domain in the TM protein. Regions important for oligomer formation 

may be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(TS) are 
required for oligomerization. The expression vector backbone is pVR1012x/s 
(VRC2000). 

20 VRC 5365 

PVRC 1 01 2(x/s)-gpl 45(dCFI)(Brazil C)/d VI 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 

25 of the gene (Brazil-C gpl45delCFI) using codons optimized for expression in 

human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 

30 the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the VI loop(a.a.!33-148) of the SU protein. It also 
lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the Interspace (IS) 
between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic domain in the TM 
protein. Regions important for oligomer formation may be partially functional. 
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Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
expression vector backbone is pVR1012x/s (VRC2000). 



VRC 5366 

5 PVRC1012(x/s)-gpl45(dCFl)(Brazil C)/dV12 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45delCFI) using codons optimized for expression in 

10 human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 

15 polyprotein contains deletion in the V1,V2 loops (a.a.133-191) of the SU protein. It 
also lacks the Fusion and Cleavage (F/CL) domains( aa. 496-529), the Interspace 
(IS) between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic domain in the TM 
protein. Regions important for oligomer formation may be partially functional. 
Heptad(H) I, Heptad 2 and their Jnterspace(TS) are required for oligomerization. The 

20 expression vector backbone is pVR1012x/s (VRC2000). 

VRC 5367 

PVRC1012(x/s)-gpl45(dCFI)(Brazil C)/dV123 

25 The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 

tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl45(delCFl) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 

30 (18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the VI, V2,V3 loops (a.a. 133-191,330-358) of the 
SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), 

35 the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic 
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domain in the TM. protein. Regions important for oligomer formation may be 
partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 

5 VRC 5368 

PVRC1012(x/s)-gpl45(dCFI)(Brazil C)/dV1234 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 

10 of the gene (Brazil-C gpl45delCFI) using codons optimized for expression in 

human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 

15 the Xbal to BamHl sites of pVRl012x/s backbone. The truncated envelope 
polyprotein contains deletion in the VI, V2, V3,V4 loops (a.a. 133-191,330-358,384- 
408) of the SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 
496-529), the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and 
Cytoplasmic domain in the TM protein. Regions important for oligomer formation 

20 may be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are 
required for oligomerization. The expression vector backbone is pVR1012x/s 
(VRC2000). 

VRC 5369 

25 PVRC10l2(x/s)-gpl45(dCFI)(Brazil C)/dVl24 

The protein sequence of the envelope polyprotein (gp!60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45deICFI) using codons optimized for expression in 

30 human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 

35 polyprotein contains deletion in the VI, V2,V4 loops (a.a. 133-148,154-191,384- 
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408) of the SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 
496-529), the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and 
Cytoplasmic domain in the TM protein. Regions important for oligomer formation 
may be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are 
required for oligomerization. The expression vector backbone is pVR1012x/s 
(VRC2000). 

VRC S370 

PVRC1 012(x/s)-gpl45(dCFl)(Brazil Q/dVl 3 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gp!45(delCFl) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5* end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVRl012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V1,V3 loops (a.a. 133-148, 330-358) of the SU 
protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the 
Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic domain 
in the TM protein. Regions important for oligomer formation may be partially 
functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVRl 012x/s (VRC2000). 

VRC 5371 

PVRC1 01 2(x/s)-gpl 45(dCFI)(Brazil C)/dV 134 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
30 tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gp!45delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gp!45(deICFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
35 which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
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the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the VI, V3,V4 loops (a.a. 133-148,330-358,384- 
408) of the SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 
496-529), the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and 
5 Cytoplasmic domain in the TM protein. Regions important for oligomer formation 
may be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are 
required for oligomerization. The expression vector backbone is pVR1012x/s 
(VRC2000). 

10 VRC 5372 

PVRC10 12(x/s)-gp 145(dCFI)(Brazil C)/dV 1 4 

The protein sequence of the envelope polyprotein (gp!60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V1,V4 loops (a.a.133-148, 384-408) of the SU 
protein. It also lacks the Fusion and Cleavage (F/CL) domainsf a.a. 496-529), the 
Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic domain 
in the TM protein. Regions important for oligomer formation may be partially 
functional. Heptad(H) 1, Heptad 2 and their Interspace^) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 

VRC 5373 

PVRC 1 01 2(x/s)-gpl 45(dCFI)(Brazil C)/dV2 
30 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45delCFl) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 
35 homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
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(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V2 loop (a.a. 154-191) of the SU protein. It also 
5 lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the Interspace (IS) 

between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic domain in the TM 
protein. Regions important for oligomer formation may be partially functional. 
Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 

* 

expression vector backbone is pVR1012x/s (VRC2000). 

10 

VRC 5374 

PVRCl012(x/s)-gpl45(dCFI)(Brazil C)/dV23 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (RS- 
15 tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
20 which contains polylinker at the 5* end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V2,V3 loops (a.a.154-191,330-358) of the SU 
protein. It also Jacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the 
Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic domain 
25 in the TM protein. Regions important for oligomer formation may be partially 

functional. Heptad(H) I, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 

VRC 5375 

30 PVRC1012(x/s)-gpl45(dCFI)(Bra2il C)/dV234 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45delCFf) using codons optimized for expression in 
35 human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 
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homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR10I2x/s backbone. The truncated envelope 
5 polyprotein contains deletion in the V2, V3,V4 loops (a.a. 154-191,330-358,384- 
408) of the SU protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 
496-529), the Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and 
Cytoplasmic domain in the TM protein. Regions important for oligomer formation 
may be partially functional. Heptad(H) 1, Heptad 2 and their Interspace^) are 
10 required for oligomerizatibn. The expression vector backbone is pVR1012x/s 
(VRC2000). 

« 

VRC 5376 

PVRC1012(x/s)-gpl45(dCFI)(Brazil C)/dV24 

15 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45delCFl) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 

20 homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5 9 end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V2,V4 loops (a.a. 154-1 91, 384-408) of the SU 

25 protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the 
Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic domain 
in the TM protein. Regions important for oligomer formation may be partially 
functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 

30 

VRC 5377 

PVRC101 2(x/s)-gpl45(dCFI)(Brazil C)/dV3 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
35 tropic, GenBank accession number U52953) was used to create a synthetic version 
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of the gene (Brazil-C gpl45deICFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpL45(delCFl) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
5 which contains polylinker at the 5* end, Kozak sequence and ATG was cloned into 

the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V3 loop (a.a.330-358) of the SU protein. It also 
lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the Interspace (IS) 
between Heptad (H) I and 2(a.a. 586-612), and Cytoplasmic domain in the TM 
10 protein. Regions important for oligomer formation may be partially functional.- 

Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
expression vector backbone is pVRl 012x/s (VRC2000). 

VRC 5378 

15 PVRC1012(x/s)-gpl45(dCFI)(Brazil C)/dV34 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
tropic, GenBank accession number U52953) was used to create a synthetic version 
of the gene (Brazil-C gpl45delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl45(delCFI) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 
which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V3,V4 loops (a.a.330-358, 384-408) of the SU 
protein. It also lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the 
Interspace (IS) between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic domain 
in the TM protein. Regions important for oligomer formation may be partially 
functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVRlOl 2x/s (VRC2000). 
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VRC 5379 

PVRC 1 0 1 2(x/s)-gpl 45(dCFI)(Brazi 1 C)/dV4 

The protein sequence of the envelope polyprotein (gpl60) from 92BR025 (R5- 
5 tropic, GenBank accession number U52953) was used to create a synthetic version 

of the gene (Brazil-C gpl45delCFI) using codons optimized for expression in 
human cells. The nucleotide sequence of Brazil-C gpl45(delCFL) shows little 
homology to the gene 92BR025, but the protein encoded is the same. The Xbal 
(18nt up-stream from ATG) to BamHl (1910 nt down-stream from ATG) fragment 

10 which contains polylinker at the 5' end, Kozak sequence and ATG was cloned into 
the Xbal to BamHl sites of pVR1012x/s backbone. The truncated envelope 
polyprotein contains deletion in the V4 loop (a.a.384-408) of the SU protein. It also 
lacks the Fusion and Cleavage (F/CL) domains( a.a. 496-529), the Interspace (IS) 
between Heptad (H) 1 and 2(a.a. 586-612), and Cytoplasmic domain in the TM 

15 protein. Regions important for oligomer formation may be partially functional. 
Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
expression vector backbone is pVRl 012x/s (VRC2000). 

VRC5500 

20 pVR1012x/s R5(SA-C )gpl40(dCFI) dVl/h 

The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl40delCFI) using codons optimized for expression 

25 in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 

the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 

30 the VI loop (a.a.136-150), the Fusion and Cleavage (F/CL) domains (a.a. 487-520), 
the Interspace (IS) between Heptad (H) 1 and 2 (a.a.5 77-605), the transmembrane 
domain and the intracellular region. Regions important for oligomer formation may 
be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 
for oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 

35 
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The protein sequence of the envelope polyprotein (gp 1 40delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl40delCFl) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 

* 

from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
poly linker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V2 loops (a. a. 136- 194), the Fusion and Cleavage (F/CL) domains (a.a. 487- 
520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), the 
transmembrane domain and the intracellular region. Regions important for oligomer 
formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspaced) are required for oligomerization. The expression vector backbone is 
pVR1012x/s(VRC2000). 

VRC 5502 

pVR1012x/s R5(SA-C )gpl40(dCFI) dV123/h 

The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl40delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V2, V3 loops (a.a. 136-194, 297-325), the Fusion and Cleavage (F/CL) 
domains (a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577- 
605), the transmembrane domain and the intracellular region. Regions important for 
oligomer formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace(lS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s(VRC2000). 
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VRC5503 

p V R1 01 2x/s R5 (SA-C)gp 1 40(dCFI)d V 1 234/h 

5 The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 

(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpMOdelCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 

10 from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5* end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V2, V3,V4 loops (a.a. 136-1 94,297-325, 385-399), the Fusion and Cleavage 
(F/CL) domains (a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 

15 (a. a. 5 77-605), the transmembrane domain and the intracellular region. Regions 
important for oligomer formation may be partially functional. Heptad(H) 1, Heptad 
2 and their Interspace(IS) are required for oligomerization. The expression vector 
backbone is pVR1012x/s (VRC2000). 

20 VRC5S04 

pVR1012x/s R5(SA-C)gpl40(dCFI)dV124/h 

The protein sequence of the envelope polyprotein (gpMOdelCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 

25 version of the gene (Clade-C gpl40delCFl) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (I8nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 

30 BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 

the VI, V2, V4 loops (a.a. 136-194, 385-399), the Fusion and Cleavage (F/CL) 
domains (a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577- 
605), the transmembrane domain and the intracellular region. Regions important for 
oligomer formation may be partially functional. Heptad(H) 1, Heptad 2 and their 



-129- 



WO 02/32943 PCT/US01/25721 

Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s(VRC2000). 



VRC5505 

5 pVR1012x/s R5(SA-C)gpl40(dCFI)dV13/h 

The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl40delCFI) using codons optimized for expression 
10 in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 

■ 

the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 

15 the VI, V3 loops (a.a.136-150, 297-325), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), the 
transmembrane domain and the intracellular region. Regions important for oligomer 
formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The expression vector backbone is 

20 pVR1012x/s (VRC2000). 

» « 

VRC5506 

pVR10l2x/sR5(SA-C)gpl40(dCFI)dV134/h 

25 The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpHOdelCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40deICFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 

30 from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V3, V4 loops (a.a. 136-150, 297-325, 385-399), the Fusion and Cleavage 
(F/CL) domains (a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 

35 (a.a.577-605), the transmembrane domain and the intracellular region. Regions 
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important for oligomer formation may be partially functional. Heptad(H) 1, Heptad 
2 and their Interspace(IS) are required for oligomerization. The expression vector 
backbone is pVRl 0 1 2x/s (VJRC2000). 

J VRC5507 . 

pVR1012x/s R5(SA-C)gp140(dCFI)dVl4/h 



The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpHOdelCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V4 loops (a.a.136-150, 385-399), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a. a. 5 77-605), the 
transmembrane domain and the intracellular region. Regions important for oligomer 
formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspaced) are required for oligomerization. The expression vector backbone is 
pVR1012x/s (VRC2000). 



VRC5508 

pVR10l2x/s R5(SA-C)gpl40(dCFI)dV2/h 

25 

The protein sequence of the envelope polyprotein (gpUOdelCFI) from 97ZA012 
(R5-lropic, GenBank accession number AF286227) was. used to create a synthetic 
version of the gene (Clade-C gpUOdelCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 

30 the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the V2 loop (a.a. 156- 194), the Fusion and Cleavage (F/CL) domains (a.a. 487-520), 

35 the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), the transmembrane 
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domain and the intracellular region. Regions important for oligomer formation may 
be partially functional. Heptad(H) 1, Heptad 2 and their lnterspace(IS) are required 
for oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 



5 VRC5509 

pVRI 01 2x/s R5(SA-C)gp l40(dCFI)dV23/h 

The protein sequence of the envelope polypfotein (gpl40delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 

10 version of the gene (Clade-C gpUOdelCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5* end, Kozak sequence and ATG was cloned into the Xbal to 

15 BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the V2, V3 loops (a.a. 156-1 94, 297-325), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), the 
transmembrane domain and the intracellular region. Regions important for oligomer 
formation may be partially functional. Heptad(H) 1, Heptad 2 and their 

20 Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s(VRC2000). 

VRCS510 

pVRI 012x/s R5(SA-C)gpl40(dCFI)dV234/h 

25 

The protein sequence of the envelope polyprotein (gpl40delCFl) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpUOdelCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFl shows little homology to 

30 the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the V2, V3, V4 loops (a.a. 156-194, 297-325, 385-399), the Fusion and Cleavage 

35 (F/CL) domains (a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 
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(a.a.577-605), the transmembrane domain and the intracellular region. Regions 
important for oligomer formation may be partially functional. Heptad(H) 1, Heptad 
2 and their Interspace(IS) are required for oligomerization. The expression vector 
backbone is pVR1012x/s (VRC2000). 

5 

■ 

VRC551 1 

pVR1012x/s R5(SA-C)gpl 40(dCFI)dV24/h 

i 

The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 
10 (R5-tropic, GenBank accession number AF286227) was used to create a synthetic 

version of the gene (Clade-C gpl40delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40deICFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (I8nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
15 polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVRl 012x7s backbone. The truncated envelope polyprotein lacks 
the V2, V4 loops (a.a.156-194, 385-399), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), the 
transmembrane domain and the intracellular region. Regions important for oligomer 
20 formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspaced) are required for oligomerization. The expression vector backbone is 
pVR1012x/s (VRC2000). 

VRCS512 

25 pVR1012x/s R5(SA-C)gpl40(dCFl)dV3/h 

The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl40delCFI) using codons optimized for expression 

30 in human cells. The nucleotide sequence R5gpl40deICFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbat (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 

35 the V3 loop (a.a.297-325), the Fusion and Cleavage (F/CL) domains (a.a. 487-520), 
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the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), the transmembrane 
domain and the intracellular region. Regions important for oligomer formation may 
be partially functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required 
for oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 

VRC5S13 

p VR1 0 1 2x/s R5(S A-C)gp 1 40(dCFI)d V34/h 



The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 
10 (R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl40delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl40delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (I8nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
15 polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR!012x/s backbone. The truncated envelope polyprotein lacks 
the V3, V4 loops (a.a. 297-325, 385-399), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), the 
transmembrane domain and the intracellular region. Regions important for oligomer 
20 formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x7s (VRC2000). 

VRCS514 

25 pVR!012x/s R5(SA-C)gpl40(dCFl)dV4/h 



The protein sequence of the envelope polyprotein (gpl40delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl40delCFI) using codons optimized for expression 

30 in human cells. The nucleotide sequence R5gpl40delCFL shows little homology to 

the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 

35 the V4 loop (a.a 385-399), the Fusion and Cleavage (F/CL) domains (a.a. 487-520), 
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the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), the transmembrane 
domain and the intracellular region. Regions important for oligomer formation may 
be partially functional. Heptad(H) 1, Heptad 2 and their rnterspace(lS) are required 
for oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 

5 

VRC551S 

pVRl 0 1 2x/s R5(S A-C)gpl 45(dCFl)d V l/h 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
10 (R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
15 polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the VI loop (a.a. 136-150), the Fusion and Cleavage (F/CL) domains (a.a. 487-520), 
the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and the intracellular 
region. Regions important for oligomer formation may be partially functional. 
20 Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 

expression vector backbone is pVR1012x/s (VRC2000). 



VRC5516 

pVR1012x/s R5(SA-C)gpl45(dCFI)dV12/h 

25 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45deICFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 

30 the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5* end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V2 loops (a.a. 136- 194), the Fusion and Cleavage (F/CL) domains (a.a. 487- 

35 520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and the 
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intracellular region. Regions important for oligomer formation may be partially 
functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVRI012x/s (VRC2000). 

5 VRC5517 

pVR 1 0 1 2x/s R5(S A-C)gpl45(dCFI)dV 1 23/h 

m * 

The protein sequence of the envelope polyprotein (gpl45delCFl) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 

10 version of the gene (Clade-C gpl45de!CFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 

15 BamHl sites of pVRl012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V2, V3 loops (a.a.136-194, 297-325), the Fusion and Cleavage (F/CL) 
domains (a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577- 
605), and the intracellular region. Regions important for oligomer formation may be 
partially functional. Heptad(H) I, Heptad 2 and their Interspace^) are required for 

20 oligomerization. The expression vector backbone is p VR1 01 2x/s (VRC2000). 

VRC5518 

pVR1012x/s R5(SA-C)gpl45(dCFI)dV1234/h 

25 The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 

(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 

30 from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 

polylinker at the 5 s end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the V1,V2,V3 ,V4 loops (a.a.136-194, 297-325, 385-399), the Fusion and Cleavage 
(F/CL) domains (a.a. 487-520), the Interspace (IS), between Heptad (H) 1 and 2 

35 (a.a.577-605), and the intracellular region. Regions important for oligomer 
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formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s(VRC2000). 

VRC5519 

p VR 1 01 2x/s R5(SA-C)gp 1 45(dCFl)d V2/h 



The protein sequence of the envelope polyprotein (gp!45deICFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpi45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the V2 loop (a.a.156-194), the Fusion and Cleavage (F/CL) domains (a.a. 487-520), 
the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and the intracellular 
region. Regions important for oligomer formation may be partially functional. 
Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
expression vector backbone is pVR1012x/s (VRC2000). 

VRC5520 

pVRl 01 2x/s R5(S A-C)gp 1 45(dCFl)d V23/h 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5* end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the V2, V3 loops (a.a.156-194, 297-325), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and 
the intracellular region. Regions important for oligomer formation may be partially 
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functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 



VRC5521 

p VR1 01 2x/s R5(SA-C)gp 1 45(dCFl)dV234/h 

The protein sequence of the envelope polyprotein (gpl45delCFi) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45deICFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the V2, V3 ,V4 loops (a.a.156-194, 297-325, 385-399), the Fusion and Cleavage 
(F/CL) domains (a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 
(a.a.577-605), and the intracellular region. Regions important for oligomer 
formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s(VRC2000). 

VRC5522 

pVR1012x/s R5(SA-C)gpl45(dCFI)dV24/h 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xba[ to 
BamHl sites of pVR10l2x/s backbone. The truncated envelope polyprotein lacks 
the V2, V4 loops (a.a. 156-1 94, 385-399), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and 
the intracellular region. Regions important for oligomer formation may be partially 
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functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 



VRC5523 

5 pVR1012x/s R5(SA-C)gpl45(dCFI)dV3/h 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 

10 in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 

the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
poly linker at the 5 5 end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 

15 the V3 loop (a.a.297-325), the Fusion and Cleavage (F/CL) domains (a.a. 487-520), 

the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and the intracellular 
region. Regions important for oligomer formation may be partially functional. 
Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
expression vector backbone is pVR1012x/s (VRC2000). 

20 

VRC5524 

p VR1 0 1 2x/s R5(S A-C)gpl45(dCFI)d V34/h 



The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gp145delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the V3, V4 loops (a.a. 297-325, 385-399), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and 
the intracellular region. Regions important for oligomer formation may be partially 
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functional, Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 



VRC5525 

5 pVR1012x/s R5(SA-C)gpl45(dCFI)dV4/h 

* 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 

10 in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5 3 end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 

15 the V4 loop (a.a.385-399), the Fusion and Cleavage (F/CL) domains (a.a. 487-520), 
the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and the intracellular 
region. Regions important for oligomer formation may be partially functional. 
Heptad(H) I, Heptad 2 and their Interspace(IS) are required for oligomerization. The 
expression vector backbone is pVRl .01 2x/s (VRC2000). 

20 

VRC5526 

pVRl012x/s R5(SA-C)gpl 45(dCFI)dV13/h 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
25 (R5-tropic, GenBank accession number AF286227) was used to create a synthetic 

version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
30 polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 

4 

BamHl sites of pVRIOI2x/s backbone. The truncated envelope polyprotein lacks 
the VI, V3 loops (a.a.136-150, 297-325), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and 
the intracellular region. Regions important for oligomer formation may be partially 
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functional. Heptad(H) 1, Heptad 2 and their Interspace^) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 



VRC5S27 

p VR 1 0 1 2x/s R5(S A-C)gp 1 45(dCFI)d V 1 34/h 

The protein sequence of the envelope polyprotein (gpl45delCFJ) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45deICFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polyl inker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVRl012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V3, V4 loops (a.a.136-150, 297-325, 385-399), the Fusion and Cleavage 
(F/CL) domains (a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 
(a.a.577-605), and the intracellular region. Regions important for oligomer 
formation may be partially functional. Heptad(H) 1, Heptad 2 and their 
Interspace^) are required for oligomerization. The expression vector backbone is 
pVR1012x/s(VRC2000). 

VRC5528 

pVRl 01 2x/s R5(S A-C)gp 1 45(dCFI)dVl 24/h 

The protein sequence of the envelope polyprotein (gpl45delCFl) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5' end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V2, V4 loops (a.a.136-150, 156-194, 385-399), the Fusion and Cleavage 
(F/CL) domains (a.a, 487-520), the Interspace (IS) between Heptad (H) 1 and 2 
(a.a.5 77-605), and the intracellular region. Regions important for oligomer 

-141- 



'•^ 4J PCT/US01/25721 

formation may be partially functional. Heptad(H) I, Heptad 2 and their 
Interspace(IS) are required for oligomerization. The expression vector backbone is 
pVR1012x/s(VRC2000). 

VRC5529 

pVR1012x/s R5(SA-C)gpl45(dCFI)dV14/h 

The protein sequence of the envelope polyprotein (gpl45delCFI) from 97ZA012 
(R5-tropic, GenBank accession number AF286227) was used to create a synthetic 
version of the gene (Clade-C gpl45delCFI) using codons optimized for expression 
in human cells. The nucleotide sequence R5gpl45delCFI shows little homology to 
the gene 97ZA012, but the protein encoded is the same. The Xbal (18nt up-stream 
from ATG) to BamHl (1914 nt down-stream from ATG) fragment which contains 
polylinker at the 5* end, Kozak sequence and ATG was cloned into the Xbal to 
BamHl sites of pVR1012x/s backbone. The truncated envelope polyprotein lacks 
the VI, V4 loops (a.a.136-150, 385-399), the Fusion and Cleavage (F/CL) domains 
(a.a. 487-520), the Interspace (IS) between Heptad (H) 1 and 2 (a.a.577-605), and 
the intracellular region. Regions important for oligomer formation may be partially 
functional. Heptad(H) 1, Heptad 2 and their Interspace(IS) are required for 
oligomerization. The expression vector backbone is pVR1012x/s (VRC2000). 

Gag-Pol Plasmids 

VRC3900 

pVR1012x/sGag/h 

The protein sequence of the gag polyprotein (Pr55, amino acids 1-432) from HXB2 
(GenBank accession number K03455) was used to create a synthetic version of the 
gag gene using codons optimized for expression in human cells. The nucleotide 
sequence of the synthetic gag gene shows little homology to the HXB2 gene, but the 
protein encoded is the same. The synthetic gag gene was ligated in frame with 
sequences encoding the pol polyprotein to produce pGag(fs)Pol/h (VRC4200). The 
protein sequence of the pol polyprotein (amino acids 3-1003) from NL4-3 (GenBank 
accession number Ml 9921) was used to create a synthetic version of the pol gene 
using codons optimized for expression in human cells. To produce a gene that 
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expresses all of the gag proteins, the region encoding pol amino acids 77-1003 were 
deleted from Gag(fs)Pol/h to produce Gag/h. This construct also encodes most of the 
protease (98 amino acids) gene encoding amino acids 3-77. Gag/h is expressed from 
the pVRl 01 2x/s vector backbone. 

5 

VRC3901 

pVR1012x/sSIV Gag/h 

The protein sequence of the gag polyprotein (amino acid from. 1-550) 
10 SIVmac239(GenBanlc accession number M33262) was used to create a synthetic 
version of the gag gene using codons optimized for expression in human cells. The 
nucleotide sequence of the synthetic gag gene shows little homology to the 
SIVMac239 gene, but the protein encoded is the same. 

15 VRC4000 

pVR1012x/s Gag-Pol/h 



The protein sequence of the gag polyprotein (Pr55, amino acids 1-432) from HXB2 
(GenBank accession number K03455) was used to create a synthetic version of the 
gag gene using codons optimized for expression in human cells. The nucleotide 
sequence of the synthetic gag gene shows little homology to the HXB2 gene, but the 
protein encoded is the same. The synthetic gag gene contains all of the mature Gag 
proteins except for the last two that are normally cleaved from the carboxy-terminus 
of the gag polyprotein, pi and p6 (amino acids 433-500). The synthetic gag gene 
was ligated in frame with sequences encoding the pol polyprotein. The protein 
sequence of the pol polyprotein (amino acids 3-1003) from NL4-3 (GenBank 
accession number Ml 9921) was used to create a synthetic version of the pol gene 
(Pol/h) using codons optimized for expression in human cells. Gag-Pol/h is 
expressed from the pVR1012x/s vector backbone. 
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VRC4001 

pVR1012x/s SIVGag-Pol/h 



PCT/US01/25721 



5 



10 



20 



30 



Eukaryotic vector with humanized codons expressing the Gag-pol gene of 
SIVmac239. The Sal I-Xbal fragment of SIV gag(VRC3901) was inserted into Sal 
I-Xbal of SIV Pol(VRC4101) to create VRC4001. 



VRC4100 

pVR1012x/sPol/h 



The protein sequence of the pol polyprotein (amino acids 3-1003) from NL4-3 
(GenBank accession number Ml 992 1) was used to create a synthetic version of the 
pol gene using codons optimized for expression in human cells. To initiate 
translation at the beginning of Pol, a methionine codon was added to the 5'-end of 
15 the synthetic polymerase gene to create the Pol/h gene. Pol/h is expressed from the 

pVR10l2x/s (VRC2000) vector backbone. 



VRC4101 

pVR1012x/s SIV Pol/h 



The protein sequence of the pol polyprotein (amino acids 3-1017) from SIVmac239 
(GenBank accession number Ml 9921) was used to create a synthetic version of the 
pol gene using codons optimized for expression in human cells. To initiate 
translation at the beginning of Pol, a methionine codon was added to the 5'-end of 
25 the synthetic polymerase gene to create the Pol/h gene. The Protease (Pr) mutation 

is at pol amino acid 123 and is AAG->GGA or amino acids R->G. Reverse 
transcriptase(RT) mutation is at aa 352(GAC to CAT D(D to H) -RT) and Integrase 
mutation is at aa 788(GAC to GGC(D to A) -IN). Pol/h is expressed from the 
pVR1012x/s (VRC2000) vector backbone. 



VRC4200 

pVR1012x/sGag(fs)Pol/h 



35 



The protein sequence of the gag polyprotein (Pr55, amino acids 1-432) from HXB2 
(GenBank accession number K03455) was used to create a synthetic version of the 
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gag gene using codons optimized for expression in human cells. The nucleotide 
sequence of the synthetic gag gene shows little homology to the HXB2 gene, but the 
protein encoded is the same. The synthetic gag gene contains all of the mature Gag 
proteins except for the last two that are normally cleaved from the carboxy-terminus 
5 of the gag polyprotein, pi and p6 (amino acids 433-500). The synthetic gag gene 
was ligated in frame with sequences encoding the pol polyprotein. The protein 
sequence of the pol polyprotein (amino acids 3-1003) from NL4-3 (GenBank 
accession number M19921) was used to create a synthetic version of the pol gene 
(Pol/h) using codons optimized for expression in human cells. To create the 

10 possibility for translational frameshifting as means to express the gag-pol 
polyprotein, the synthetic coding region for the last four amino acids of the NC 
protein through the rest of gag plus an additional 3 amino acids from pol were 
replaced with the corresponding viral sequences (nucleotides 2074-2302 on the 
HXB2 genome) from NL4-3 (GenBank accession number M19921). The 

15 substitution of viral for synthetic sequences both introduces the sites required for 

frameshifting and restores the ability to express all gag proteins, including pi and 
p6. Gag(fs)Pol/h is expressed from the pVR1012x/s vector backbone. 

VRC4300 

20 pVRl 01 2 Gag-Pol(d delta RT delta IN)/h 

Eukaryotic vector with humanized codons expressing the Gag and the frame shifted 
Pol genes of HIV HXB2 subtype B with deletions in Reverse transcriptase, and 
Tntegrase regions. The protein sequence of the gag polyprotein (Pr55, amino acidsl- 

i 

25 432) from HXB2 (GenBank accession number K03455) was used to create a 
synthetic version of the gag gene using codons optimized for expression in human 
cells. The nucleotide sequence of the synthetic gag gene shows little homology to 
the HXB2 gene, but the protein encoded is the same. The synthetic gag gene 
contains all of the mature Gag proteins except for the last two that are normally 

30 cleaved from the carboxy-terminus of the gag polyprotein, pi and p6 (amino acids 

433-500). The synthetic gag gene was ligated in frame with sequences encoding the 
pol polyprotein. The protein sequence of the pol polyprotein (amino acids 3-1003) 
from NL4-3 (GenBank accession number Ml 9921) was used to create a synthetic 
version of the pol gene (Pol/h) using codons optimized for expression in human 

35 cells. To create the possibility for translational frameshifting as means to express 
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the gag-pol polyprotein, the synthetic coding region for the last four amino acids of 
the NC protein through the rest of gag plus an additional 3 amino acids from pol 
were replaced with the corresponding viral sequences (nucleotides 2074-2302 on the 
HXB2 genome) from NL4-3 (GenBank accession number Ml 9921). The 
substitution of viral for synthetic sequences both introduces the sites required for 
frameshifting and restores the ability to express all gag proteins, including pi and p6 
The Reverse Transcriptase (RT) mutation is at gag-pol amino acid 771 and is GAC- 
>CAC or amino acids D->H. The Integrase (IN) mutation is at gag-pol amino acid 
1209 and is ACT->CAT or amino acids D->A. Note: This vector is not in the 
p VR1 01 2x/s backbone. 



VRC4301 

pVR1012x/s-Gag(FS)-Pol-delta RT IN-IRES-R5 gp!57-Nef 

For the Gag(FS)Pol delta RT delta IN/h portion, VRC4302 pVR1012x/s 
Gag(fs)Pol(deltaPR delta/h was used for the protein sequence of the gag polyprotei n 
(Pr55, amino acids 1-432) from HXB2 (GenBank accession number K03455) was 
used to create a synthetic version of the gag gene using codons optimized for 
expression in human cells. The nucleotide sequence of the synthetic gag gene shows 
little homology to the HXB2 gene, but the protein encoded is the same. The 
synthetic gag gene contains all of the mature Gag proteins except for the last two 
that are normally cleaved from the carboxy- terminus of the gag polyprotein, pi and 
p6 (amino acids 433-500). The synthetic gag gene was ligated in frame with 
sequences encoding the pol polyprotein. The protein sequence of the pol polyprotein 
(amino acids 3-1003) from NL4-3 (GenBank accession number M19921) was used 
to create a synthetic version of the pol gene (Pol/h) using codons optimized for 
expression in human cells. To create the possibility for translational frameshifting as 
means to express the gag-pol polyprotein, the synthetic coding region for the last 
four amino acids of the NC protein through the rest of gag plus an additional 3 
amino acids from pol were replaced with the corresponding viral sequences 
(nucleotides 2074-2302 on the HXB2 genome) from NL4-3 (GenBank accession 
number M19921). The substitution of viral for synthetic sequences both introduces 
the sites required for frameshifting and restores the ability to express all gag 
proteins, including pi and p6. Gag(rs)Pol/h is expressed from the pVRl012x/s 
vector backbone. The Reverse Transcriptase (RT) mutation is at gag-pol amino acid 

-146- 



WO 02/32943 PCTAJS01/25721 

771 and is GAC->CAC or amino acids D->H, The Integrase (rN) mutation is at gag- 
pol amino acid 1209 and is ACT->CAT or amino acids D->A. This gene has been 
fused to an Internal Ribosoma] Entry Site (IRES) and then fused to the R5gpl57- 
Nef from VRC2200 pVR1012x/s R5gpl57- NefDMHCDCD4/h in which the protein 
5 sequence of the envelope polyprotein (gpl60) from HXB2 (X4-tropic, GenBank 

accession number K03455) was used to create a synthetic version of the gene 
(X4gpl60/h) using codons optimized for expression in human cells. The nucleotide 
sequence X4gpl60/h shows little homology to the HXB2 gene, but the protein 
encoded is the same with the following amino acid substitutions: F53L, N94D, 

10 K192S, I215N, A224T, A346D, P470L, T723I, and S745T. To produce an R5-tropic 

version of the envelope protein (R5gpl60/h), the region encoding HTV-1 envelope 
polyprotein amino acids 275 to 361 from X4gpl60/h (VRC3300) were replaced with 
the corresponding region from the BaL strain of HIV-1 (GeneBank accession 
number M68893, again using human preferred codons). The envelope-Nef fusion 

15 protein expressed from pR5gpl.57-Nef/h contains the first 820 amino acids from the 

HIV envelope glycoprotein (gpl57) fused to the entire mutant Nef protein. The gene 
for gpl57 was ligated in frame with the full-length mutant Nef gene from 
pNefDMHCDCD4/h (VRC3600) to produce pR5gpl57- NefDMHCDCD4/h. The 
protein sequence of the Nef protein from HIV-1 PV22 (GenBank accession number 

20 K02083) was used to create a synthetic version of the Nef gene (Ne#h) using codons 
optimized for expression in human cells. To disrupt the ability of Nef to limit both 
MHC class I and CD4 expression, point mutations were introduced into the Nef 
gene from pNef/h (VRC3500). The resulting amino acids substitutions in 
pNefDMHCDCD4/h are: P69A, P72A, P75A, P78A, D1.74A and D175A. R5gpl57- 

25 NefDMHCDCD4/h is expressed from the pVR1012x/s (VRC2000) vector 

backbone. 

VRC4302 

pVRl 01 2 Gag(delFS)Pol(delta PR delta RT delta lN)/h 

30 

The protein sequence of the gag polyprotein (Pr55, amino acids 1-432) from HXB2 
(GenBank accession number K03455) was used to create a synthetic version of the 
gag gene using codons optimized for expression in human cells. The nucleotide 
sequence of the synthetic gag gene shows little homology to the HXB2 gene, but the 
35 protein encoded is the same. The synthetic gag gene contains all of the mature Gag 
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proteins except for the last two that are normally cleaved from the carboxy-terminus 
of the gag polyprotein, pi and p6 (amino acids 433-500). The synthetic gag gene 
was ligated in frame with sequences encoding the pol polyprotein. The protein 
sequence of the pol polyprotein (amino acids 3-1003) from NL4-3 (GenBank 
5 accession number M19921) was used to create a synthetic version of the pol gene 
(Pol/h) using codons optimized for expression in human cells. To create the 
possibility for translational frameshifting as means to express the gag-pol 
polyprotein, the synthetic coding region for the last four amino acids of the NC 
protein through the rest of gag plus an additional 3 amino acids from pol were 

10 replaced with the corresponding viral sequences (nucleotides 2074-2302 on the 
HXB2 genome) from NL4-3 (GenBank accession number M19921). The 
substitution of viral for synthetic sequences both introduces the sites required for 
frameshifting and restores the ability to express all gag proteins, including pi and 
p6. The deleted Frame Shifted (delFS) has 5 T nucleotides deleted between the Gag 

15 and Pol sequences. Gag(fs)Pol/h is expressed from the pVR1012x/s vector 
backbone! The Protease (PR) mutation is at gag-pol amino acid 553 and is AGG- 
>GGC or amino acids R->G. The Reverse Transcriptase (RT) mutation is at gag-pol 
amino acid 771 and is GAC->CAC or amino acids D->H. The Integrase (IN) 
mutation is at gag- pol amino acid 1209 and is ACT->CAT or amino acids D->A. 

20 Note: This vector is not in the pVR1012x/s backbone. 

VRC4303 

pVR1012 SIV Gag(delFS)Pol(delta PR delta RT delta IN)/h 

25 Eukaryotic vector with humanized codons expressing the Gag and the frame-shift- 
deleted Pol genes SIVmac239 with deletions in the Protease, Reverse transcriptase, 
and Integrase regions. The 5 of Ts from 3188 to 3192 of SIV gag-pol(VRC4001) 
were deleted to create VRC4303. 



30 VRC4304 

pVRl 01 2 Gag (delFS)Pol delta PR delta RT delta IN/h 

■ 

The protein sequence of the gag polyprotein (Pr55, amino acids 1-432) from HIV-1 
C clade(GenBank accession number U52953) was used lo create a synthetic version 
35 of the gag gene using codons optimized for expression in human cells. The 
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nucleotide sequence of the synthetic gag gene shows little homology to the HIV-1 
gene, but the protein encoded is the same. The synthetic gag gene contains all of the 
mature Gag proteins except for the last two that are normally cleaved from the 
carboxy-terminus of the gag polyprotein, pi and p6 (amino acids 433-500). The 
5 synthetic gag gene was ligated in frame with sequences encoding the pol 

polyprotein. The protein sequence of the pol polyprotein (amino acids 3-1003) from 
NL4-3 (GenBank accession number Ml 9921) was used to create a synthetic version 
of the pol gene (Pol/h) using codons optimized for expression in human cells. To 
create the possibility for translational frameshifting as means to express the gag-pol 

10 polyprotein, the synthetic coding region for the last four amino acids of the NC 

protein through the rest of gag plus an additional 3 amino acids from pol were 
replaced with the corresponding viral sequences (nucleotides 2074-2302 on the 
HXB2 genome) from NL4-3 (GenBank accession number M19921). The 
substitution of viral for synthetic sequences both introduces the sites required for 

15 frameshifting and restores the ability to express all gag proteins, including pi and 

p6. The deleted Frame Shifted (delFS) has 5 T nucleotides deleted between the Gag 
and Pol sequences. Gag(fs)Pol/h is expressed from the pVR1012x/s vector 
backbone. The Protease (PR) mutation is at gag-pol amino acid 553 and is AGG- 
>GGC or amino acids R->G. The Reverse Transcriptase (RT) mutation is at gag-pol 

20 amino acid 771 and is GAC->CAC or amino acids D->H. The Integrase (IN) 

mutation is at gag- pol amino acid 1209 and is ACT->CAT or amino acids D~>A. 
Note: This vector is not in the pVRl012x/s backbone. 

VRC43QS 

25 pVRl 012 Gag-A(deIFS)Pol(delta PR delta RT delta JN)/h 

Eukaryotic vector with humanized codons expressing the Gag and the frame shifted 
Pol genes of HIV HIV-1A clade with deletions in the Protease, Reverse 
transcriptase, and Integrase regions. VRC4305 pVR1012 Gag(-AdelFS)Pol(delta PR 

30 delta RT delta IN)/h The protein sequence of the gag polyprotein (Pr55, amino acids 

1-432) from HIV-1 A clade (GenBank accession number AF004885) was used to 
create a synthetic version of the gag gene using codons optimized for expression in 
human cells. The nucleotide sequence of the synthetic gag gene shows little 
homology to the HIV-1 gene, but the protein encoded is the same. The synthetic gag 

35 gene contains all of the mature Gag proteins except for the last two that are normally 
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cleaved from the carboxy-terminus of the gag polyprotein, pi and p6 (amino acids 
433-500). The synthetic gag gene was ligated in frame with sequences encoding the 
pol polyprotein. The protein sequence of the pol polyprotein (amino acids 3-1003) 
from NL4-3 (GenBank accession number Ml 9921) was used to create a synthetic 
version of the pol gene (Pol/h) using codons optimized for expression in human 
cells. To create the possibility for translational frameshifting as means to express the 
gag-pol polyprotein, the synthetic coding region for the last four amino acids of the 
NC protein through the rest of gag plus an additional 3 amino acids from pol were 
replaced with the corresponding viral sequences (nucleotides 2074-2302 on the 
HXB2 genome) from NL4-3 (GenBank accession number M19921). The 
substitution of viral for synthetic sequences both introduces the sites required for 
frameshifting and restores the ability to express all gag proteins, including pi and 
p6. The deleted Frame Shifted (delFS) has 5 T nucleotides deleted between the Gag 
and Pol sequences. Gag(fs)Pol/h is expressed from the pVR1012x/s vector 
backbone. The Protease (PR) mutation is at gag-pol amino acid 553 and is AGG- 
>GGC or amino acids R->G. The Reverse Transcriptase (RT) mutation is at gag-pol 
amino acid 771 and is GAC->CAC or amino acids D->H. The Tntegrase (IN) 
mutation is at gag- pol amino acid 1209 and is ACT->CAT or amino acids D->A. 
'Note: This vector is not in the pVRl012x/s backbone. 

PVRC4306 

pVRl 012 Gag(delFS)Pol delta PR delta RT delta IN/Neffh 

The protein sequence of the Gag polyprotein (Pr55, amino acids 1-432) from HXB2 
(GenBank accession number K03455) was used to create a synthetic version of the 
Gag gene using codons optimized for expression in human cells. The nucleotide 
sequence of the synthetic Gag gene shows little homology to the HXB2 gene, but the 
protein encoded is the same. The synthetic Gag gene contains all of the mature Gag 
proteins except for the last two that are normally cleaved from the carboxy-terminus 
of the Gag polyprotein, pi and p6 (amino acids 433-500). The synthetic Gag gene 
was ligated in frame with sequences encoding the Pol polyprotein. The protein 
sequence of the Pol polyprotein (amino acids 3-1003) from NL4-3 (GenBank 
accession number Ml 9921) was used to create a synthetic version of the Pol gene 
(Pol/h) using codons optimized for expression in human cells. To create the 
possibility for translational frameshifting as means to express the Gag-Pol 
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polyprotein, the synthetic coding region for the last four amino acids of the NC 
protein through the rest of Gag plus an additional 3 amino acids from Pol were 
replaced with the corresponding viral sequences (nucleotides 2074-2302 on the 
HXB2 genome) from NL4-3 (GenBank accession number M19921). The 
substitution of viral for synthetic sequences both introduces the sites required for 
frameshifting and restores the ability to express all Gag proteins, including pi and 
p6. The deleted Frame Shifted (delFS) has 5 T nucleotides deleted between the Gag 
and Fol sequences. The Protease (PR) mutation is at Gag-Pol amino acid 553 and is 
AGG->GGC or amino acids R->G. The Reverse Transcriptase (RT) mutation is at 
Gag-Pol amino acid 771 and is GAC->CAC or amino acids D->H. The Integrase 
(IN) mutation is at Gag- Pol amino acid 1209 and is ACT->CAT or amino acids D- 
>A. The Nef/h gene was fused to downstream of Pol gene of Gag(delFS)Pol(delta 
PR delta RT delta IN). No loss or extra-amino acid was created by the fusion 
between Nef and Pol. The ATG of Nef was preserved.The protein sequence of the 
Nef protein from HTV-1 PV22 (GenBank accession number K02083) was used to 
create a synthetic version of the Nef gene (Netfh) using codons optimized for 
expression in human cells. The nucleotide sequence Nef/h shows little homology to 
the viral gene, but the protein encoded is the same. Note: This vector is not in the 
pVR1012x/s backbone. 



VRC 4308 

VRC4302-myr 

PVR1012 Gag (delFS) Pol deltaPR deltaRT deltaIN deltaMyr/h 

The myristoylation site was deleted from pVRC4302. The protein sequence of the 
gag polyprotein (Pr55, amino acids 1-432) from HXB2 (GenBank accession number 
K03455) was used to create a synthetic version of the gag gene using codons 
optimized for expression in human cells. The nucleotide sequence of the synthetic 
gag gene shows little homology to the HXB2 gene, but the protein encoded is the 
same. The synthetic gag gene contains all of the mature Gag proteins except for the 
last two that are normally cleaved from the carboxy-terminus of the gag polyprotein, 
pi and p6 (amino acids 433-500). The synthetic gag gene was ligated in frame with 
sequences encoding the pol polyprotein. The protein sequence of the pol 
polyprotein (amino acids 3-1003) from NL4-3 (GenBank accession number 
Ml 9921) was used to create a synthetic version of the pol gene (Pol/h) using codons 
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optimized for expression in human cells. To create the possibility for translation^ 
frameshi fling as means to express the gag-pol polyprotein, the synthetic coding 
region for the last four amino acids of the NC protein through the rest of gag plus an 
additional 3 amino acids from pol were replaced with the corresponding viral 
5 sequences (nucleotides 2074-2302 on the HXB2 genome) from NL4-3 (GenBank 

accession number Ml 9921). The substitution of viral for synthetic sequences both 
introduces the sites required for frameshifting and restores the ability to express all 
gag proteins, including pi and p6. The deleted Frame Shifted (delFS) has 5 T 
nucleotides deleted between the Gag and Pol sequences. Gag(fs)Pol/h is expressed 

10 from the pVRl012x/s vector backbone. The Protease (PR) mutation is at gag-pol 
amino acid 553 and is AGG->GGC or amino acids R->G. The Reverse 
Transcriptase (RT) mutation is at gag-pol amino acid 771 and is GAC->CAC or 
amino acids D->H. The Integrase (IN) mutation is at gag-pol amino acid 1209 and 
is ACT->CAT or amino acids D->A. The myristylation site was deleted. Note:This 

1 5 vector is not in the pVR10l2x/s backbone. 

VRC 4309 

pVR1012 Gag (delFS) Pol deltaPR deltaRT deltaTN delta Myr/Nef/h 

20 The myristoylation site was deleted from pVRC4306> The protein sequence of the 

gag polyprotein (Pr55, amino acids 1-432) from HXB2 (GenBank accession number 
K03455) was used to create a synthetic version of the gag gene using codons 
optimized for expression in human cells. The nucleotide sequence of the synthetic 
gag gene shows little homology to the HXB2 gene, but the protein encoded is the 

25 same. The synthetic gag gene contains all of the mature Gag proteins except for the 
last two that are normally cleaved from the carboxy-terminus of the gag polyprotein, 
pi and p6 (amino acids 433-500). The synthetic gag gene was ligated in frame with 
sequences encoding the pol polyprotein. The protein sequence of the pol 
polyprotein (amino acids 3-1003) from NL4-3 (GenBank accession number 

30 M19921) was used to create a synthetic version of the pol gene (Pol/h) using codons 

optimized for expression in human cells. To create the possibility for translational 
frameshifting as means to express the gag-pol polyprotein, the synthetic coding 
region for the last four amino acids of the NC protein through the rest of gag plus an 
additional 3 amino acids from pol were replaced with the corresponding viral 

35 sequences (nucleotides 2074-2302 on the HXB2 genome) from NL4-3 (GenBank 
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accession number Ml 992 1). The substitution of viral for synthetic sequences both 
introduces the sites required for frameshifting and restores the ability to express all 
gag proteins, including pi and p6. The deleted Frame Shifted (delFS) has 5 T 
nucleotides deleted between the Gag and Pol sequences. Gag(fs)Pol/h is expressed 
5 from the pVR1012x/s vector backbone. The Protease (PR) mutation is at gag-pol 

amino acid 553 and is AGG->GGC or amino acids R->G. The Reverse 
Transcriptase (RT) mutation is at gag-pol amino acid 771 and is GAC->CAC or 
amino acids D->H. The Integrase (IN) mutation is at gag-pol amino acid 1209 and 
is ACT->CAT or amino acids D->A. The stop codon TAG was removed and 
10 synthetic B clade Nef ( Genbank access number ) was fusion to the 3' end of pol by 
PCR. Note: This vector is not in the pVR1012x/s backbone. 

■ 

VRC 4310 

pVR1012 Nef Gag (del fs) (del Myr) Pol (delta PR delta RT delta IN)/h 

15 

The protein sequence of the gag polyprotein (Pr55, amino acids 1-432) from HXB2 
(GenBank accession number K03455) was used to create a synthetic version of the 
gag gene using codons optimized for expression in human cells. The nucleotide 
sequence of the synthetic gag gene shows little homology to the HXB2 gene, but the 

20 protein encoded is the same. The synthetic gag gene contains all of the mature Gag 
proteins except for the last two that are normally cleaved from the carboxy-terminus 
of the gag polyprotein, pi and p6 (amino acids 433-500). The synthetic gag gene 
was ligated in frame with sequences encoding the pol polyprotein. The protein 
sequence of the pol polyprotein (amino acids 3-1003) from NL4-3 (GenBank 

25 accession number M19921) was used to create a synthetic version of the pol gene 
(Pol/h) using codons optimized for expression in human cells. To create the 
possibility for translational frameshifting as means to express the gag-pol 
polyprotein, the synthetic coding region for the last four amino acids of the NC 
protein through the rest of gag plus an additional 3 amino acids from pol were 

30 replaced with the corresponding viral sequences (nucleotides 2074-2302 on the 
HXB2 genome) from NL4-3 (GenBank accession number Ml 9921). The 
substitution of viral for synthetic sequences both introduces the sites required for 
frameshifting and restores the ability to express all gag proteins, including pi and 
p6. The deleted Frame Shifted (delFS) has 5 T nucleotides deleted between the Gag 

35 and Pol sequences. Gag(fs)Pol/h is expressed from the pVR10l2x/s vector 
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backbone. The Protease (PR) mutation is at gag-pol amino acid 553 and is AGG- 
>GGC or amino acids R->G. The: Reverse Transcriptase (RT) mutation is at gag-pol 
amino acid 771 and is GAC->CAC or amino acids D->H. The Integrase (W) 
mutation is at gag-pol amino acid 1209 and is ACT->CAT or amino acids D->A. 
5 The stop codon TAG of synthetic B clade Nef ( Genbank access number ) gene was 

removed and was fused to the 5' end of gene gene by PCR.Note: This vector is not 
in the p VR 1 0 1 2x/s backbone. 

VRC4311 

10 pVR1012Gag-C(delFS)Pol(delta PR delta RT delta IN) Nef/h 
VRC4304 + Clade C Nef 

* 

The protein sequence of the gag polyprotein (Pr55, amino acids 1-432) from HIV-i 
C clade(GenBank accession number U52953) was used to create a synthetic version 

15 of the gag gene using codons optimized for expression in human cells. The 

nucleotide sequence of the synthetic gag gene shows little homology to the HIV-l 
gene, but the protein encoded is the same. The synthetic gag gene contains all of the 
mature Gag proteins except for the last two that are normally cleaved from the 
carboxy-terminus of the gag polyprotein, pi and p6 (amino acids 433-500). The 

20 synthetic gag gene was ligated in frame with sequences encoding the pol 
polyprotein. The protein sequence of the pol polyprotein (amino acids 3-1003) from 
NL4-3 (GenBank accession number Ml 9921) was used to create a synthetic version 
of the pol gene (Pol/h) using codons optimized for expression in human cells. To 
create the possibility for translational frameshifting as means to express the gag-pol 

25 polyprotein, the synthetic coding region for the last four amino acids of the NC 

protein through the rest of gag plus an additional 3 amino acids from pol were 
replaced with the corresponding viral sequences (nucleotides 2074-2302 on the 
HXB2 genome) from NL4-3 (GenBank accession number Ml 9921). The 
substitution of viral for synthetic sequences both introduces the sites required for 

30 frameshifting and restores the ability to express all gag proteins, including pi and 
p6. The deleted Frame Shifted (delFS) has 5 T nucleotides deleted between the Gag 
and Pol sequences. Gag(fs)Pol/h is expressed from the pVR1012x/s vector 
backbone. The Protease (PR) mutation is at gag-pol amino acid 553 and is AGG- 
>GGC or amino acids R~>G. The Reverse Transcriptase (RT) mutation is at gag-pol 

35 amino acid 771 and is GAC->CAC or amino acids D->H. The Integrase (IN) 
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mutation is at gag-pol amino acid 1209 and is ACT->CAT or amino acids D->A. 
The stop codon TAG was removed and synthetic C clade Nef ( Genbank accession 
number:U52953) was fusion to the 3' end of pol by PCR. Note:This vector is not in 
the pVR1012x/s backbone. 

5 

VRC 431 2 

Gag (del fs) (del Myr) Nef Pol APRARTAIN/h 
VRC-Myr-gag-Dnef-Dpol 

(Eukaryotic vector with humanized codons expressing the Gag, truncated Nef, and 
10 truncated Pol proteins of HIV subtype B .ns.) 

This construct was derived from VRC4302. The protein sequence of the gag 
polyprotein (Pr55, amino acids 1-432) from HXB2 (GenBank accession number 
K03455) was used to create a synthetic version of the gag gene using codons 

15 optimized for expression in human cells. The nucleotide sequence of the synthetic 

gag gene shows little homology to the HXB2 gene, but the protein encoded is the 
same. The synthetic gag gene contains all of the mature Gag proteins except for the 
last two that are normally cleaved from the carboxy-terminus of the gag polyprotein, 
pi and p6 (amino acids 433-500). The synthetic gag gene was ligated in frame with 

20 sequences encoding synthetic nef gene that 51 aa were deleted from 5'. 77 aa were 

deleted from 5' of pol polyprotein, and ligated with 3' of nef in which tag stop 
codon was deleted . The protein sequence of the pol polyprotein (amino acids 3-78 - 
1003) from NL4-3 (GenBank accession number M19921) was used to create a 
synthetic version of the pol gene (Pol/h) using codons optimized for expression in 

25 human cells. To create the possibility for translational frameshifting as means to 
express the gag-pol polyprolein, the synthetic coding region for the last four amino 
acids of the NC protein through the rest of gag plus an additional 3 amino acids 
from pol were replaced with the corresponding viral sequences (nucleotides 2074- 
2302 on the HXB2 genome) from NL4-3 (GenBank accession number M19921). 

30 The substitution of viral for synthetic sequences both introduces the sites required 
for frameshifling and restores the ability to express all gag proteins, including pi 
and p6. The deleted Frame Shifted (delFS) has 5 T nucleotides deleted between the 
Gag and Pol sequences. Gag(rs)Pol/h is expressed from the pVR1012x/s vector 
backbone. The Protease (PR) mutation is at gag-pol amino acid 553 and is AGG- 
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>GGC or amino acids R->G. The Reverse Transcriptase (RT) mutation is at gag-pol 
amino acid 771 and is GAC->CAC or amino acids D->H. The Integrase (IN) 
mutation is at gag-pol amino acid 1209 and is ACT->CAT or amino acids D->A. 
Note: This vector is not in the pVR1012x/s backbone. 

< 

5 

VRC 4313 

P VR1012 Gag Clade A (del fs)Pol(A PR A RT A IN)/h 
VRC4305+ Clade A Nef 

■ 

10 The protein sequence of the gag polyprotein (Pr55, amino acids 1-432) from H1V-1 
A clade (GenBank accession number AF004885) was used to create a synthetic 
version of the gag gene using codons optimized for expression in human cells. The 
nucleotide sequence of the synthetic gag gene shows little homology to the HIV-1 
gene, but the protein encoded is the same. The synthetic gag gene contains all of the 

15 mature Gag proteins except for the last two that are normally cleaved from the 
carboxy-terminus of the gag polyprotein, pi and p6 (amino acids 433-500). The 
synthetic gag gene was ligated in frame with sequences encoding the pol 
polyprotein. The protein sequence of the pol polyprotein (amino acids 3-1003) from 
NL4-3 (GenBank accession number Ml 9921) was used to create a synthetic version 

20 of the pol gene (Pol/h) using codons optimized for expression in human cells. To 
create the possibility for translational frameshifting as means to express the gag-pol 
polyprotein, the synthetic coding region for the last four amino acids of the NC 
protein through the rest of gag plus an additional 3 amino acids from pol were 
replaced with the corresponding viral sequences (nucleotides 2074-2302 on the 

25 HXB2 genome) from NL4-3 (GenBank accession number Ml 9921). The 
substitution of viral for synthetic sequences both introduces the sites required for 
frameshifting and restores the ability to express all gag proteins, including pi and 
p6. The deleted Frame Shifted (delFS) has 5 T nucleotides deleted between the Gag 
and Pol sequences. Gag(fs)Pol/h is expressed from the pVR1012x/s vector 

30 backbone. The Protease (PR) mutation is at gag-pol amino acid 553 and is AGG- 
>GGC or amino acids R->G. The Reverse Transcriptase (RT) mutation is at gag-pol 
amino acid 771 and is GAC->CAC or amino acids D->H. The Integrase (IN) 
mutation is at gag-pol amino acid 1209 and is ACT->CAT or amino acids D->A. 
The stop codon TAG was removed and synthetic A clade Nef ( Genbank accession 
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number: AF069670) was fused to the 3' end of pol by PCR 

* 

in the pVR1012x/s backbone. 



PCT/US01/25721 
Note: This vector is not 



Examples 

5 Env 
Immunogens 

Plasmids expressing the CXCR4-tropic HIV-1 HXB2 Env were made 
synthetically with sequences designed to disrupt viral RNA structures that limit 
protein expression by using codons typically found in human cells. Briefly, the 

10 synthetic Env gene of HXB2 (GenBank accession number K03455) was generated 
in three fragments by assembling the overlapping synthetic oligonucleotides using 
. PCR amplification. To produce a CCR5-tropic version of the HIV-1 envelope, the 
region encoding amino acids 275 to 361 of HXB2 (CXCR4-tropic) gpl60 was 
replaced with CCR5-tropic HIV-1 BaL sequence (GenBank accession number 

15 M68893), which includes the V3 loop. Glycosylation mutants were generated by 

site-directed mutagenesis to replace asparagine with glutamic acid residues at 
seventeen conserved glycosylation sites between amino acids 88 and 448. To 
express truncation mutant Env proteins, stop codons were introduced after positions 
752, 704, 680 or 592 to produce gpl50, gpl45, gpl40, or gpl28, respectively. The 

20 Env protein was further changed by deleting amino acids 503 to 537 and 593 to 619, 
which removes the cleavage site sequence, the fusion domain, and a part of the 
spacer between the two heptad repeats. The structures of the synthetic HIV 
envelope genes are shown (Fig. I). The cDNAs were cloned in the expression vector 
pNGVL or pVR1012 under the control of the cytomegalovirus immediate-early 

25 enhancer, promoter, and first intron. The protein sequence was identical to HXB2 

Env except for the following amino acid substitutions: F53L, N94D, K192S, 121 5N, 
A224T, A346D, P470L, T723I, and S745T. 
Expression of Envelope Proteins in Transfected Cells 

293 cells (10 6 ) were plated in 60mm dishes. Cells were transfected on the 

30 following day with 2 of plasmid using calcium phosphate (Graham FL, van der 

Eb AJ, 1973, Virology, 52:456-67). Cells were harvested 48 hours after transfection 
and lysed in buffer containing 50mM HEPES pH 7.0, 250mM NaCl and 0.5% 
NP40. The protein concentration in the lysates was determined using the Bradford 
reagent (Bio-Rad). Proteins (25 ng) in lysates were separated by 7.5% SDS-PAGE 
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and transferred to Immobilon-P membrane (Millipore, Bedford, MA). Env was 

detected by immunoprecipitation followed by Western blotting using polyclonal 

antibody against gpl60 (Intracel, Rockville, MD). 

Cell Surface Expression of Envelope by FAGS Analysis 
5 293 cells were harvested 48 hours after transfection and washed twice with 

phosphate-buffered saline (PBS) containing 1% bovine serum albumin and 

incubated for 30 minutes on ice with polyclonal immunoglobulin from an HIV-1 

infected patient. The secondary antibody against human IgG conjugated with FITC 

(Jackson Immuno Research) was added, incubated for 30 minutes on ice, washed 3 
10 times with PBS, and analyzed by flow cytometry (FACScan). The median 

fluorescent intensity values were derived using Cell Quant software. 

DNA Injection in Mice 

Six week old, female BALB/c mice were injected intramuscularly with 100 

jig of purified plasmid DNA suspended in 200 )il of normal saline. For each 
15 plasmid DNA, a group of 4 mice was injected three times at intervals of two weeks. 

The mice were bled two weeks after the last injection, sera collected and stored at 

4°C. 

Quantitation of the Antibody Response 

Immunoprecipitation and Western blotting was used to detect the antibodies 
20 that bind to native envelope proteins. Sera from immunized mice were used to 

immunoprecipitate gpl60 from cell lysates of gpl60-transfected 293 cells. Indicated 
dilutions were used to immunoprecipitate gpl60 from the cell lysate (400 fig). 
Immunocomplexes were separated by 7.5% SDS-PAGE and analyzed by 
immunoblotting using the polyclonal antibody against gp!60. 
25 Analysis of CTL Response 

Spleens were removed aseptically and gently homogenized to a single cell 
suspension, washed, and resuspended to a final concentration of 5x1 0 7 cells/ml. 
Cells were incubated for 7 days in presence of IL-2 (10 U/ml) and either irradiated 
peptide-pulsed splenocytes from naive mice or an irradiated stable cell line 
30 expressing the full length gpl60 BC-enWrev, (Irwin MJ et al., 1994, J Virol, 

68:5036-44). Three types of target cells were used: peptide-pulsed P815 cells 
(ATCC T1B64), BC10ME cells stably expressing gpl60, and BC10ME cells 
(Collins JL, Patek PQ, Cohn M. Tumorigenicity and lysis by natural killers. J Exp 
Med, 1981; 153:89-106) pulsed with peptides derived from gp!60 sequence. Target 
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cells were labeled with 5I Cr for 90 minutes and washed three times with RPMl-1640 
with 10% FBS, 2mM glutamine, 5xlO" 5 M P-mercaptoethanol and fungisone, 
250units/ml, and resuspended in this media. Cytolytic activity was determined in 
triplicate samples using all different target cell dilutions in a 5-hour 5l Cr-release 
5 assay (Ohno T et al., 1997, Gene Ther, 4:361-6). 

Gag and Pol 

Development of Synthetic HIV-1 Gag-Pol Expression Vectors 

The protein sequences of Gag (amino acids 1-432) from HXB2 (GenBank 
accession number K03455) and Pol (amino acids 3-1003) from NL4-3 (GenBank 

10 accession number M19921) were reverse translated by the GCG Package (Genetic 
Computer Group, Inc., Madison, WI) using codons expected for human cells. A 226- 
bp fragment spanning the frame shift site and the overlapping region of the two 
reading frames from NL4-3 were retained to allow expression of Gag and Gag-Pol 
precursor polyproteins in the same construct. 86 oligonucleotides covering 4325 

15 DNA base pairs with 5' Sail and 3' EcoRI sites were purchased from GIBCO Life 
Technologies. Each of the oligonucleotides was 75 base-pairs with 25 nt of overlap. 
The codon optimized Gag-Pol gene (hGag-Pol) was assembled by PCR with Pwo 
(Boehringer Mannheim) and Turbo Pfu (Stratagene) high fidelity DNA polymerase. 
The PCR conditions were optimized with a PCR optimization kit (Stratagene) on a 

20 gradient Robocycler (Stratagene). Full-length synthetic Gag-Pol gene was cloned 

into the Sal I and blunted Bgl II site of the mammalian expression vector pNGVL-3 
(Xu, Ling et al, 1998, Nature Medicine, 4:37-42) and confirmed by DNA 
sequencing. Three additional constructs were derived from the hGag-Pol gene. 5 
Thymidines (Ts) in the frame shift site (FS) of the hGag-Pol gene were deleted 

25 (AFS) and the protease was inactivated by replacing AGG in protease to GGC 
(R42G) to create hGag-PolAFSAPr (Hung, M, 1998, J Virol, 72:4819-4824; Loeb, 
DD et al., 1989, Nature 340:397-400). 432 amino acids of the NH2- terminal of 
hGag-Pol gene were deleted and an ATG start codon was added to create the hPol 
gene. 925 amino acids of the COOH-terminal hGag-Pol were deleted to create the 

30 hGag gene. hGagPolAFSAPr, hPol and hGag genes were expressed in the pNGVL-3 

plasmid, derived by insertion of a polylinker into pVR1012. The plasmid expressing 
viral Gag-Pol, pCMVA8.2 was a kind gift (Naldini, L et al., 1996, Science, 272:263- 
267). 
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Transient Transfection and Analysis of Expression 

293T cells were maintained in Dulbecco's modified Eagle medium (DMEM; 
G1BCO-BRL), supplemented with 10% fetal bovine serum (FBS). Plasmid DNAs 
were purified with double cesium chloride sedimentation gradients. Approximately 
5 3x10 293T cells were placed in a 10-cm dish one day before transfection. 10 jig of 

pCMVdR8.2 plasmid (containing the viral gag-pol gene), or 5 |ig of pVR1012s 
(containing the codon altered genes), were used to transfect 293T cells, using the 
calcium phosphate method (Chen, C and H Okayama, 1987, Mol. Cell. Biol, 7:2745- 
2752). Three days after transfection, cell lysates were prepared with RIP A buffer 

10 (Boehringer Mannheim, Indianapolis, IN) and separated by 4-15% gradient SDS- 
polyacrylamide gel electrophoresis (PAGE), then transferred onto an Immobilon P 
membrane (Millipore). Membranes were then incubated with anti HIV-l-IgG 
(AIDS Research and Reference Reagent Program), monoclonal anti-p24 (ICN)> or 
rabbit anti-RT (Intracel, Rockville, MD). Bands were visualized using the ECL 

15 Western blotting detection reagent (Amersham Pharmacia Biotech, Piscataway, NJ), 

as described by the manufacturer. Expression levels were determined using a 
phosphorimager. 

Generation of Stably Transfected Cell Lines 

hGag and hPol genes were individually subcloned into the Xho I and EcoRI 
20 sites of a retroviral vector, pPGS-CITE-Neo. Three plasmid systems were used to 

produce recombinant retroviruses containing the hGag or hPol genes (Yang, Z et al, 

1998, Science, 279:1034-1037). 48 hours after transfection, the supernatants were 

collected to transduce CT26 and BC10ME (Collins, JL, 1981, J Exp. Med, 153:89- 

106) which are syngeneic to Balb/C mice, and selected in 0.8 mg/ml of G418 two 
25 days after infection. The positive clones were screened and confirmed by Western 

blotting and maintained in 10% FCS supplemented RPMI (GIBCO-BRL) with 

0.5mg/mlG418. 

DNA Vaccination of Mice 

Female BALB/C mice, 6-8 weeks old, were used for immunogenicity 
30 studies. For DNA vaccination, mice were immunized with 100 p.1 ( 0.5 |ig/ml DNA 

, 0.9% NaCl) in the quadriceps muscle of each hind leg every two weeks for a total 

of 4 injections. 



-160- 



WO 02/32943 PCT/US01/25721 

CTL Assay 

Animals were sacrificed after the last immunization and spleens removed 
from both naive and immunized mice using aseptic techniques. Splenic 
lymphocytes were harvested and the chromium release CTL assay was performed in 
5 triplicate as previously described (Ohno, T. et al., 1997, Gene Ther, 4:361-366). 

The peptides used for sensitizing cells are as follows: Two peptide mixtures from 
the Gag protein PI 7(88-1 15) and p24(62-76). Seven peptide mixtures from the Pol 
protein: 1) P66(l 75-189), 2) P66(179-193), 3) P66(l 83-197), 4) P66(l 87-201), 5) 
P66(223-237), 6) P66(227-241), 7) P66(367-381). 

10 Measurement of Antibody Responses 
Anti-p24 ELISA assays 

The anti-p24 ELTSA assay was performed in Immunlon ninety-six-well 
plates (Dynet Technologies Inc., Chantilly, VA). The plates were coated with 50 p.1 
of purified recombinant HIV-IUIB p24 antigen (Intracel) at a concentration of 2 

15 |ig/ml in PBS buffer, pH 7.4 (G1BCO) with 0.05% sodium azide. Plates were 
washed 3x in PBS containing 3% BSA and 0.05% Tween20 (blocking buffer) and 
incubated for 2 hours. Mouse sera were serially diluted from 1:100 to 1:12,800 in 
blocking buffer, added to the p24-coated plates and incubated overnight at 4°C. 
Plates were then washed four times with PBS (0.05% Tween 20), and incubated with 

20 goat anti-mouse IgG (1:10,000 dilution, Sigma) for 2 hours at room temperature. 
Plates were washed four times, and then pNPP alkaline phosphatase substrate (75 [il; 
Sigma, St. Louis, MO) was added to each well. The reaction was stopped after one 
hour by addition of 0.5 N NaOH (25 ^il). The plates were read on an ELISA reader 
at 405 nm, and titers were calculated at a cutoff optical density of 0.4. 

25 HIV-1 immunoblotting 

The strips containing HIV-1 proteins (Immunectics Inc., Cambridge, MA) 
were incubated with pooled mouse sera at a dilution of 1 :25. Purified human anti- 
HIV IgG (AIDS Research and Reference Reagent Program, Rockville, MD) was 
used as a positive control. Bands were visualized using the ECL Western blotting 
30 detection reagent (Amersham Pharmacia Biotech, Piscataway, NJ). 

Immunoprecipitation ami Western blotting 

Three days after transfection, hPol gene-transfected 293T cell lysates were 
prepared with RIPA buffer. The pooled mouse sera were diluted with 
immunoprecipitation (IP) buffer (lOOmM KCI, 2.5mM MgCI 2 , 20mM HEPES, pH 
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7.9, 0.!% NP-40, ImM DTT and proteinase inhibitors). After adding 10 |ag of the 
cell lysate containing the HIV- 1 Pol protein, the reactions were incubated overnight 
on a rotator at 4°C. 250 jil of Protein G and A Sepharose beads (10%V/V in IP 
buffer) were then added, and the reactions were incubated for 2 hours on a rotator at 
5 4°C. The beads were washed four times with IP buffer, resuspended in 30 fil of IX 

sample buffer and loaded onto SDS-PAGE. After transfer onto an Immobilon P 
membrane (Millipore), membranes were incubated with anti HIV-l-IgG (AIDS 
Research and Reference Reagent Program). Bands were visualized using the ECL 
Western blotting detection reagent (Amersham Pharmacia Biotech). 

10 Prime/Boost Vaccination Strategy 

Groups of guinea pigs (4/group) were immunized (primed) with the vectors 
listed below, 3 times at 2 week intervals. Two weeks after the third DNA 
immunization, blood was collected for immune analysis. 2-3 days after the blood 
collection, the animals received a boost with a corresponding adenoviral construct, 

15 AdApt (Sullivan, N. et a1„ 2000, Nature, 408:605-608). Blood was again collected 2 
weeks after the adenoviral boost. Blood was tested for neutralizing antibodies and 
(using an ELISA) the presence of antibody against the envelope. The groups are 
summarized below. 

Prime Adenoviral Boost 

20 gpMOdelCFI (R5) Adv gpl40delCFI(R5) 

gpl45delCFI (R5) Adv gp1 45delCFI(R5) 

gpl45delCFI (R5) Adv gpl40delCFI(R5) 

gpl40deICFI (89.6P) Adv gpl40delCFI(89.6P) 

gpl45delCFI (89.6P) Adv gpl45delCFI(89.6P) 

25 gpl45delCFI (89.6P) Adv gpl40deICFI(89.6P) 

Adv gpl40delCFI (R5) Adv gpl40delCFJ(R5) 

Adv gpl45delCFI (R5) Adv gp!45delCFI(R5) 

Adv gpl 45delCFI (R5) Adv gpl40delCPI(R5) 

Adv gpl40delCFl (89.6P) Adv gpl 40delCFI(89.6P) 

30 Adv gpl45delCFI (89.6P) Adv gpl45delCFI(89.6P) 

Adv gpl 45delCFI (89.6P) Adv gpl 40delCFl(89.6P) 
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Results 

The level of antibody in the sera after adenoviral boost was 8- tolO-fold 
higher than the level of antibody present after 3 injections of DNA alone, in almost 
all of the cases. Neutralizing antibodies were detected in the sera of animals 
» immunized with either gp145deICFI DNA or gpHOdelCFI DNA followed by 

Advgpl40delCFI. 

Neutralization was also observed in animals immunized with gpl45deICFI 
DNA followed by Adv gpl45delCFI, and in animals both primed and boosted with 
adenovirus. 

Optimal neutralization was seen when animals were primed with 
gpl45delCFI DNA followed by adenovirus expressing gpl40delCFI in both clade B 
(R5-Bal strain) and 89.6P (Clade B, dual tropic strain). 

All neutralizing activity was able to be blocked with the V3 peptide of the 
corresponding strains. 



15 
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While the present invention has been described in some detail for purposes 
of clarity and understanding, one skilled in the art will appreciate that various 
20 changes in form and detail can be made without departing from the true scope of the 

invention. All figures, tables, and appendices, as well as patents, applications and 
publications referred to above are hereby incorporated by reference. 
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WHAT IS CLAIMED IS: 

1. A nucleic acid molecule comprising a polynucleotide encoding a 
modified glycosylation or ACFI HIV Env, or combination thereof, encoding a 
plasmid taken from Table 1 (or its modified HIV Env insert), or encoding an analog 

5 of said plasmid or insert having at least 95% identity thereto. 

2. The nucleic acid molecule of Claim 1 which is plasmid 2100 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

« 

3. The nucleic acid molecule of Claim 1 which is plasmid 2200 (or its 
10 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

4. The nucleic acid molecule of Claim 1 which is plasmid 2300 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

15 5. The nucleic acid molecule of Claim 1 which is plasmid 2302 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

6. The nucleic acid molecule of Claim 1 which is plasmid 2400 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

20 identity thereto. 

7. The nucleic acid molecule of Claim 1 which is plasmid 2700 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

8. The nucleic acid molecule of Claim 1 which is plasmid 2701 (or its 
25 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

9. The nucleic acid molecule of Claim 1 which is plasmid 2702 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

30 10. The nucleic acid molecule of Claim 1 which is plasmid 2706 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

11. The nucleic acid molecule of Claim 1 which is plasmid 2707 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
35 identity thereto. 
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12. The nucleic acid molecule of Claim 1 which is plasmid 2800 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

13. The nucleic acid molecule of Claim 1 which is plasmid 2801 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

14. The nucleic acid molecule of Claim 1 which is plasmid 2804 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 15. The nucleic acid molecule of Claim 1 which is plasmid 2805 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

16. The nucleic acid molecule of Claim 1 which is plasmid 2810 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

17. The nucleic acid molecule of Claim 1 which is plasmid 2811 (or its 
modified HTV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

18. The nucleic acid molecule of Claim 1 which is plasmid 2812 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

19. The nucleic acid molecule of Claim I which is plasmid 2813 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 20. The nucleic acid molecule of Claim I which is plasmid 2814 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

21. The nucleic acid molecule of Claim I which is plasmid 2820 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

22. The nucleic acid molecule of Claim 1 which is plasmid 2821 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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23. The nucleic acid molecule of Claim 1 which is plasmid 2822 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

24. The nucleic acid molecule of Claim 1 which is plasmid 2823 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

25. The nucleic acid molecule of Claim 1 which is plasmid' 2824 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 26. The nucleic acid molecule of Claim I which is plasmid 2825 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

27. The nucleic acid molecule of Claim 1 which is plasmid 2826 (or its 
modified HTV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

28. The nucleic acid molecule of Claim 1 which is plasmid 2827 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

29. The nucleic acid molecule of Claim 1 which is plasmid 2828 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

30. The nucleic acid molecule of Claim 1 which is plasmid 2829 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 31. The nucleic acid molecule of Claim 1 which is plasmid 2830 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

32. The nucleic acid molecule of Claim 1 which is plasmid 2831 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

33. The nucleic acid molecule of Claim 1 which is plasmid 2832 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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34. The nucleic acid molecule of Claim 1 which is plasmid 2833 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

35. The nucleic acid molecule of Claim 1 which is plasmid 2834 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

36. The nucleic acid molecule of Claim 1 which is plasmid 2835 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 37. The nucleic acid molecule of Claim I which is plasmid 2836 (or its 

modified HTV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

38. The nucleic acid molecule of Claim 1 which is plasmid 2837 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

39. The nucleic acid molecule of Claim 1 which is plasmid 2838 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

40. The nucleic acid molecule of Claim 1 which is plasmid 2839 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

41. The nucleic acid molecule of Claim 1 which is plasmid 2840 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 42. The nucleic acid molecule of Claim 1 which is plasmid 2841 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

43. The nucleic acid molecule of Claim 1 which is plasmid 2842 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

44. The nucleic acid molecule of Claim 1 which is plasmid 2843 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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45. The nucleic acid molecule of Claim 1 which is plasmid 2844 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

46. The nucleic acid molecule of Claim 1 which is plasmid 2845 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

47. The nucleic acid molecule of Claim, 1 which is plasmid 2846 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 48. The nucleic acid molecule of Claim 1 which is plasmid 2847 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

49. The nucleic acid molecule of Claim 1 which is plasmid 2848 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

50. The nucleic acid molecule of Claim 1 which is plasmid 2849 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

51. The nucleic acid molecule of Claim 1 which is plasmid 2850 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

52. The nucleic acid molecule of Claim 1 which is plasmid 2851 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 53 « The nucleic acid molecule of Claim 1 which is plasmid' 2852 (or its 

* modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

54. The nucleic acid molecule of Claim I which is plasmid 2853 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

55. The nucleic acid molecule of Claim I which is plasmid 2854 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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56. The nucleic acid molecule of Claim 1 which is plasmid 2860 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

57. The nucleic acid molecule of Claim 1 which is plasmid 2861 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

58. The nucleic acid molecule of Claim 1 which is plasmid 2862 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 59. The nucleic acid molecule of Claim 1 which is plasmid 2863 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

60. The nucleic acid molecule of Claim 1 which is plasmid 2864 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

61. The nucleic acid molecule of Claim 1 which is plasmid 2865 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

62. The nucleic acid molecule of Claim 1 which is plasmid 2866 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

63. The nucleic acid molecule of Claim 1 which is plasmid 2867 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 64. The nucleic acid molecule of Claim 1 which is plasmid 2868 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

65. The nucleic acid molecule of Claim 1 which is plasmid 2869 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

66. The nucleic acid molecule of Claim 1 which is plasmid 2870 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 



-169- 



WO 02/32943 PCT/US01/25721 

67. The nucleic acid molecule of Claim 1 which is plasmid 2871 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

68. The nucleic acid molecule of Claim 1 which is plasmid 2872 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

69. The nucleic acid molecule of Claim 1 which is plasmid 2873 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 70. The nucleic acid molecule of Claim I which is plasmid 2874 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

71. The nucleic acid molecule of Claim 1 which is plasmid 2900 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

1.5 identity thereto. 

72. The nucleic acid molecule of Claim 1 which is plasmid 3000 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

* 

73. The nucleic acid molecule of Claim 1 which is plasmid 3200 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

74. The nucleic acid molecule of Claim 1 which is plasmid 3201 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 75. The nucleic acid molecule of Claim I which is plasmid 3202 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

76. The nucleic acid molecule of Claim 1 which is plasmid 3203 (or its 
modified HTV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

77. The nucleic acid molecule of Claim 1 which is plasmid 3300 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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78. The nucleic acid molecule of Claim 1 which is plasmid 3301 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

79. The nucleic acid molecule of Claim 1 which is plasmid 3400 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

80. The nucleic acid molecule of Claim 1 which is plasmid 3401 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 81. The nucleic acid molecule of Claim 1 which is plasmid 3500 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

82. The nucleic acid molecule of Claim 1 which is plasmid 3600 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

83. The nucleic acid molecule of Claim 1 which is plasmid 3700 (or its 

» 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

84. The nucleic acid molecule of Claim 1 which is plasmid 3800 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

85. The nucleic acid molecule of Claim 1 which is plasmid 5200 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 86. The nucleic acid molecule of Claim 1 which is plasmid 5201 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

87. The nucleic acid molecule of Claim 1 which is plasmid 5202 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

88. The nucleic acid molecule of Claim 1 which is plasmid 5203 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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89. The nucleic acid molecule of Claim 1 which is plasmid 5300 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

90. The nucleic acid molecule of Claim 1 which is plasmid 5301 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

91. The nucleic acid molecule of Claim 1 which is plasmid 5303 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 92. The nucleic acid molecule of Claim 1 which is plasmid 5304 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

93. The nucleic acid molecule of Claim 1 which is plasmid 5305 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

94. The nucleic acid molecule of Claim 1 which is plasmid 5306 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

95. . The nucleic acid molecule of Claim 1 which is plasmid 5307 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

96. The nucleic acid molecule of Claim 1 which is plasmid 5308 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 97. The nucleic acid molecule of Claim 1 which is plasmid 5309 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

98. The nucleic acid molecule of Claim 1 which is plasmid 5350 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

99. The nucleic acid molecule of Claim I which is plasmid 5351 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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100. The nucleic acid molecule of Claim 1 which is plasmid 5352 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

101. The nucleic acid molecule of Claim 1 which is plasmid 5353 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

102. The nucleic acid molecule of Claim 1 which is plasmid 5354 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 103. The nucleic acid molecule of Claim 1 which is plasmid 5355 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

104. The nucleic acid molecule of Claim 1 which is plasmid 5356 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

105. The nucleic acid molecule of Claim 1 which is plasmid 5357 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

106. The nucleic acid molecule of Claim 1 which is plasmid 5358 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

107. The nucleic acid molecule of Claim 1 which is plasmid 5359 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 108. The nucleic acid molecule of Claim 1 which is plasmid 5360 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

109. The nucleic acid molecule of Claim 1 which is plasmid 5361 (or its 
modified HI V Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

110. The nucleic acid molecule of Claim 1 which is plasmid 5362 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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111. The nucleic acid molecule of Claim 1 which is plasmid 5363 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

1 12. The nucleic acid molecule of Claim 1 which is plasmid 5364 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

113. The nucleic acid molecule of Claim 1 which is plasmid 5365 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 114. The nucleic acid molecule of Claim 1 which is plasmid 5366 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

115. The nucleic acid molecule of Claim 1 which is plasmid 5367 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

116. The nucleic acid molecule of Claim 1 which is plasmid 5368 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

117. The nucleic acid molecule of Claim 1 which is plasmid 5369 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

118. The nucleic acid molecule of Claim 1 which is plasmid 5370 (or its 
modified HTV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 119. The nucleic acid molecule of Claim 1 which is plasmid 5371 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

120. The nucleic acid molecule of Claim 1 which is plasmid 5372 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

121. The nucleic acid molecule of Claim 1 which is plasmid 5373 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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122. The nucleic acid molecule of Claim 1 which is plasmid 5374 (or its 
modified HJV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

123. The nucleic acid molecule of Claim 1 which is plasmid 5375 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

124. The nucleic acid molecule of Claim 1 which is plasmid 5376 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 125. The nucleic acid molecule of Claim 1 which is plasmid 5377 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

126. The nucleic acid molecule of Claim 1 which is plasmid 5378 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

127. The nucleic acid molecule of Claim 1 which is plasmid 5379 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

128. The nucleic acid molecule of Claim 1 which is plasmid 5500 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

129. The nucleic acid molecule of Claim 1 which is plasmid 5501 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 130. The nucleic acid molecule of Claim I which is plasmid 5502 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

131. The nucleic acid molecule of Claim 1 which is plasmid 5503 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

132. The nucleic acid molecule of Claim 1 which is plasmid 5504 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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133. The nucleic acid molecule of Claim 1 which is plasmid 5505 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

134. The nucleic acid molecule of Claim 1 which is plasmid 5506 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

135. The nucleic acid molecule of Claim 1 which is plasmid 5507 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 136. The nucleic acid molecule of Claim 1 which is plasmid 5508 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

137. The nucleic acid molecule of Claim 1 which is plasmid 5509 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

138. The nucleic acid molecule of Claim 1 which is plasmid 5510 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

* 

139. The nucleic acid molecule of Claim 1 which is plasmid 5511 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

140. The nucleic acid molecule of Claim 1 which is plasmid 5512 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 141. The nucleic acid molecule of Claim 1 which is plasmid 5513 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

142. The nucleic acid molecule of Claim 1 which is plasmid 5514 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

143. The nucleic acid molecule of Claim 1 which is plasmid 5515 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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144. The nucleic acid molecule of Claim 1 which is plasmid 5516 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

145. The nucleic acid molecule of Claim 1 which is plasmid 5517 (or its 
5 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

146. The nucleic acid molecule of Claim 1 which is plasmid 5518 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

10 147. The nucleic acid molecule of Claim 1 which is plasmid 5519 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

148. The nucleic acid molecule of Claim 1 which is plasmid 5520 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

15 identity thereto. 

149. The nucleic acid molecule of Claim 1 which is plasmid 5521 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

150. The nucleic acid molecule of Claim 1 which is plasmid 5522 (or its 
20 modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

identity thereto. 

151. The nucleic acid molecule of Claim 1 which is plasmid 5523 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

25 152. The nucleic acid molecule of Claim 1 which is plasmid 5524 (or its 

modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

153. The nucleic acid molecule of Claim 1 which is plasmid 5525 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 

30 identity thereto. 

154. The nucleic acid molecule of Claim 1 which is plasmid 5526 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 
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155. The nucleic acid molecule of Claim 1 which is plasmid 5527 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

156. The nucleic acid molecule of Claim 1 which is plasmid 5528 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

157. The nucleic acid molecule of Claim 1 which is plasmid 5529 (or its 
modified HIV Env insert) or an analog of said plasmid or insert having at least 95% 
identity thereto. 

158. A nucleic acid molecule comprising a polynucleotide encoding in a 
continuous open reading frame a modified HTV Gag-Pol or Gag-Pol-Nef fusion 
protein, encoding a plasmid taken from Table 1 (or its modified HIV Gag, Pol, or 
Nef insert), or encoding an analog of said plasmid or insert having at least 95% 
identity thereto. 

159. The nucleic acid molecule of Claim 158 which is plasmid 3900 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

160. The nucleic acid molecule of Claim 158 which is plasmid 3901 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

161. The nucleic acid molecule of Claim 158 which is plasmid 4000 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

162. The nucleic acid molecule of Claim 158 which is plasmid 4001 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

1 63. The nucleic acid molecule of Claim 1 58 which is plasmid 4100 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

164. The nucleic acid molecule of Claim 158 which is plasmid 4101 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

165. The nucleic acid molecule of Claim 158 which is plasmid 4200 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 
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1 66. The nucleic acid molecule of Claim 1 58 which is plasmid 4300 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

167. The nucleic acid molecule of Claim 158 which is plasmid 4301 (or its 
5 modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 

at least 95% identity thereto. 

168. The nucleic acid molecule of Claim 158 which is plasmid 4302 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

10 169. The nucleic acid molecule of Claim 158 which is plasmid 4303 (or its 

modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

170. The nucleic acid molecule of Claim 158 which is plasmid 4304 (or its 
modi fied HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 

1 5 at least 95% identity thereto. 

171. The nucleic acid molecule of Claim 158 which is plasmid 4305 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

172. The nucleic acid molecule of Claim 158 which is plasmid 4306 (or its 
20 modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 

at least 95% identity thereto. 

► 

173. The nucleic acid molecule of Claim 1 58 which is plasmid 4308 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

25 1 74. The nucleic acid molecule of Claim 1 58 which is plasmid 4309 (or its 

modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

175. The nucleic acid molecule of Claim 158 which is plasmid 4310 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 

30 at least 95% identity thereto. 

176. The nucleic acid molecule of Claim 158 which is plasmid 4311 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 
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177. The nucleic acid molecule of Claim 158 which is plasmid 4312 (or its 
modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 
at least 95% identity thereto. 

178. The nucleic acid molecule of Claim 158 which is plasmid 4313 (or its 
5 modified HIV Gag, Pol, or Nef insert) or an analog of said plasmid or insert having 

at least 95% identity thereto. 

179. A modified HIV Env, Gag, Pol, or Nef encoded by the nucleic acid 
molecule of any of Claims l-l 78. 

180. The nucleic acid molecule of any of Claims 1-178 further comprising 
10 a backbone, wherein said backbone is a plasmid vector. 

181. The nucleic acid molecule of any of Claims 1-178 further comprising 
a backbone, wherein said backbone is an adenoviral vector. 

182. A nucleic acid molecule comprising a polynucleotide encoding a 
modified glycosylation or ACFI HIV Env, or combination thereof, encoding a 

15 modified HIV Env insert taken from a plasmid taken from Table 1, or encoding an 

analog of said insert having at least 95% identity thereto, and further encoding in a 
continuous open reading frame a modified HIV Gag-Pol or Gag-Pol-Nef fusion 
protein, encoding a modified HIV Gag, Pol, or Nef insert taken from a plasmid taken 
from Table 1, or encoding an analog of said insert having at least 95% identity 

20 thereto. 

183. A pharmaceutical composition comprising at least one nucleic acid 
molecule of any of Claims 1-178 and a pharmaceutical^ acceptable solution in a 
therapeutically effective dose 

1 84. A vaccine composition comprising at least one nucleic acid molecule 
25 of any of Claims 1-178 and a pharmaceutical ly acceptable solution in a 

prophy tactically effective dose. 

185. A pharmaceutical composition comprising at least one modified HIV 
Env, Gag, Pol, or Nef of Claim 179 and a pharmaceutical^ acceptable solution in a 
therapeutically effective dose. 

30 186. A vaccine composition comprising at least one modified HIV Env, 

Gag, Pol, or Nef of Claim 179 and a pharmaceutical^ acceptable solution in a 
prophylactically effective dose. 

187. The composition of any of Claims 183-186 additionally comprising 
an adjuvant. 

35 188. The composition of Claim 187 wherein said adjuvant is a cytokine. 

-180- 



WO 02/32943 PCT/US01/2572I 

189. The composition of Claim 1 83 or 1.84 which combines plasmid 4306 
(gag-pol-nef) and plasmid 2827 (env). 

190. The composition of Claim 183 or 184 which combines plasmid 4306 
(gag-pol-nef Clade B), plasmid 2827 (env Clade B), plasmid 4311 (gag-pol-nef 

5 Clade C), and plasmid 5509 (env Clade C). 

191. A method comprising the step of administering by genetic 
immunization a nucleic acid molecule of any of Claims 1-178 to a host to ameliorate 
the symptoms of ADDS or HIV infection. 

192. A method comprising the step of administering by genetic 
10 immunization a nucleic acid molecule of any of Claims 1-178 to a host to generate 

an antibody or CTL response against native HIV Env, Gag, Pol, orNef. 

193. A method comprising the step of administering by protein 
immunization a modified HTV Env, Gag, Pol, or Nef of Claim 179 to a host to 
ameliorate the symptoms of AIDS or HIV infection. 

15 194. A method comprising the step of administering by protein 

immunization a modified HIV Env, Gag, Pol, or Nef of Claim 179 to a host to 
generate an antibody or CTL response against native HIV Env, Gag, Pol, or Nef 

195. The method of any of Claims 191-194 wherein said step is a primary 
immunization. 

20 196. The method of any of Claims 191-194 wherein said step is a boost 

197. The method of Claims 191 or 192 wherein said step is a primary 
immunization with a plasmid vector. 

198. The method of Clai ms 191 or 192 wherein said step is a boost with an 
adenoviral vector. 

25 199. The method of Claims 191 or 192 wherein said step is a primary 

immunization with a plasmid vector expressing gpl45delCFI and further comprising 
a step which is a boost with an adenoviral vector expressing gpl40delCFI. 

200. A nucleic acid molecule of any of Claims 1-178 for use by genetic 
immunization in amelioration of the symptoms of AIDS or HIV infection. 

30 201. A nucleic acid molecule of any of Claims 1-178 for use by genetic 

immunization in generation of an antibody or CTL response against native HIV Env, 
Gag, Pol, or Nef. 

202. A modified HIV Env, Gag, Pol, or Nef of Claim 179 for use by 
protein immunization in amelioration of the symptoms of AIDS or HIV infection. 
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203. A modified HIV Env, Gag, Pol, or Nef of Claim 179 for use by 
protein immunization in generation of an antibody or CTL response against native 
HIV Env, Gag, Pol, or Nef. 

204. A nucleic acid molecule comprising a polynucleotide encoding a 
5 modified ACFI HIV Env or encoding in a continuous open reading frame a modified 

HIV Gag-Pol or Gag-Pol-Nef fusion protein, or a combination thereof, applied to 
any HIV clade or strain. 

205. A nucleic acid molecule comprising a polynucleotide encoding a 
modified ACFI HIV Env or encoding in a continuous open reading frame a modified 

10 HIV Gag-Pol or Gag-Pol-Nef fusion protein, or a combination thereof, applied to 
related proteins of other viruses. 

206. A nucleic acid molecule comprising a polynucleotide encoding a 
modified viral membrane fusion protein with deletions in sufficient domains to 
expose a core protein for optimal antigen presentation, wherein the viral membrane 

15 fusion protein is defined by a central triple stranded coiled coil followed by a 
disulfide-bonded loop that reverses the chain direction and connects to an a helix 
packed antiparallel to the core helices. 

207. A nucleic acid molecule comprising a polynucleotide encoding in a 
continuous open reading frame a modified viral protein, wherein the viral protein is 

20 transcribed in native form by translational frame shifting in a non-equivalent ratio or 
through a separate open reading frame. 
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SEQUENCE LISTING 

<110> THE GOVERNMENT OF THE UNITED STATES OF AMERICA, AS REPRESENTED 
BY THE SECRETARY, DEPARTMENT OF HEALTH AND HUMAN SERVICES 
NABIiE, Gary J. 
CHAKRABARTIr Bimal K. 
HUANG, Yue 



<120> MODIFICATIONS OF HIV Env, Gag, AND Pol 

ENHANCE IMMUNOGENICITY FOR GENETIC IMMUNIZATION 



<130> NIH206.001VPC 

<140> UNKNOWN 
<141> 

<150> US 60/225,097 

<151> 2000-08-14 

<150> US 60/252,115 
<151> 2000-11-14 

<150> US 60/279,257 
<151> 2001-03-28 

<160> 176 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 7547 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 1 

tcgcgcgttt cggtgatgac ggtgaaaacc 
cagcttgtct gtaagcggat gccgggagca 
ttggcgggtg tcggggctgg cttaactatg 
accatafcgcg gtgtgaaata ccgcacagat 
ctattggcca btgcatacgb tgtatccata 
tccaacatta ccgccatgtt gacattgatt 
ggggtcatta gttcatagcc catatatgga 
cccgcctggc tgaccgccca acgacccccg 
cafcagbaacg ccaataggga cfcttccatbg 
tgcccacttg gcagbacatc aagtgbatca 
tgacggtaaa tggcccgcct ggcabbatgc 
ttggcagtac atctacgtab tagbcabcgc 
catcaatggg cgtggatagc ggttbgactc 
cgtcaatggg agtttgfctfct ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agcbcgbtta gtgaaccgtc agatcgcctg 



bcbgacacat gcagctcccg gagacggtca 60 
gacaagcccg tcagggcgcg tcagcgggbg 120 
cggcatcaga gcagabtgba ctgagagtgc 180 
gcgtaaggag aaaataccgc atcagattgg 24 0 
bcataatatg tacatttata ttggctcatg 300 
attgactagt battaatagt aabcaatbac 360 
gttccgcgtt acabaacbba cggfcaaabgg 420 
cccabbgacg bcaataatga cgtatgbtcc 480 
acgtcaabgg gbggagtatb tacggtaaac 540 
tatgccaagb acgcccccta btgacgbcaa 600 
ccagtacatg accbbabggg acttbcctac 660 
tatbaccatg gfcgabgcggb tttggcagba 720 
acggggabbt ccaagtctcc accccabtga 780 
tcaacgggac tfctccaaaat gtcgtaacaa 840 
gcgtgbacgg bgggacgbcb atataagcag 900 
gagacgccab ccacgctgfct ttgaccbcca 960 
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tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagbgacg taagtaccgc ctatagactc babaggcaca cccctbbggc 1080 
tcttatgcafc gctabactgt ttttggcttg gggcctabac acccccgctt ccttatgcta 1140 
taggtgatgg tatagctbag cctabaggtg bgggbtabbg accabbabbg accacbcccc 1200 
babtggtgac gatactttcc abtacbaabc cataacabgg ctctttgcca caactatctc 1260 
babbggcbab atgccaatac tctgbccbbc agagactgac acggactctg babtbbbaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagbbbbt attaaacata gcgtgggatc tccacgcgaa tcbcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct bccacatccg agcccbggbc ccatgccbcc 1500 
agcggcbcat ggtcgctcgg cagcbccbtg cbcctaacag tggaggccag acbtaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbatgbgbcb 1620 
gaaaabgagc gbggagabtg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag atgcaggcag ctgagbbgbb gtattcbgab aagagtcaga ggbaacbccc 1740 
gbbgcggtgc bgtbaacggb ggagggcagb gtagtctgag cagtacbcgb tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt bccbbbccab gggtcbbbtc 1860 
tgcagtcacc gtcgtcgaca cgtgbgabca gatatcgcgg ccgcbctaga acaccabgcg 132 0 
cgbgaaggag aagtaccagc accbgbggcg cbggggcbgg cgctggggca ccatgcbgct 1980 
gggcabgcbg abgatcbgca gcgccaccga gaagctgbgg gtgaccgtgb acbacggcgt 2040 
gcccgbgbgg aaggaggcca ccaccacccb gctcbgcgcc agcgacgcca aggccbacga 2100 
caccgaggbg cacaacgbgb gggccaccca cgcctgcgbg cccaccgacc ccaaccccca 2160 
ggaggbggtg cbggtgaacg bgaccgagaa cttcgacabg bggaagaacg acatggbgga 2220 
gcagatgcac gaggacatca bcagccbgbg ggaccagagc ctgaagcccb gcgtgaagcb 2280 
gaccccccbg bgcgtgagcc bgaagbgcac cgacctgaag aacgacacca acaccaacag 2340 
cagcagcggc cgcatgatca tggagaaggg cgagatcaag aactgcagcb bcaacatcag 24 00 
caccagcabc cgcggcaagg tgcagaagga gbacgccbtc btcbacaagc bggacabcat 2460 
ccccabcgac aacgacacca ccagctacag cctgaccagc bgcaacacca gcgbgabcac 2520 
ccaggcctgc cccaaggtga gctbcgagcc catccccaac cactacbgcg cccccgccgg 2580 
cbbcaccabc ctgaagbgca aggacaagaa gttcaacggc aagggccccb gcaccaacgt 2640 
gagcaccgbg cagbgcaccc acggcabccg ccccgbggtg agcacccagc bgcbgcbgaa 2700 
cgaccgccbg gccgaggagg aggbggbgab ccgcagcgcc aacbtcgccg acaacgccaa 2760 
ggbgabcabc gtgcagctga acgagagcgb ggagabcaac tgcacccgcc ccaacaacaa 2 820 
cacccgcaag agcabccaca tcggccccgg ccgcgccbbc tacaccaccg gcgagabcab 2 880 
cggcgacabc cgccaggccc acbgcaaccb gagccgcgcc aagbggaacg acacccbgaa 2940 
caagabcgtg abcaagctgc gcgagcagbt cggcaacaag accabcgbgb bcaagcacag 3000 
cagcggcggc gaccccgaga tcgbgaccca cagctbcaac bgcggcggcg agbtcbtcba 3060 
cbgcaacagc acccagcbgb tcaacagcac cbggbbcaac agcaccbgga gcaccgaggg 3120 
cagcaacaac accgagggca gcgacaccat caccctgccc bgccgcatca agcagabcab 3180 
caacatgbgg cagaaggtgg gcaaggccat gbacgccccc cccabcagcg gccagabccg 324 0 
ctgcagcagc aacatcaccg gccbgcbgct gacccgcgac ggcggcaaca gcaacaacga 33 00 
gagcgagatc tbccgccbgg gcggcggcga catgcgcgac aacbggcgca gcgagcbgba 3360 
caagbacaag gtggtgaaga tcgagccccb gggcgbggcc cccaccaagg ccaagcgccg 3420 
cgtggbgcag cgcgagaagc gcgccgbggg catcggcgcc cbgbbccbgg gcbtccbggg 34 80 
cgccgccggc agcaccabgg gcgccgccag catgacccbg accgbgcagg cccgccagcb 354 0 
gcbgagcggc abcgtgcagc agcagaacaa cctgctgcgc gccabcgagg cccagcagca 3600 
ccbgctgcag cbgaccgtgb ggggcatcaa gcagcbgcag gcccgcabcc bggccgbgga 3660 
gcgctacctg aaggaccagc agcbgcbggg catctggggc bgcagcggca agcbbabcbg 3720 
caccaccgcc gtgccctgga acgccagcbg gagcaacaag agccbggagc agabcbggaa 3780 
ccacaccacc tggabggagb gggaccgcga gatcaacaac bacaccagcc bgabccacag 3840 
ccbgatcgag gagagccaga accagcagga gaagaacgag caggagcbgc bggagcbgga 3900 
caagbgggcc agcabgggcg gcaagtggag caagagcagc gbgatcggcb ggcccgccgt 3960 
gcgcgagcgc abgcgccgcg ccgagcccgc cgccgacggc gbgggcgccg bgagccgcga 4 02 0 
ccbggagaag cacggcgcca bcaccagcag caacaccgcc gccaacaacg ccgccbgcgc 4080 
cbggcbggag gcccaggagg aggaggaggb gggcbbcgca gbgaccgcac aggbggcacb 4140 
gcgcgcaatg accbacaagg ccgccgbgga cctgagccac bbccbgaagg agaagggcgg 4200 
ccbggagggc cbgabccaca gccagcgccg ccaggacabc cbggaccbgb ggabcbacca 4260 
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cacccagggc tacttccccg actggcagaa cfcacaccccc ggccccggcg bgcgcbaccc 4320 
cctgaccttc ggctggtgct acaagctggt gcccgtggag cccgacaagg tggaggaggc 43 80 
caacaagggc gagaacacca gcctgctgca ccccgtgagc ctgcacggca tggcagcacc 44 4 0 
cgagcgcgag gtgctggagt ggcgcttcga cagccgcctg gccbbccacc acgbggcccg 4500 
cgagctgcac cccgagtact tcaagaacbg ctgaggatcc agabcbgcbg tgccttctag 4560 
ttgccagcca tctgttgttt gcccctcccc cgbgccbbcc bbgacccbgg aaggtgccac 4 620 
tcccactgtc cbbbccbaat aaaatgagga aattgcatcg cattgtctga gbaggbgbca 4680 
ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg aagacaatag 4740 
caggcatgct ggggatgcgg bgggcbcbab gggtacccag gtgctgaaga attgacccgg 4 800 
tbccbcctgg gccagaaaga agcaggcaca tccccttctc tgtgacacac cctgtccacg 4860 
cccctggttc bbagtbccag ccccacbcab aggacactca tagctcagga gggctccgcc 4 920 
ttcaatccca cccgctaaag tacttggagc ggtctctccc tccctcatca gcccaccaaa 4980 
ccaaacctag cctccaagag tgggaagaaa ttaaagcaag ataggctatt aagtgcagag 5040 
ggagagaaaa tgcctccaac atgtgaggaa gtaatgagag aaabcabaga attttaaggc 5100 
catcatggcc ttaatcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc 5160 
tgcggcgagc ggbatcagcb cacbcaaagg cggtaabacg gttatccaca gaabcagggg 5220 
ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgbaaaaagg 52 80 
ccgcgttgct ggcgtttttc cabaggcbcc gcccccctga cgagcatcac aaaaabcgac 5340 
gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 5400 
gaagctccct cgtgcgctct ccbgbbccga ccctgccgct taccggatac ctgtccgcct 5460 
ttctcccttc gggaagcgtg gcgctttctc atagctcacg cbgbaggtab ctcagttcgg 5520 
tgtaggbcgb tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 5580 
gcgccttatc cggtaacbat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 5640 
tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 5700 
tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt atctgcgctc 5760 
tgctgaagcc agttaccttc ggaaaaagag ttggtagctc bbgabccggc aaacaaacca 5820 
ccgctggtag cggbggbbbb bbbgbbbgca agcagcagat tacgcgcaga aaaaaaggat 5880 
ctcaagaaga bccbbbgabc bbbbcbacgg ggtctgacgc bcagbggaac gaaaactcac 5940 
gttaagggat bbbggtcabg agabbabcaa aaaggabcbt cacctagatc cttttaaatt 6000 
aaaaatgaag btbbaaabca atctaaagta bababgagba aacttggtct gacagttacc 6060 
aahgcttaat cagtgaggca cctatctcag cgatctgtct abbbcgbbca tccatagttg 6120 
cctgactcgg gggggggggg cgcbgaggbc tgcctcgtga agaaggtgtt gctgactcat 6180 
accaggcctg aatcgcccca tcatccagcc agaaagtgag ggagccacgg ttgatgagag 624 0 
ctttgttgta ggtggaccag ttggtgattt tgaacttttg ctttgccacg gaacggtcbg 6300 
cgttgtcggg aagatgcgtg atctgatcct tcaactcagc aaaagttcga tttatbcaac 6360 
aaagccgccg bcccgbcaag bcagcgbaab gctcbgccag tgbbacaacc aabbaaccaa 6420 
tbcbgabbag aaaaacbcat cgagcabcaa abgaaactgc aabbbabtca batcaggabb 6480 
atcaabacca babtbbbgaa aaagccgttb cbgbaabgaa ggagaaaacb caccgaggca 6540 
gtbccabagg abggcaagat cctggtatcg gbcbgcgabb ccgacbcgbc caacabcaat 6600 
acaacctabb aabtbcccct cgtcaaaaab aaggbbabca agbgagaaab caccabgagb 6660 
gacgacbgaa bccggtgaga abggcaaaag cbbabgcabb tcbbbccaga cbbgbbcaac 6720 
aggccagcca bbacgcbcgt cabcaaaabc acbcgcabca accaaaccgb babbcabbcg 6780 
tgabtgcgcc bgagcgagac gaaabacgcg abcgcbgbba aaaggacaab bacaaacagg 684 0 
aabcgaatgc aaccggcgca ggaacacbgc cagcgcabca acaababbbb caccbgaabc 6900 
aggababtcb tcbaabacct ggaabgctgt bbtcccgggg abcgcagtgg bgagbaacca 6960 
tgcabcatca ggagbacgga baaaatgcbb gatggbcgga agaggcataa abbccgtcag 7020 
ccagbbbagb cbgaccabct cabctgbaac atcabbggca acgcbaccbb bgccabgbbb 7080 
cagaaacaac bcbggcgcat cgggctbccc atacaabcga bagabbgbcg caccbgabbg 7140 
cccgacabba bcgcgagccc abbbataccc ababaaabca gcabccabgb bggaabbbaa 7200 
bcgcggccbc gagcaagacg bbbcccgtbg aababggcbc abaacacccc bbgtabbacb 7260 
gbtbabgtaa gcagacagtt btattgbbca bgatgababa bbtbbabcbb gbgcaabgba 7320 
acabcagaga bbbbgagaca caacgtggcb bbcccccccc ccccabbabb gaagcabbba 7380 
tcagggbtab bgbcbcabga gcggatacab atbbgaabgb abbbagaaaa abaaacaaab 7440 
aggggbbccg cgcacabbbc cccgaaaagb gccaccbgac gbcbaagaaa ccabbabbab 7500 
catgacabba accbabaaaa abaggcgbab cacgaggccc bbtcgbc 7547 
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<210> 2 
<211> 8009 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 2 

fccgcgcgttb cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgbct gtaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
htggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca btgcatacgt tgtatccaba tcabaatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtb gacabbgatt atbgactagt tattaatagt aatcaattac 360 
ggggtcatta gbtcatagcc catatatgga gttccgcgtt acataactta cggtaaabgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg bcaataabga cgbatgttcc 480 
catagtaacg ccaataggga cbttccatbg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta bbgacgbcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
bbggcagbac atctacgtat tagbcabcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgbggatagc ggtttgactc acggggattt ccaagtcbcc accccabbga 7 80 
cgtcaatggg agtbtgbtbt ggcaccaaaa tcaacgggac ttbccaaaab gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg bgggaggtcb atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt cctbabgcba 114 0 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tcbgbcctbc agagactgac acggactctg tabtbbbaca 1320 
ggatggggtc ccabtbabta bttacaaatb cacatataca acaacgccgt cccccgtgcc 1380 
cgcagbbttb abbaaacaba gcgbgggabc tccacgcgaa bcbcgggbac gtgbtccgga 144 0 
catgggctct bcbccggtag cggcggagct bccacatccg agccctggbc ccatgccbcc 1500 
agcggcbcat ggbcgctcgg cagctccttg ctcctaacag bggaggccag acctaggcac 1560 
agcacaatgc ccaccaccac cagtgbgccg cacaaggccg tggcggtagg gbabgbgtct 162 0 
gaaaatgagc gbggagabbg ggctcgcacg gctgacgcag abggaagact baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgtt gtabtctgat aagagtcaga ggbaacbccc 174 0 
gttgcggtgc tgbbaacggt ggagggcagt gtagtctgag cagtactcgb bgctgccgcg 1800 
cgcgccacca gacabaabag cbgacagact aacagactgb tcctbtccat gggbcbtbtc 1860 
tgcagbcacc gtcgtcgaca cgbgbgabca gabatcgcgg ccgcbcbaga acaccatgcg 1920 
cgbgaaggag aagbaccagc accbgbggcg ctggggcbgg cgcbggggca ccabgctgcb 1980 
gggcatgcbg atgabctgca gcgccaccga gaagcbgbgg gtgaccgfcgt actacggcgb 2040 
gcccgbgtgg aaggaggcca ccaccaccct gctctgcgcc agcgacgcca aggccbacga 2100 
caccgaggtg cacaacgbgt gggccaccca cgcctgcgtg cccaccgacc ccaaccccca 2160 
ggaggtggtg ccggbgaacg tgaccgagaa cttcgacatg tggaagaacg acatggbgga 2220 
gcagatgcac gaggacatca bcagccbgbg ggaccagagc cbgaagcccb gcgtgaagct 2280 
gacccccctg bgcgtgagcc bgaagtgcac cgaccbgaag aacgacacca acaccaacag 2340 
cagcagcggc cgcabgatca bggagaaggg cgagatcaag aactgcagct bcaacabcag 2400 
caccagcabc cgcggcaagg bgcagaagga gbacgccbbc tbctacaagc tggacabcat 2460 
ccccabcgac aacgacacca ccagctacag ccbgaccagc bgcaacacca gcgtgatcac 2520 
ccaggcctgc cccaaggbga gctbcgagcc cabccccaac cacbacbgcg cccccgccgg 2580 
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ctbcaccatc cbgaagbgca aggacaagaa 
gagcaccgtg cagtgcaccc acggcatccg 
cgaccgcctg gccgaggagg aggtggtgat 
ggtgatcatc gtgcagctga acgagagcgt 
cacccgcaag agcatccaca tcggccccgg 
cggcgacatc cgccaggccc acbgcaaccb 
caagatcgtg atcaagctgc gcgagcagtt 
cagcggcggc gaccccgaga tcgtgaccca 
ctgcaacagc acccagctgt tcaacagcac 
cagcaacaac accgagggca gcgacaccat 
caacatgtgg cagaaggtgg gcaaggccat 
ctgcagcagc aacatcaccg gccbgcbgcb 
gagcgagatc ttccgcctgg gcggcggcga 
caagtacaag gtggtgaaga tcgagcccct 
cgtggtgcag cgcgagaagc gcgccgtggg 
cgccgccggc agcaccatgg gcgccgccag 
gctgagcggc atcgtgcagc agcagaacaa 
cctgctgcag ctgaccgtgt ggggcatcaa 
gcgctacctg aaggaccagc agcbgctggg 
caccaccgcc gtgccctgga acgccagctg 
ccacaccacc tggatggagt gggaccgcga 
cctgatcgag gagagccaga accagcagga 
caagtgggcc agcctgtgga acbggbtcaa 
gttcatcatg abcgbgggcg gccbggtggg 
cgtgaaccgc gtgcgccagg gctacagccc 
ccgcggcccc gaccgccccg agggcatcga 
cacgatccgc ctggtgaacg gcagcctggc 
ccbgbbcagc taccaccgcc tgcgcgacct 
gctgggccgc cgcggctggg aggccctgaa 
ccaggagctg aagaacagcg ccgtgagcct 
gtggagcaag agcagcgtga tcggctggcc 
gcccgccgcc gacggcgtgg gcgccgtgag 
cagcagcaac accgccgcca acaacgccgc 
ggaggtgggc htcgcagtga ccgcacaggt 
cgbggaccbg agccacttcc tgaaggagaa 
gcgccgccag gacatcctgg acchgtggat 
gcagaactac acccccggcc ccggcgtgcg 
gcbggbgccc gtggagcccg acaaggtgga 
gctgcacccc gtgagcctgc acggcatggc 
cttcgacagc cgcctggcct tccaccacgt 
gaactgctga ggabccggab ccagatctgc 
tbgccccbcc cccgtgcctt ccfctgaccct 
ataaaatgag gaaattgcat cgcabbgbcb 
ggtggggcag gacagcaagg gggaggattg 
ggtgggctct abgggbaccc aggtgctgaa 
gaagcaggca catccccttc tctgtgacac 
agccccactc ataggacact catagctcag 
agtacttgga gcggtctctc cctccctcat 
agtgggaaga aattaaagca agabaggcba 
acatgtgagg aagtaatgag agaaatcata 
ccgcttcctc gctcactgac tcgctgcgct 
ctcactcaaa ggcggtaata cggbbabcca 
bgbgagcaaa aggccagcaa aaggccagga 
tccataggct ccgcccccct gacgagcatc 
gaaacccgac aggactataa agataccagg 



gttcaacggc aagggcccct gcaccaacgt 264 0 
ccccgtggtg agcacccagc tgctgctgaa 2700 
ccgcagcgcc .aacttcgccg acaacgccaa 2760 
ggagatcaac tgcacccgcc ccaacaacaa 2820 
ccgcgccttc tacaccaccg gcgagatcat 2880 
gagccgcgcc aaghggaacg acacccbgaa 2940 
cggcaacaag accatcgtgt tcaagcacag 3000 
cagcttcaac tgcggcggcg agbbcbtcba 3060 
ctggtbcaac agcacctgga gcaccgaggg 312 0 
cacccbgccc tgccgcatca agcagatcat 3180 
gtacgccccc cccabcagcg gccagatccg 3240 
gacccgcgac ggcggcaaca gcaacaacga 3300 
catgcgcgac aactggcgca gcgagctgta 33 60 
gggcgfcggcc cccaccaagg ccaagcgccg 342 0 
catcggcgcc ctgttcctgg gcttcctggg 3480 
catgaccctg accgtgcagg cccgccagcb 354 0 
cctgctgcgc gccatcgagg cccagcagca 3 60 0 
gcagctgcag gcccgcatcc tggccgtgga 3 660 
catctggggc tgcagcggca agcttatctg 3720 
gagcaacaag agcctggagc agatctggaa 3780 
gatcaacaac tacaccagcc tgatccacag 3840 
gaagaacgag caggagctgc tggagctgga 3900 
catcaccaac tggcbgbggb acatcaagct 3960 
cctgcgcatc gbgbbcgccg tgctgagcat 4020 
cctgagcttc cagacccacc tgcccatccc 4080 
ggaggagggc ggcgagcgcg accgcgaccg 414 0 
cctgatctgg gacgacctgc gcagcctgtg 4200 
gcbgcbgabc gtgacccgca bcgbggagcb 4260 
gtactggtgg aaccbgctgc agbacbggag 4320 
gctgaacgcc accgccatca tgggcggcaa 43 8 0 
cgccgtgcgc gagcgcatgc gccgcgccga 4 44 0 
ccgcgacctg gagaagcacg gcgccatcac 4500 
ctgcgcctgg ctggaggccc aggaggagga 4560 
ggcacbgcgc gcaabgaccb acaaggccgc 4620 
gggcggcccg gagggcctga tccacagcca 4680 
ctaccacacc cagggctact tccccgactg 4740 
ctaccccctg acchtcggct ggtgctacaa 4800 
ggaggccaac aagggcgaga acaccagcct 4860 
agcacccgag cgcgaggtgc tggagtggcg 4 920 
ggcccgcgag ctgcaccccg agtacttcaa 4980 
tgtgccttct agttgccagc cafccbgttgt 5040 
ggaaggtgcc actcccactg tcctttccta 5100 
gagtaggtgt cattctattc tggggggtgg 5160 
ggaagacaat agcaggcatg ctggggatgc 5220 
gaattgaccc ggttcctcct gggccagaaa 52 80 
accctgtcca cgccccbggt bcbbagtbcc 5340 
gagggctccg ccbbcaabcc cacccgcbaa 5400 
cagcccacca aaccaaaccb agccbccaag 5460 
bbaagbgcag agggagagaa aabgcctcca 5520 
gaatbbbaag gccabcabgg ccbbaabcbt 5580 
cggtcgbbcg gctgcggcga gcggbabcag 5640 
cagaabcagg ggabaacgca ggaaagaaca 5700 
accgbaaaaa ggccgcgbtg cbggcgtttt 5760 
acaaaaabcg acgcbcaagb cagaggtggc 582 0 
cgbbbccccc bggaagcbcc cbcgbgcgct 5880 
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ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg 5940 
bggcgctbtc bcabagctca cgctgtaggt atctcagttc ggbgbaggtc gttcgctcca 6000 
agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaacb 6060 
abcgbctbga gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta 6120 
acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcbbgaag tggtggccta 6180 
actacggcta cactagaaga acagbabbbg gtatctgcgc tctgctgaag ccagttacct 6240 
tcggaaaaag agtbggbagc tcttgatccg gcaaacaaac caccgctggt agcggbggbc 6300 
tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga 6360 
bcbbtbcbac ggggbcbgac gctcagtgga acgaaaacbc acgttaaggg abbbbggbca 6420 
bgagabbabc aaaaaggabc ttcacctaga tccbbbbaaa bbaaaaabga agttttaaat 6480 
caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgcbba atcagtgagg 6540 
cacctatctc agcgatctgt cbabbbcgbb catccabagb bgcctgacbc gggggggggg 6600 
ggcgcbgagg bcbgccbcgb gaagaaggtg tbgcbgactc abaccaggcc bgaabcgccc 6660 
cabcatccag ccagaaagtg agggagccac ggttgabgag agcbbbgbtg baggtggacc 6720 
agtbggbgab bttgaacbtt tgcbtbgcca cggaacggtc bgcgtbgbcg ggaagatgcg 6780 
bgabcbgatc cttcaacbca gcaaaagbtc gabtbatbca acaaagccgc cgbcccgbca 6840 
agtcagcgta atgcbcbgcc agbgtbacaa ccaabbaacc aabbctgabb agaaaaactc 6900 
abcgagcabc aaabgaaact gcaatbbabt catatcagga tbatcaabac catatbttbg 6960 
aaaaagccgb bbctgbaatg aaggagaaaa cbcaccgagg cagbbccaba ggabggcaag 7020 
atccbggtab cggbcbgcga bbccgacbcg bccaacabca abacaaccba btaabtbccc 7080 
ctcgbcaaaa ataaggttab caagtgagaa atcaccatga gbgacgactg aatccggtga 7140 
gaatggcaaa agcbbatgca tbbcbbbcca gacttgbbca acaggccagc cabbacgcbc 7200 
gbcatcaaaa bcactcgcat caaccaaacc gtbabbcatt cgbgabbgcg ccbgagcgag 72 60 
acgaaabacg cgatcgcbgb baaaaggaca atbacaaaca ggaabcgaat gcaaccggcg 7320 
caggaacacb gccagcgcab caacaababb bbcaccbgaa bcaggababb cbbcbaabac 73 80 
cbggaabgcb gbbbtcccgg ggabcgcagb ggbgagbaac cabgcabcab caggagbacg 7440 
gataaaabgc bbgabggbcg gaagaggcat aaabtccgbc agccagbbba gbctgaccab 7500 
cbcabctgta acabcabbgg caacgcbacc tbtgccabgt ttcagaaaca acbcbggcgc 7560 
abcgggctbc ccatacaabc gabagabbgb cgcaccbgab bgcccgacat babcgcgagc 7620 
ccabbbabac ccababaaat cagcabccab gbbggaatbb aabcgcggcc bcgagcaaga 7680 
cgbtbcccgb bgaababggc bcabaacacc cctbgbatba cbgbbbatgb aagcagacag 7740 
bbtbatbgtb catgatgaba babbbttabc bbgbgcaabg baacabcaga gabbbbgaga 7800 
cacaacgbgg cbbbcccccc ccccccabba bbgaagcabb tabcagggbb abbgbctcab 7860 
gagcggabac ababbbgaab gbatbbagaa aaabaaacaa abaggggbbc cgcgcacabb 7920 
bccccgaaaa gbgccaccbg acgbctaaga aaccabbabb abcatgacat taaccbataa 7980 
aaabaggcgb abcacgaggc ccbtbcgbc 80Q9 



<210> 3 
<211> 7558 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVRl012x/s containing HIV genes 

<400> 3 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggtca 60 

cagcbbgbcb gtaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 120 

bbggcgggbg bcggggctgg cbbaacbabg cggcabcaga gcagabbgba ctgagagbgc 180 

accatabgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 

cbabbggcca bbgcabacgb bgbabccaba bcabaabatg bacabbbaba bbggcbcabg 300 

bccaacatba ccgccabgbb gacatbgabt abbgactagb babbaabagb aabcaabbac 360 

ggggbcatba gtbcabagcc cabababgga gttccgcgbb acabaactba cggbaaabgg 420 
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cccgcctggc bgaccgccca acgacccccg cccattgacg tcaataatga cgtabgtbcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt bacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tabgccaagt acgcccccta ttgacgtcaa 600 
bgacggbaaa bggcccgccb ggcabtatgc ccagtacatg accttatggg actttcctac 660 
ttggcagbac atctacgtat tagtcatcgc babbaccabg gbgabgcggb bbbggcagba 720 
cabcaabggg cgtggatagc ggtttgactc acggggattt ccaagbcbcc accccatbga 780 
cgtcaatggg v agbbbgbbbb ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg bgggaggbcb ababaagcag 900 
agctcgttba gbgaaccgbc agabcgccbg gagacgccab ccacgcbgtt tbgaccbcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggbgcabtg gaacgcggab 1020 
tccccgbgcc aagagtgacg taagtaccgc cbabagacbc babaggcaca cccctbbggc 1080 
bcbbabgcab gcbatactgb tbbbggcbbg gggccbatac acccccgcbb ccfctabgcba 1140 
baggbgabgg babagcttag cctabaggbg bgggtbatbg accabbatbg accacbcccc 1200 
tabbggbgac gabacbbbcc abbacbaabc cataacabgg cbcbbbgcca caactatcbc 1260 
tabbggcbab abgccaabac bcbgbccbbc agagacbgac acggacbcbg batbtbtaca 1320 
ggabggggbc ccabbtabba btbacaaatt cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbtbtt attaaacata gcgbgggatc bccacgcgaa tcbcgggbac gbgbtccgga 1440 
catgggcbcb tctccggbag cggcggagcb bccacatccg agcccbggbc ccatgccbcc 1500 
agcggctcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acttaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgbcb 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag ctgagbbgtb gbabbcbgab aagagbcaga ggbaacbccc 1740 
gbtgcggtgc bgbtaacggb ggagggcagb gbagbcbgag cagbacbcgb tgcbgccgcg 1800 
cgcgccacca gacabaabag cbgacagacb aacagacbgb tccbbbccab gggbcbbbtc 1860 
bgcagbcacc gbcgbcgaca cgbgbgabca gababcgcgg ccgcbctaga caccabgcgc 1920 
gbgaaggaga agbaccagca ccbgbggcgc bggggcbggc gcbggggcac cabgcbgcbg 1980 
ggcabgcbga bgabcbgcag cgccaccgag aagcbgbggg bgaccgbgba cbacggcgbg 204 0 
cccgtgtgga aggaggccac caccaccctg cbcbgcgcca gcgacgccaa ggccbacgac 2100 
accgaggbgc acaacgbgbg ggccacccac gccbgcgbgc ccaccgaccc caacccccag 2160 
gaggtggbgc bggtgaacgb gaccgagaac tbcgacabgb ggaagaacga cabggbggag 2220 
cagatgcacg aggacabcab cagccbgbgg gaccagagcc bgaagcccbg cgbgaagcbg 2280 
accccccbgb gcgbgagccb gaagbgcacc gaccbgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcab ggagaagggc gagabcaaga acbgcagcbt caacabcagc 24 00 
accagcabcc gcggcaaggb gcagaaggag bacgccbbcb bcbacaagct ggacatcabc 24 60 
cccabcgaca acgacaccac cagcbacagc cbgaccagcb gcaacaccag cgbgatcacc 2520 
caggccbgcc ccaaggbgag cbbcgagccc abccccaacc acbacbgcgc ccccgccggc 2580 
bbcaccabcc bgaagbgcaa caacaagacc bbcaacggca ccggccccbg caccaacgbg 2640 
agcaccgbgc agbgcaccca cggcatccgc cccgbggbga gcacccagcb gcbgctgaac 2700 
ggcagccbgg ccgaggagga ggbggbgabc cgcagcgbga acbbcaccga caacgccaag 2760 
accabcabcg bgcagctgaa caccagcgtg gagabcaacb gcacccgccc caacaacaac 2820 
acccgcaagc gaatbcgcat ccagcgcggc cccggccgcg cctbcgbgac cabcggcaag 2880 
abcggcaaca bgcgccaggc ccacbgcaac atcagccgcg ccaagbggaa caacacccbg 2940 
aagcagabcg acagcaagct gcgcgagcag bbcggcaaca acaagaccab cabcbbcaag 3000 
cagagcagcg gcggcgaccc cgagabcgbg acccacagcb bcaacbgcgg cggcgagbbc 3060 
bbcbacbgca acagcaccca gcbgtbcaac agcaccbggb bcaacagcac cbggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accabcaccc bgcccbgccg catcaagcag 3180 
abcabcaaca bgbggcagaa ggbgggcaag gccabgbacg ccccccccab cagcggccag 3240 
abccgcbgca gcagcaacab caccggccbg cbgcbgaccc gcgacggcgg caacagcaac 3300 
aacgagagcg agabcbbccg ccbgggcggc ggcgacabgc gcgacaacbg gcgcagcgag 3360 
cbgbacaagb acaaggbggb gaagabcgag ccccbgggcg bggcccccac caaggccaag 3420 
cgccgcgtgg bgcagcgcga gaagcgcgcc gbgggcabcg gcgcccbgbb ccbgggcbbc 34B0 
ctgggcgccg ccggcagcac cabgggcgcc gccagcabga cccbgaccgb gcaggcccgc 3540 
cagcbgctga gcggcabcgb gcagcagcag aacaaccbgc tgcgcgccab cgaggcccag 3600 
cagcaccbgc bgcagcbgac cgbgbggggc abcaagcagc tgcaggcccg cabccbggcc 3660 
gtggagcgcb accbgaagga ccagcagcbg cbgggcabcb ggggcbgcag cggcaagcbb 3720 
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abcbgcacca ccgccgbgcc ctggaacgcc agctggagca acaagagcct ggagcagatc 3780 
tggaaccaca ccacctggat ggagbgggac cgcgagatca acaactacac cagcctgatc 3 84 0 
cacagcctga tcgaggagag ccagaaccag caggagaaga acgagcagga gctgctggag 3900 
ctggacaagt gggccagcab gggcggcaag tggagcaaga gcagcgtgab cggctggccc 3960 
gccgtgcgcg agcgcatgcg ccgcgccgag cccgccgccg acggcgtggg cgccgtgagc 4020 
cgcgacctgg agaagcacgg cgccatcacc agcagcaaca ccgccgccaa caacgccgcc 4080 
tgcgcctggc bggaggccca ggaggaggag gaggtgggct tcgcagtgac cgcacaggtg 4140 
gcactgcgcg caatgaccta caaggccgcc gtggacctga gccacttcct gaaggagaag 4200 
ggcggcctgg agggcctgat ccacagccag cgccgccagg acatcctgga cctgtggatc 4260 
taccacaccc agggctactt ccccgactgg cagaactaca cccccggccc cggcgtgcgc 4320 
taccccctga ccttcggctg gtgctacaag ctggtgcccg tggagcccga caaggtggag 4380 
gaggccaaca agggcgagaa caccagcctg ctgcaccccg tgagcctgca cggcatggca 4440 
gcacccgagc gcgaggtgct ggagtggcgc ttcgacagcc gcctggcctt ccaccacgtg 4500 
gcccgcgagc tgcaccccga gtacttcaag aactgctgag gatccggatc cagatctgct 4560 
gtgccttcta gttgccagcc abcbgtbgbb tgcccctccc ccgtgccttc cbbgacccbg 4620 
gaaggtgcca ctcccactgt ccttbcctaa taaaatgagg aaattgcatc gcattgtctg 4680 
agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg 4740 
gaagacaata gcaggcatgc tggggatgcg gtgggctcta tgggtaccca ggbgctgaag 4800 
aattgacccg gtbccbccbg ggccagaaag aagcaggcac atccccttct cfcgtgacaca 4 860 
ccctgtccac gcccctggtt ctbagbbcca gccccactca taggacactc atagctcagg 4920 
agggctccgc cttcaatccc acccgctaaa gtacttggag cggtctctcc ctccctcatc 4980 
agcccaccaa accaaaccta gcctccaaga gtgggaagaa attaaagcaa gataggctat 5040 
taagtgcaga gggagagaaa atgcctccaa catgtgagga agtaatgaga gaaatcatag 5100 
aabbbbaagg ccatcatggc cbbaabctbc cgcttcctcg cbcacbgact cgctgcgctc 5160 
ggtcgttcgg ctgcggcgag cggbabcagc tcactcaaag gcggtaatac ggttatccac 5220 
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 5280 
ccgtaaaaag gccgcgttgc tggcghtttt ccataggctc cgcccccctg acgagcatca 5340 
caaaaatcga cgctcaagtc agaggbggcg aaacccgaca ggactataaa gataccaggc 5400 
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 5460 
cctgtccgcc bbbcbcccbt cgggaagcgt ggcgctttct catagctcac gctgfcaggta 5520 
tctcagttcg gtgtaggtcg bbcgcbccaa gctgggctgt gtgcacgaac cccccgttca 5580 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 564 0 
cttatcgcca cbggcagcag ccactggtaa caggahtagc agagcgaggt atgtaggcgg 5700 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 5760 
tatctgcgct cbgcbgaagc cagttacctt cggaaaaaga gthggtagct cttgatccgg 5820 
caaacaaacc accgctggba gcggtggbbb ttttgtttgc aagcagcaga ttacgcgcag 5880 
aaaaaaagga tctcaagaag atcctttgat cbbbbctacg gggtctgacg ctcagtggaa 594 0 
cgaaaactca cgbtaaggga fctttggtcat gagattatca aaaaggatct tcaccbagat 6000 
cctbbtaaab baaaaabgaa gbbbbaaabc aabcbaaagb abababgagt aaacbbggbc 6060 
bgacagbbac caabgcbbaa bcagbgaggc accbabcbca gcgabcbgbc babbbcgbbc 612 0 
abccabagbt gcctgacbcg gggggggggg gcgcbgaggb cbgccbcgbg aagaaggbgb 6180 
tgcbgactca baccaggccb gaabcgcccc atcabccagc cagaaagbga gggagccacg 624 0 
gttgatgaga gcbttgbbgb aggbggacca gbbggtgabb bbgaacbbbt gcbtbgccac 6300 
ggaacggbcb gcgbbgbcgg gaagabgcgb gabcbgabcc bbcaacbcag caaaagtbcg 6360 
abtbabbcaa caaagccgcc gbcccgbcaa gtcagcgbaa bgcbcbgcca gbgbbacaac 6420 
caabbaacca abbcbgabba gaaaaactca tcgagcabca aabgaaacbg caabbbabbc 6480 
abatcaggat babcaabacc ababbbbtga aaaagccgbb bcbgbaabga aggagaaaac 654 0 
bcaccgaggc agbtccabag gabggcaaga tccbggbabc ggbcbgcgab bccgacbcgb 6600 
ccaacabcaa bacaaccbat baabbbcccc bcgbcaaaaa baaggbbabc aagtgagaaa 6660 
tcaccabgag bgacgacbga abccggbgag aabggcaaaa gcbbabgcab bbctbbccag 6720 
acbbgbbcaa caggccagcc abbacgcbcg bcabcaaaab cacbcgcabc aaccaaaccg 6780 
tbabbcabbc gbgabbgcgc cbgagcgaga cgaaabacgc gabcgcbgbb aaaaggacaa 6840 
tbacaaacag gaatcgaabg caaccggcgc aggaacacbg ccagcgcabc aacaatabbb 6900 
tcaccbgaab caggababbc btcbaabacc tggaabgcbg bbbbcccggg gabcgcagbg 6960 
gbgagbaacc abgcabcabc aggagbacgg abaaaabgcb bgabggbcgg aagaggcaba 7020 
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aatbccgbca gccagtttag tctgaccatc 
bbgccabgbb tcagaaacaa ctctggcgca 
gcacctgatt gcccgacatt atcgcgagcc 
ttggaattta atcgcggccb cgagcaagac 
cbbgbabtac bgbttatgba agcagacagt 
tgtgcaabgb aacatcagag attttgagac 
tgaagcattt atcagggtta ttgtctcatg 
aataaacaaa taggggttcc gcgcacattt 
accabbabba bcabgacabt aacctataaa 



tcatctgtaa catcattggc aacgctacct 7080 
tcgggcttcc catacaatcg abagattgtc 7140 
cabttabacc catataaatc agcatccahg 7200 
gtttcccgtt gaatabggcb cataacaccc 7260 
ttbatbgbbc atgatgatat abbbtbabct 7320 
acaacgtggc tttccccccc cccccattat 7380 
agcggataca tatttgaatg tatttagaaa 7440 
ccccgaaaag tgccacctga cgtctaagaa 7500 
aataggcgta tcacgaggcc cbttcgtc 755B 



<210> 4 
<211> 7351 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 4 

tcgcgcgbbt cggtgatgac ggbgaaaacc tcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgtcb gbaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggbg bcggggctgg cbtaacbatg cggcabcaga gcagattgba ctgagagbgc 180 
accatatgcg gtgtgaaaba ccgcacagab gcgbaaggag aaaabaccgc atcagabtgg 240 
cbabtggcca ttgcabacgt tgbabccata bcataababg tacatbbata tbggcbcabg 300 
tccaacabba ccgccatgbt gacabbgabt abbgactagb babbaatagt aabcaabtac 360 
ggggbcabba gttcabagcc catababgga gbbccgcgtb acabaactba cggtaaabgg 420 
cccgcctggc bgaccgccca acgacccccg cccatbgacg bcaataatga cgbabgbbcc 480 
catagbaacg ccaataggga cttbccabbg acgbcaatgg gbggagtabb bacggbaaac 540 
tgcccacbbg gcagtacabc aagbgbabca babgccaagt acgcccccta ttgacgbcaa 600 
tgacggbaaa bggcccgcct ggcatbabgc ccagtacatg acctbabggg acbtbccbac 660 
tbggcagbac abctacgbab tagbcabcgc tabbaccabg gbgabgcggb tttggcagta 720 
cabcaabggg cgbggatagc ggtbtgacbc acggggabbt ccaagbcbcc accccabbga 780 
cgbcaabggg agbbtgbbtb ggcaccaaaa bcaacgggac tbbccaaaab gtcgtaacaa 840 
cbccgcccca bbgacgcaaa tgggcggtag gcgtgbacgg bgggaggtcb ababaagcag 900 
agcbcgtbba gtgaaccgtc agatcgccbg gagacgccat ccacgcbgbb bbgaccbcca 960 
tagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcabbg gaacgcggab 1020 
tccccgbgcc aagagbgacg taagbaccgc ctabagacbc babaggcaca cccctbbggc 1080 
tctbatgcab gcbatactgb tttbggctbg gggccbabac acccccgcbb cctbatgcta 1140 
taggbgabgg babagcbbag cctabaggbg tgggttabbg accatbabbg accactcccc 1200 
tabbggbgac gatacbbbcc abtacbaatc cataacabgg cbctbbgcca caacbatcbc 1260 
tabbggcbab abgccaabac tctgtccbbc agagacbgac acggacbcbg batttbtaca 1320 
ggabggggbc ccabtbabba tbtacaaatt cacatabaca acaacgccgt cccccgbgcc 1380 
cgcagbbbbb abbaaacaba gcgbgggatc bccacgcgaa tcbcgggbac gbgtbccgga 1440 
cabgggcbct bcbccggbag cggcggagct bccacatccg agcccbggbc ccabgccbcc 1500 
agcggcbcab ggtcgcbcgg cagcbccbbg ctcctaacag tggaggccag acbtaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg tggcggbagg gtabgbgbcb 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gbabtctgat aagagbcaga ggbaactccc 174 0 
gbbgcggbgc bgbbaacggt ggagggcagb gtagtctgag cagtacbcgb tgctgccgcg 1800 
cgcgccacca gacataabag ctgacagact aacagactgt tccttbccab gggtcbbbbc 1860 
tgcagbcacc gtcgtcgaca cgtgbgatca gabatcgcgg ccgcbcbaga caccabgcgc 1920 
gtgaaggaga agbaccagca cctgbggcgc bggggctggc gcbggggcac cabgcbgcbg 1980 
ggcatgctga bgabcbgcag cgccaccgag aagctgbggg bgaccgbgba ctacggcgbg 2040 
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cccgtgtgga aggaggccac caccaccctg 
accgaggtgc acaacgtgtg ggccacccac 
gaggtggtgc tggtgaacgt gaccgagaac 
cagatgcacg aggacatcat cagcctgtgg 
accccccbgt gcgtgagccb gaagtgcacc 
agcagcggcc gcatgatcat ggagaagggc 
accagcatcc gcggcaaggt gcagaaggag 
cccatcgaca acgacaccac cagctacagc 
caggcctgcc ccaaggtgag cbtcgagccc 
ttcaccatcc tgaagtgcaa caacaagacc 
agcaccgbgc agtgcaccca cggcatccgc 
ggcagcctgg ccgaggagga ggtggtgatc 
accatcatcg tgcagctgaa caccagcgtg 
acccgcaagc gaattcgcat ccagcgcggc 
atcggcaaca tgcgccaggc ccactgcaac 
aagcagabcg acagcaagct gcgcgagcag 
cagagcagcg gcggcgaccc cgagatcgtg 
ttctactgca acagcaccca gctgttcaac 
gagggcagca acaacaccga gggcagcgac 
abcabcaaca tgtggcagaa ggtgggcaag 
atccgctgca gcagcaacat caccggcctg 
aacgagagcg agatcttccg cctgggcggc 
ctgtacaagt acaaggtggt gaagabcgag 
cgccgcgtgg tgcagcgcga gaagcgcgcc 
ctgggcgccg ccggcagcac catgggcgcc 
cagctgctga gcggcatcgt gcagcagcag 
cagcacctgc bgcagcbgac cgtgtggggc 
gbggagcgcb acctgaagga ccagcagctg 
ggbggcaagb ggtcaaaaag bagbgbgabb 
cgagcbgagc cagcagcaga bggggbggga 
gcaatcacaa gtagcaatac agcagctaac 
gaggaggaag aggbgggbbb bccagbcaca 
aaggcagctg tagatctbag ccacbbbbba 
cactcccaaa gaagacaaga babccbbgab 
ccbgabbggc agaactacac accagggcca 
tgctacaagc tagtaccagt tgagccagat 
accagcttgt tacaccctgt gagccbgcab 
gagtggaggt ttgacagccg cctagcatbb 
tacbbcaaga actgctgagg atccagatct 
gtttgcccct cccccgtgcc bbccbbgacc 
taataaaatg aggaaattgc atcgcattgt 
ggggtggggc aggacagcaa gggggaggat 
gcggbgggcb ctatgggtac ccaggtgctg 
aagaagcagg cacatcccct tctctgtgac 
ccagccccac tcabaggaca ctcatagctc 
aaagtacttg gagcggtctc tccctccctc 
agagtgggaa gaaabbaaag caagataggc 
caacatgtga ggaagtaatg agagaaatca 
ttccgcttcc tcgctcactg actcgctgcg 
agctcactca aaggcggbaa tacggttatc 
catgtgagca aaaggccagc aaaaggccag 
tttccatagg cbccgccccc ctgacgagca 
gcgaaacccg acaggactat aaagatacca 
cbcbccbgbb ccgaccctgc cgcttaccgg 
cgtggcgctt tctcatagct cacgcbgbag 



cbcbgcgcca gcgacgccaa ggcctacgac 2100 
gcctgcgtgc ccaccgaccc caacccccag 2160 
bbcgacabgb ggaagaacga catggtggag 2220 
gaccagagcc tgaagccctg cgtgaagctg 2280 
gacctgaaga acgacaccaa caccaacagc 2340 
gagatcaaga acbgcagcbb caacatcagc 2400 
tacgccttct tctacaagct ggacatcatc 2460 
ctgaccagct gcaacaccag cgtgatcacc 2520 
abccccaacc actacbgcgc ccccgccggc 2580 
bbcaacggca ccggccccbg caccaacgbg 264 0 
cccgtggbga gcacccagct gctgctgaac 2700 
cgcagcgtga actbcaccga caacgccaag 2760 
gagabcaacb gcacccgccc caacaacaac 2820 
cccggccgcg cctbcgtgac catcggcaag 2880 
abcagccgcg ccaagbggaa caacacccbg 2940 
ttcggcaaca acaagaccat cabcttcaag 3000 
acccacagcb bcaacbgcgg cggcgagbbc 3060 
agcaccbggb bcaacagcac cbggagcacc 3120 
accabcaccc bgccctgccg catcaagcag 3180 
gccabgbacg ccccccccat cagcggccag 3240 
ctgcbgaccc gcgacggcgg caacagcaac 3300 
ggcgacatgc gcgacaacbg gcgcagcgag 3360 
ccccbgggcg tggcccccac caaggccaag 3420 
gbgggcabcg gcgccctgbt ccbgggcbbc 34 80 
gccagcabga cccbgaccgt gcaggcccgc 3540 
aacaaccbgc bgcgcgccat cgaggcccag 3600 
abcaagcagc bgcaggcccg cabcctggcc 3660 
cbgggcabcb ggggcbgcag cggcaagcbt 3720 
ggabggcctg cbgtaaggga aagaatgaga 3780 
gcagbabctc gagaccbaga aaaacabgga 3840 
aabgcbgctb gbgccbggct agaagcacaa 3900 
ccbcaggbac cbttaagacc aabgacbbac 3960 
aaagaaaagg ggggacbgga agggctaabb 4020 
cbgbggabcb accacacaca aggcbacbbc 4080 
ggggbcagab abccacbgac cbbbggabgg 4140 
aaggbagaag aggccaabaa aggagagaac 4200 
ggaabggabg accctgagag agaagtgbba 4260 
cabcacgbgg cccgagagcb gcabccggag 4320 
gcbgbgccbb cbagbbgcca gccabcbgbb 4380 
cbggaaggbg ccacbcccac bgbccbbbcc 4440 
ctgagbaggb gbcabbcbab bcbggggggt 4500 
tgggaagaca abagcaggca bgctggggab 4560 
aagaabbgac ccggbbcctc ctgggccaga 4620 
acacccbgtc cacgccccbg gtbcbbagbb 4680 
aggagggctc cgccbbcaab cccacccgct 4740 
abcagcccac caaaccaaac cbagccbcca 4800 
babbaagbgc agagggagag aaaabgccbc 4860 
bagaabbbba aggccabcab ggccbbaabc 4920 
cbcggbcgtb cggcbgcggc gagcggtabc 4980 
cacagaabca ggggabaacg caggaaagaa 5040 
gaaccgbaaa aaggccgcgb bgcbggcgbb 5100 
bcacaaaaab cgacgcbcaa gbcagaggtg 5160 
ggcgbbbccc ccbggaagcb cccbcgbgcg 5220 
abaccbgbcc gccbbbcbcc cbbcgggaag 52 80 
gbabcbcagb bcggbgbagg bcgbbcgcbc 5340 
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caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 5400 
ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 5460 
taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 5520 
taactacggc tacactagaa gaacagtatt bggtatctgc gctctgctga agccagttac 5580 
cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg 5640 
tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt 5700 
gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt 5760 
catgagatta tcaaaaagga tcthcaccta gatcctttta aattaaaaat gaagttttaa 5820 
atcaatctaa agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga 5880 
ggcacctatc tcagcgatct gtctatttcg ttcatccata gttgcctgac tcgggggggg 5940 
99ggcgctga ggtctgcctc gtgaagaagg tgttgctgac tcataccagg cctgaatcgc 6000 
cccatcatcc agccagaaag tgagggagcc acggttgatg agagctttgt tgtaggtgga 6060 
ccagttggtg attttgaact tttgctttgc cacggaacgg tctgcghtgt cgggaagatg 6120 
cgtgatctga tccttcaact cagcaaaagt tcgatttatt caacaaagcc gccgtcccgt 6180 
caagtcagcg taatgctctg ccagtgttac aaccaattaa ccaattctga ttagaaaaac 6240 
tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat accatatttt 6300 
tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca taggatggca 6360 
agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc tattaatttc 6420 
ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac tgaatccggt 6480 
gagaatggca aaagcttatg catttctttc cagacttgtt caacaggcca gccattacgc 6540 
tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg cgcctgagcg 6600 
agacgaaata cgcgatcgct gttaaaagga caattacaaa caggaatcga atgcaaccgg 6660 
cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata ttcttctaat 6720 
acctggaatg ctgttttccc ggggatcgca gtggtgagta accabgcatc atcaggagta 6780 
cggataaaat gcttgafcggt cggaagaggc ataaattccg tcagccagtt tagtctgacc 6840 
atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa caactctggc 6900 
gcatcgggct tcccatacaa tcgatagatt gtcgcacctg attgcccgac attatcgcga 6960 
gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg cctcgagcaa 7020 
gacgtttccc gttgaatatg gctcataaca ccccttgtat tactgtttat gtaagcagac 7080 
agttttattg ttcatgatga fcafcatfcttta tctbgtgcaa tgtaacatca gagattttga 7140 
gacacaacgt ggctttcccc ccccccccat tattgaagca tttatcaggg ttattgtctc 7200 
atgagcggat acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca 72 60 
bttccccgaa aagtgccacc tgacghctaa gaaapcatta ttatcatgac attaacctat 7320 
aaaaataggc gtatcacgag gccctttcgt c 7351 



<210> 5 
<211> 8008 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<4O0> 5 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 3 00 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
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tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta bbgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagbacabg accttatggg actttcctac 660 
bbggcagbac atctacgtat tagbcabcgc babbaccabg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac bbtccaaaab gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgbgbacgg tgggaggtct atataagcag 900 
agcbcgfcbta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcabtg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagbaccgc ctafcagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg babagcttag cctataggtg bgggbbabbg accabbabtg accactcccc 1200 
battggbgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tathggctat atgccaatac tctgtccttc agagactgac acggactctg tatbbbtaca 1320 
ggabggggbc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagabtg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagbtgbt gbabbctgab aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgb tcctttccat gggtcttttc 1860 
tgcagbcacc gbcgbcgaca cgbgbgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac cabgcbgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 2040 
cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtgaacgt gaccgagaac ttcgacatgt ggaagaacga catggtggag 2220 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 22 80 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcat ggagaagggc gagatcaaga acbgcagcbt caacatcagc 24 00 
accagcatcc gcggcaaggb gcagaaggag tacgccttct tctacaagct ggacatcabc 2460 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgtgatcacc 2520 
caggcctgcc ccaaggtgag cttcgagccc atccccaacc actactgcgc ccccgccggc 2580 
btcaccatcc bgaagtgcaa caacaagacc tbcaacggca ccggccccbg caccaacgbg 2640 
agcaccgbgc agbgcaccca cggcabccgc cccgbggtga gcacccagct gctgctgaac 2700 
ggcagccbgg ccgaggagga ggtggtgatc cgcagcgtga acbbcaccga caacgccaag 2760 
accatcabcg bgcagctgaa caccagcgtg gagabcaacb gcacccgccc caacaacaac 2820 
acccgcaagc gaatbcgcab ccagcgcggc cccggccgcg ccbbcgtgac catcggcaag 2880 
atcggcaaca tgcgccaggc ccactgcaac atcagccgcg ccaagbggaa caacaccctg 294 0 
aagcagabcg acagcaagcb gcgcgagcag ttcggcaaca acaagaccat catcttcaag 3000 
cagagcagcg gcggcgaccc cgagatcgbg acccacagcb tcaactgcgg cggcgagtbc 3060 
tbctacbgca acagcaccca gctgttcaac agcacctggb bcaacagcac ctggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accabcaccc tgcccbgccg catcaagcag 3180 
atcabcaaca tgbggcagaa ggtgggcaag gccatgbacg ccccccccab cagcggccag 3240 
abccgctgca gcagcaacab caccggccbg cbgcbgaccc gcgacggcgg caacagcaac 33 00 
aacgagagcg agabcbtccg cctgggcggc ggcgacatgc gcgacaacbg gcgcagcgag 3360 
ctgbacaagt acaaggtggb gaagatcgag ccccbgggcg bggcccccac caaggccaag 3420 
cgccgcgtgg tgcagcgcga gaagcgcgcc gtgggcatcg gcgccctgtb cctgggcttc 3480 
ctgggcgccg ccggcagcac catgggcgcc gccagcatga cccbgaccgb gcaggcccgc 3540 
cagctgctga gcggcatcgb gcagcagcag aacaacctgc bgcgcgccab cgaggcccag 3600 
cagcacctgc tgcagctgac cgbgtggggc abcaagcagc bgcaggcccg catccbggcc 3660 
gbggagcgcb acctgaagga ccagcagcbg cbgggcatcb ggggcbgcag cggcaagctt 3720 
abcbgcacca ccgccgtgcc cbggaacgcc agcbggagca acaagagccb ggagcagatc 3780 
tggaaccaca ccacctggab ggagtgggac cgcgagatca acaacbacac cagccbgatc 3840 
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cacagcctga tcgaggagag ccagaaccag caggagaaga acgagcagga gctgctggag 3900 
ctggacaagt gggccagcct gtggaactgg ttcaacatca ccaactggct gtggtacatc 3960 
aagctgttca tcatgatcgt gggcggcctg gtgggcctgc gcatcgtght cgccgtgctg 4020 
agcatcgtga accgcgtgcg ccagggcbac agccccctga gctfcccagac ccacctgccc 4080 
atcccccgcg gccccgaccg ccccgagggc atcgaggagg agggcggcga gcgcgaccgc 4140 
gaccgcacga tccgcctggt gaacggcagc cbggccctga bcbgggacga cctgcgcagc 4200 
ctgtgcctgt tcagctacca ccgcctgcgc gacctgctgc tgatcgtgac ccgcatcgtg 4260 
gagctgctgg gccgccgcgg ctgggaggcc ctgaagtact ggtggaacct gctgcagtac 4320 
tggagccagg agctgaagaa cagcgccgtg agcctgctga acgccaccgc catcatgggc 43 80 
ggcaagtgga gcaagagcag cgbgabcggc tggcccgccg tgcgcgagcg catgcgccgc 44 4 0 
gccgagcccg ccgccgacgg cgtgggcgcc gtgagccgcg acctggagaa gcacggcgcc 45 00 
atcaccagca gcaacaccgc cgccaacaac gccgcctgcg ccbggcbgga ggcccaggag 4560 
gaggaggagg bgggcbbcgc agtgaccgca caggtggcac tgcgcgcaat gacctacaag 462 0 
gccgccgtgg acctgagcca cttcctgaag gagaagggcg gccbggaggg cctgatccac 4 680 
agccagcgcc gccaggacat cctggaccbg bggatcbacc acacccaggg ctacttcccc 4740 
gactggcaga actacacccc cggccccggc gtgcgctacc ccctgacctt cggctggtgc 4800 
bacaagcbgg tgcccgtgga gcccgacaag gtggaggagg ecaacaaggg cgagaacacc 4860 
agcctgcbgc accccgtgag cctgcacggc atggcagcac ccgagcgcga ggtgctggag 4920 
bggcgcbbcg acagccgcct ggccttccac cacgtggccc gcgagctgca ccccgagtac 4980 
btcaagaacb gctgaggatc cagatctgct gbgccbbcba gbtgccagcc atctgttgtt 5040 
tgcccctccc ccgbgcctbc cbbgacccbg gaaggtgcca chcccactgt cctbbccbaa 5100 
taaaatgagg aaattgcatc gcabbgbcbg agtaggtgtc abbcbabtcb ggggggbggg 5160 
9tggggcagg acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg 5220 
gtgggctcta tgggbaccca ggtgctgaag aattgacccg gbbccbcctg ggccagaaag 5280 
aagcaggcac atccccttct cbgbgacaca ccctgtccac gcccctggtt cttagttcca 5340 
gccccactca taggacactc atagctcagg agggctccgc cttcaatccc acccgctaaa 5400 
gtacttggag cggtctctcc ctccctcatc agcccaccaa accaaaccta gcctccaaga 5460 
gtgggaagaa abtaaagcaa gabaggcbat baagbgcaga gggagagaaa atgcctccaa 5520 
catgtgagga agtaabgaga gaaatcatag aatttfcaagg ccatcatggc cttaatcttc 5580 
cgcttcctcg ctcactgact cgcbgcgcbc ggbcgbtcgg cbgcggcgag cggtatcagc 5640 
tcactcaaag gcggtaatac ggbbabccac agaabcaggg gataacgcag gaaagaacab 5700 
gtgagcaaaa ggccagcaaa aggccaggaa ccgbaaaaag gccgcgbbgc tggcgbtbbt 5760 
ccabaggcbc cgccccccbg acgagcatca caaaaabcga cgcbcaagtc agaggtggcg 5820 
aaacccgaca ggactabaaa gabaccaggc gbtbcccccb ggaagcbccc bcgbgcgcbc 5880 
bccbgbtccg accctgccgc btaccggaba cctgtccgcc tbbcbccctb cgggaagcgb 594 0 
ggcgcbbtct cabagcbcac gcbgbaggta tctcagbtcg gbgbaggbcg bbcgctccaa 6000 
gcbgggctgb gtgcacgaac cccccgtbca gcccgaccgc tgcgccbbat ccggtaacba 6060 
bcgtcbbgag bccaacccgg baagacacga cbbatcgcca cbggcagcag ccactggbaa 6120 
caggattagc agagcgaggt atgbaggcgg tgcbacagag tbctbgaagt ggtggccbaa 6180 
cbacggcbac acbagaagaa cagbabbtgg tatcbgcgcb cbgcbgaagc cagbbaccbb 6240 
cggaaaaaga gbbggbagct ctbgabccgg caaacaaacc accgctggta gcggtggbbb 6300 
bbbbgbbtgc aagcagcaga btacgcgcag aaaaaaagga bctcaagaag abccbtbgab 6360 
cbbbbcbacg gggbcbgacg ctcagbggaa cgaaaacbca cgtbaaggga bbttggbcab 6420 
gagatbabca aaaaggabct bcaccbagat ccbbtbaaab taaaaatgaa gbtttaaatc 6480 
aabctaaagt abatabgagb aaacbbggtc tgacagbbac caabgcbbaa bcagtgaggc 6540 
accbatctca gcgabcbgtc batbbcgtbc atccabagtt gccbgacbcg gggggggggg 6600 
gcgctgaggb cbgccbcgbg aagaaggbgt tgctgacbca taccaggccb gaatcgcccc 6660 
atcatccagc cagaaagbga gggagccacg gbbgatgaga gcbbbgbbgb aggbggacca 6720 
gbbggbgabb bbgaacbbtt gcbbbgccac ggaacggbct gcgttgbcgg gaagabgcgb 6780 
gabctgabcc bbcaacbcag caaaagbbcg atbbabtcaa caaagccgcc gbcccgbcaa 6840 
gbcagcgtaa bgctctgcca gtgfcbacaac caattaacca atbcbgabta gaaaaacbca 6900 
tcgagcabca aabgaaacbg caatbbatbc ababcaggab babcaabacc atabbbbbga 6960 
aaaagccgtb bctgbaabga aggagaaaac tcaccgaggc agbfcccabag gabggcaaga 7020 
bccbggbabc ggtcbgcgat bccgacbcgb ccaacabcaa bacaaccbab baabbbcccc 7080 
bcgbcaaaaa baaggbtabc aagbgagaaa bcaccabgag bgacgacbga abccggbgag 7140 



- 13 



WO 02/32943 



PCT/US01/25721 



aatggcaaaa gcttatgcat ttctttccag acbtgbbcaa caggccagcc attacgctcg 7200 
tcatcaaaat cactcgcatc aaccaaaccg bbabbcabbc gtgattgcgc ctgagcgaga 7260 
cgaaatacgc gabcgcbgtt aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc 7320 
aggaacactg ccagcgcatc aacaatattb bcaccbgaab caggatattc btcbaabacc 7380 
fc ggaabgctg bbtbcccggg gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg 7440 
ataaaatgct tgatggtcgg aagaggcata aattccgtca gccagtttag bcbgaccabc 7500 
tcatctgtaa cabcabtggc aacgctaccb btgccabgbt bcagaaacaa ctcbggcgca 7560 
tcgggcbtcc cabacaabcg atagattgtc gcacctgabt gcccgacatb abcgcgagcc 7620 
catbtabacc cababaaatc agcabccatg ttggaabtba atcgcggcct cgagcaagac 7680 
gttbcccgbb gaababggct cabaacaccc cttgbabfcac bgttbabgba agcagacagb 7740 
bbtattgtbc atgabgatab atbtbtatct tgtgcaatgb aacabcagag atbbtgagac 7800 
acaacgbggc btbccccccc cccccattat bgaagcatbt abcagggtba ttgbcbcabg 7860 
agcggabaca babbbgaatg tafcbbagaaa aabaaacaaa baggggbbcc gcgcacatbb 7920 
ccccgaaaag bgccaccbga cgtcbaagaa accabtabba bcabgacabb aaccbabaaa 7980 
aataggcgba bcacgaggcc cbbbcgbc 8008 



<210> 6 i 
<211> 6970 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 6 

bcgcgcgbtb cggbgabgac ggbgaaaacc bcbgacacab gcagctcccg gagacggbca 60 
cagcbtgbcb gbaagcggat gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 12 0 
bbggcgggbg bcggggcbgg cbbaacbatg cggcatcaga gcagabbgba cbgagagbgc 180 
accatabgcg gbgtgaaaba ccgcacagat gcgbaaggag aaaabaccgc abcagabbgg 240 
cbabtggcca bbgcabacgb tgbatccata bcabaababg bacabbbaba btggcbcabg 300 
bccaacatba ccgccabgbt gacabbgabt abtgactagb babbaabagb aabcaabtac 360 
ggggtcabba gbbcabagcc cababatgga gbbccgcgbb acabaactta cggtaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaabaatga cgbatgbbcc 480 
cabagbaacg ccaataggga cbbbccabbg acgbcaabgg gbggagtabb bacggbaaac 54 0 
bgcccacbbg gcagbacabc aagbgbabca babgccaagb acgcccccba bbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabtabgc ccagtacabg accbbatggg acbbtccbac 660 
bbggcagbac atctacgbafc bagbcabcgc babbaccabg gbgabgcggb bbbggcagta 720 
cabcaabggg cgbggabagc ggbtbgacbc acggggabbb ccaagtctcc accccabbga 780 
cgbcaabggg agbbtgbbbt ggcaccaaaa bcaacgggac bbbccaaaab gbcgtaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggtcb ababaagcag 900 
agcbcgbbba gbgaaccgbc agabcgcctg gagacgccab ccacgcbgbb bbgaccbcca 960 
bagaagacac cgggaccgat ccagccbccg cggccgggaa cggbgcatbg gaacgcggat 1020 
bccccgbgcc aagagtgacg taagbaccgc ctabagactc babaggcaca cccctbtggc 1080 
tcbbabgcab gcbatacbgb bbbbggcbbg gggcctabac acccccgcbb ccbbabgcta 1140 
taggbgabgg batagcbbag ccbabaggbg tgggttabbg accabtabtg accacbcccc 1200 
babbggbgac gabacbbbcc abbacbaabc cabaacabgg cbcbbtgcca caactabctc 1260 
babbggcbab abgccaabac bcbgbccbbc agagacbgac acggactctg batbbbbaca 1320 
ggatggggbc ccabtbabba bttacaaabb cacatataca acaacgccgb cccccgbgcc 1380 
cgcagbbbbb abbaaacaba gcgtgggabc bccacgcgaa bcbcgggbac gtgbtccgga 1440 
cabgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
agcggcbcab ggbcgctcgg cagcbccbbg ctcctaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagtgbgccg cacaaggccg bggcggtagg gtabgbgbct 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
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gcagaagaag atgcaggcag ctgagfctgtt 
gttgcggtgc bgbtaacggb ggagggcagb 
cgcgccacca gacataatag ctgacagact 
bgcagbcacc gbcgbcgaca cgtgtgatca 
gtgaaggaga agtaccagca cctgtggcgc 
ggcatgctga bgabcbgcag cgccaccgag 
cccgtgtgga aggaggccac caccaccctg 
accgaggtgc acaacgtgtg ggccacccac 
gaggbggtgc bggbgaacgb gaccgagaac 
cagatgcacg aggacabcab cagccbgtgg 
accccccbgb gcgbgagccb gaagbgcacc 
agcagcggcc gcabgabcab ggagaagggc 
accagcatcc gcggcaaggb gcagaaggag 
cccabcgaca acgacaccac cagcbacagc 
caggccbgcc ccaaggbgag cbbcgagccc 
bbcgccatcc bgaagbgcaa caacaagacc 
agcaccgbgc agbgcaccca cggcabccgc 
ggcagccbgg ccgaggagga ggbggtgabc 
accabcabcg bgcagcbgaa caccagcgtg 
acccgcaagc gaatbcgcab ccagcgcggc 
atcggcaaca bgcgccaggc ccacbgcaac 
aagcagabcg acagcaagcb gcgcgagcag 
cagagcagcg gcggcgaccc cgagatcgbg 
bbcbacbgca acagcaccca gcbgbbcaac 
gagggcagca acaacaccga gggcagcgac 
abcabcaaca bgbggcagaa ggbgggcaag 
abccgcbgca gcagcaacab caccggccbg 
aacgagagcg agabcbbccg ccbgggcggc 
cbgbacaagb acaaggbggb gaagatcgag 
cgccgcgbgg bgcagcgcga gaagcgcgcc 
cbgggcgccg ccggcagcac cabgggcgcc 
cagcbgcbga gcggcabcgt gcagcagcag 
cagcaccbgc bgcagcbgac cgbgbggggc 
gbggagcgcb accbgaagga ccagcagcbg 
abcbgcacca ccgccgtgcc cbggaacgcc 
bggaaccaca ccaccbggab ggagbgggac 
cacagccbga tcgaggagag ccagaaccag 
cbggacaagb gggccagcct gbggaacbgg 
bccagabctg cbgbgccbbc bagbbgccag 
bcctbgaccc bggaaggbgc cacbcccacb 
bcgcabbgtc bgagtaggbg bcabbcbabb 
9999 a 99 att gggaagacaa bagcaggcat 
caggbgcbga agaatbgacc cggtbccbcc 
ctctgbgaca cacccbgbcc acgccccbgg 
tcatagcbca ggagggcbcc gccbbcaatc 
cccbcccbca bcagcccacc aaaccaaacc 
aagabaggct abbaagbgca gagggagaga 
gagaaabcab agaabbbbaa ggccabcatg 
ctcgcbgcgc bcggbcgbbc ggcbgcggcg 
acggbbabcc acagaabcag gggabaacgc 
aaaggccagg aaccgbaaaa aggccgcgbb 
bgacgagcab cacaaaaabc gacgcbcaag 
■ aagabaccag gcgbbbcccc cbggaagcbc 
gctbaccgga baccbgbccg ccbbbctccc 
acgcbgbagg batctcagbb cggbgbaggb 



gbabbcbgab aagagbcaga ggbaacbccc 1740 
gbagbcbgag cagbacbcgb bgcbgccgcg 1800 
aacagactgb tcctbbccab gggbcbbbbc 1860 
gababcgcgg ccgcbcbaga caccabgcgc 1920 
tggggcbggc gcbggggcac cabgcbgcbg 1980 
aagcbgbggg bgaccgbgba cbacggcgbg 2040 
tbcbgcgcca gcgacgccaa ggccbacgac 2100 
gccbgcgbgc ccaccgaccc caacccccag 2160 
ttcgacabgt ggaagaacga cabggtggag 2220 
gaccagagcc bgaagcccbg cgbgaagcbg 22 80 
gaccbgaaga acgacaccaa caccaacagc 2340 
gagabcaaga acbgcagcbb caacabcagc 2400 
bacgccbbcb bcbacaagcb ggacabcabc 2460 
ctgaccagcb gcaacaccag cgbgatcacc 2520 
abccccabcc actacbgcgc ccccgccggc 2580 
bbcaacggca ccggccccbg caccaacgtg 2640 
cccgbggbga gcacccagcb gcbgctgaac 2700 
cgcagcgbga acbbcaccga caacgccaag 2760 
gagabcaact gcacccgccc caacaacaac 2820 
cccggccgcg cctbcgbgac cabcggcaag 2880 
abcagccgcg ccaagtggaa caacacccbg 2940 
bbcggcaaca acaagaccab cabcbbcaag 3000 
acccacagcb bcaacbgcgg cggcgagbbc 3060 
agcaccbggb bcaacagcac cbggagcacc 3120 
accabcaccc bgcccbgccg cabcaagcag 3180 
gccabgbacg ccccccccab cagcggccag 3240 
cbgcbgaccc gcgacggcgg caacagcaac 3300 
ggcgacabgc gcgacaacbg gcgcagcgag 33 60 
ccccbgggcg bggcccccac caaggccaag 3420 
gbgggcabcg gcgcccbgbb ccbgggcbbc 34 80 
gccagcabga ccctgaccgb gcaggcccgc 354 0 
aacaaccbgc tgcgcgccab cgaggcccag 3600 
abcaagcagc bgcaggcccg cabccbggcc 3 660 
cbgggcabcb ggggcbgcag cggcaagcbb 3720 
agctggagca acaagagccb ggagcagabc 3780 
cgcgagabca acaacbacac cagccbgabc 3840 
caggagaaga acgagcagga gcbgcbggag 3900 
btcaacabca ccaacbggcb gbggtgagga 3960 
ccabcbgbbg bbbgccccbc ccccgbgccb 4020 
gbccbbbccb aabaaaabga ggaaabbgca 4080 
ctggggggtg gggtggggca ggacagcaag 4140 
gcbggggabg cggbgggcbc babgggbacc 4200 
tgggccagaa agaagcaggc acabcccctb 4260 
bbcbbagbbc cagccccacb cabaggacac 4320 
ccacccgcba aagbacbbgg agcggbcbcb 43 80 
bagccbccaa gagbgggaag aaabbaaagc 4440 
aaabgccbcc aacabgbgag gaagbaabga 4500 
gccbbaabcb bccgctbccb cgcbcacbga 4560 
agcggbabca gcbcacbcaa aggcggtaab 4620 
aggaaagaac abgtgagcaa aaggccagca 4680 
gcbggcgbbb bbccabaggc bccgcccccc 4740 
bcagaggbgg cgaaacccga caggactaba 4800 
ccbcgbgcgc bcbccbgbbc cgaccctgcc 4860 
bbcgggaagc gbggcgcbtb cbcatagcbc 4920 
cgbbcgcbcc aagcbgggcb gbgbgcacga 4980 
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accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 5040 
ggbaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 5100 
gbabgbaggc ggtgctacag agbbctbgaa gbggbggccb aactacggct acactagaag 5160 
aacagtattt ggtatctgcg ctctgctgaa gccagbbacc ttcggaaaaa gagttggtag 5220 
cbcbbgabcc ggcaaacaaa ccaccgcbgg tagcggtggt tttfcttgttt gcaagcagca 5280 
gattacgcgc agaaaaaaag gatctcaaga agatccbbbg afcctthtcta cggggtctga 5340 
cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 5400 
cbbcaccbag atccbbbbaa attaaaaatg aagbbtbaaa tcaabcbaaa gtatatatga 5460 
gtaaactbgg tctgacagtt accaatgctt aatcagbgag gcacctatct cagcgatctg 5520 
bcbabbbcgb bcabccabag ttgcctgact cggggggggg gggcgctgag gtctgcctcg 5580 
tgaagaaggt gttgctgact cataccaggc cbgaatcgcc ccatcatcca gccagaaagt 5640 
gagggagcca cggbbgabga gagcbbbgbb gtaggtggac cagbbggbga ttttgaactt 5700 
ttgctttgcc acggaacggt ctgcgttgtc gggaagatgc gtgatctgat ccttcaacbc 5760 
agcaaaagbt cgabbbabbc aacaaagccg ccgtcccgbc aagbcagcgb aabgcbcbgc 5820 
cagtgtbaca accaabbaac caatbcbgab bagaaaaacb cabcgagcat caaatgaaac 5880 
bgcaabbbab bcababcagg abbatcaaba ccababbbbb gaaaaagccg bttcbgbaab 5940 
gaaggagaaa acbcaccgag gcagbbccab aggabggcaa gabccbggba bcggbcbgcg 6000 
abbccgacbc gbccaacabc aabacaaccb abbaabbbcc ccbcgbcaaa aabaaggbba 6060 
bcaagbgaga aabcaccabg agtgacgacb gaabccggtg agaabggcaa aagcbbabgc 6120 
atbbctbbcc agacbbgbbc aacaggccag ccabbacgcb cgbcabcaaa atcacbcgca 6180 
bcaaccaaac cgbbabbcab bcgbgabbgc gccbgagcga gacgaaabac gcgabcgcbg 6240 
bbaaaaggac aabbacaaac aggaabcgaa bgcaaccggc gcaggaacac tgccagcgca 6300 
bcaacaabab bbbcaccbga abcaggabab bcbbcbaaba ccbggaabgc bgbbbbcccg 6360 
gggabcgcag bggbgagbaa ccabgcabca bcaggagbac ggabaaaabg cbbgabggbc 6420 
ggaagaggca baaabbccgb cagccagbbb agbcbgacca bcbcabcbgb aacabcabbg 6480 
gcaacgcbac cbbbgccabg btbcagaaac aacbcbggcg cabcgggcbb cccabacaab 6540 
cgabagabbg bcgcacctga bbgcccgaca bbabcgcgag cccabbbata cccabataaa 6600 
bcagcabcca bgbbggaabb baabcgcggc cbcgagcaag acgbbbcccg bbgaababgg 6660 
cbcabaacac cccbbgbabb acbgbbbabg baagcagaca gbbbbabbgb bcabgabgab 6720 
ababbbbbab cbbgbgcaab gbaacabcag agabtbbgag acacaacgbg gcbbbccccc 6780 
cccccccabb abtgaagcab bbabcagggb babbgbcbca bgagcggaba cababbbgaa 6840 
bgbabbbaga aaaabaaaca aabaggggbb ccgcgcacat bbccccgaaa agbgccaccb 6900 
gacgbcbaag aaaccabbab babcabgaca btaaccbaba aaaabaggcg babcacgagg 6960 
cccbbbcgbc ~ ,- q70 



<210> 7 
<211> 6787 
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<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 7 

bcgcgcgtbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbct gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbtaacbabg cggcatcaga gcagabbgba cbgagagtgc 180 
acca-babgcg gbgtgaaaba ccgcacagat gcgbaaggag aaaabaccgc abcagabbgg 24 0 
cbabtggcca bbgcabacgb bgtatccaba bcabaababg bacabbbaba bbggcbcabg 300 
tccaacabba ccgccabgbb gacabbgabb abbgacbagb babbaabagb aabcaabbac 360 
ggggtcabba gbbcabagcc cabababgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgtabgbbcc 480 
cabagbaacg ccaabaggga cbbbccatbg acgbcaabgg gbggagbabb tacggtaaac 540 
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tgcccacttg gcagtacatc aagbgbabca tatgccaagt acgcccccta bbgacgbcaa 600 
bgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg acbbtccbac 660 
bbggcagtac atctacgtat bagtcabcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggbbbgacbc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agbtbgbbbb ggcaccaaaa tcaacgggac tttccaaaat ghcgtaacaa 840 
ctccgcccca tbgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gbgaaccgtc agatcgccbg gagacgccat ccacgcbgbb bbgacctcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggab 1020 
bccccgbgcc aagagbgacg baagbaccgc cbabagacbc babaggcaca cccctbbggc 1080 
bctbabgcab gcbabacbgb bbbtggcbbg gggccbabac acccccgcbb ccbbabgcba 1140 
baggbgabgg tabagcbbag ccbabaggbg bgggbbabbg accabtabbg accacbcccc 1200 
babbggbgac gabacbbbcc abbacbaabc cabaacabgg cbcbbbgcca caactabcbc 1260 
babbggcbab abgccaabac bcbgbccbbc agagacbgac acggacbcbg babbtbbaca 1320 
ggabggggtc ccabtbatta bbbacaaatt cacababaca acaacgccgt cccccgtgcc 1380 
cgcagtbbtb abtaaacaba gcgtgggabc bccacgcgaa bcbcgggbac gbgbtccgga 1440 
cabgggcbcb bcbccggtag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbccbtg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagtgbgccg cacaaggccg bggcggtagg gbabgbgbct 1620 
gaaaabgagc gbggagatbg ggctcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gbabbcbgab aagagbcaga ggbaactccc 1740 
gbbgcggbgc bgbbaacggb ggagggcagb gbagbcbgag cagbacbcgb bgcbgccgcg 1800 
cgcgccacca gacabaabag cbgacagacb aacagactgb bccbbbccab gggbcbbbbc 1860 
bgcagbcacc gbcgbcgaca cgbgbgabca gababcgcgg ccgcbcbaga caccabgcgc 1920 
gbgaaggaga agbaccagca ccbgbggcgc bggggcbggc gcbggggcac cabgcbgcbg 1980 
ggcabgcbga bgabcbgcag cgccaccgag aagcbgbggg bgaccgtgba cbacggcgbg 2 04 0 
cccgbgbgga aggaggccac caccacccbg cbcbgcgcca gcgacgccaa ggcctacgac 2100 
accgaggbgc acaacgbgbg ggccacccac gccbgcgbgc ccaccgaccc caacccccag 2160 
gaggbggbgc bggbgaacgt gaccgagaac bbcgacatgt ggaagaacga cabggbggag 2220 
cagabgcacg aggacabcat cagccbgbgg gaccagagcc bgaagcccbg cgbgaagcbg 2280 
accccccbgb gcgbgagccb gaagbgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcabgabcab ggagaagggc gagabcaaga acbgcagcbb caacabcagc 2400 
accagcabcc gcggcaaggb gcagaaggag tacgccbbcb bcbacaagct ggacabcabc 2460 
cccabcgaca acgacaccac cagcbacagc cbgaccagcb gcaacaccag cgbgabcacc 2520 
caggccbgcc ccaaggbgag cbbcgagccc abccccaacc acbacbgcgc ccccgccggc 2580 
bbcaccabcc bgaagbgcaa caacaagacc tbcaacggca ccggccccbg caccaacgbg 2640 
agcaccgbgc agbgcaccca cggcabccgc cccgbggtga gcacccagcb gcbgcbgaac 2700 
ggcagccbgg ccgaggagga ggbggbgatc cgcagcgtga acbtcaccga caacgccaag 2760 
accabcabcg bgcagcbgaa caccagcgbg gagabcaacb gcacccgccc caacaacaac 2820 
acccgcaagc gaabbcgcab ccagcgcggc cccggccgcg ccbtcgbgac cabcggcaag 2880 
abcggcaaca bgcgccaggc ccacbgcaac abcagccgcg ccaagbggaa caacacccbg 2940 
aagcagabcg acagcaagcb gcgcgagcag bbcggcaaca acaagaccab cabctbcaag 3000 
cagagcagcg gcggcgaccc cgagabcgbg acccacagcb bcaacbgcgg cggcgagbbc 3060 
btctacbgca acagcaccca gcbgbbcaac agcaccbggb bcaacagcac cbggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accabcaccc bgcccbgccg cabcaagcag 3180 
abcabcaaca bgbggcagaa ggbgggcaag gccabgbacg ccccccccab cagcggccag 3240 
abccgcbgca gcagcaacab caccggccbg cbgcbgaccc gcgacggcgg caacagcaac 3300 
aacgagagcg agabcbbccg ccbgggcggc ggcgacatgc gcgacaacbg gcgcagcgag 3360 
cbgbacaagt acaaggbggt gaagabcgag ccccbgggcg bggcccccac caaggccaag 3420 
cbbaccgbgc aggcccgcca gcbgcbgagc ggcabcgbgc agcagcagaa caacctgcbg 3480 
cgcgccabcg aggcccagca gcaccbgcbg cagcbgaccg bgbggggcab caagcagcbg 3540 
caggcccgca bccbggccgb ggagcgcbac cbgaaggacc agcagcbgga gcagabcbgg 3600 
aaccacacca ccbggabgga gbgggaccgc gagabcaaca acbacaccag ccbgabccac 3660 
agccbgabcg aggagagcca gaaccagcag gagaagaacg agcaggagcb gcbggagcbg 3 720 
gacaagbggg ccagccbgbg gaacbggbbc aacabcacca acbggcbgbg gbgaggabcc 3780 
agabcbgcbg bgccbbctag tbgccagcca bcbgbbgtbb gccccbcccc cgtgccbbcc 3840 
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ttgaccctgg aaggtgccac tcccactgtc cbbbcctaab aaaatgagga aattgcatcg 3900 
cabtgtcbga gtaggtgtca bbcbabbcbg gggggtgggg tggggcagga cagcaagggg 3960 
gaggattggg aagacaabag caggcabgcb ggggahgcgg tgggctctat gggtacccag 4020 
gtgctgaaga attgacccgg ttcctcctgg gccagaaaga agcaggcaca bccccbbcbc 4080 
tgtgacacac cctgtccacg cccctggttc ttagbtccag ccccactcat aggacactca 4140 
tagctcagga gggctccgcc bbcaatccca cccgctaaag bacbbggagc ggtctctccc 4200 
tccctcatca gcccaccaaa ccaaaccbag ccbccaagag tgggaagaaa ttaaagcaag 4260 
abaggcbabb aagtgcagag ggagagaaaa tgcctccaac abgbgaggaa gbaabgagag 4320 
aaatcataga abbbbaaggc catcatggcc bbaabcbbcc gctbcctcgc tcactgacbc 4380 
gctgcgctcg gbcgbbcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg 4440 
gttatccaca gaatcagggg ataacgcagg aaagaacatg bgagcaaaag gccagcaaaa 4500 
ggccaggaac cgtaaaaagg ccgcgttgct ggcgtbbbbc cataggctcc gcccccctga 4560 
cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag 4 620 
ataccaggcg tttccccctg gaagctccct . cgtgcgctct ccbgbbccga ccctgccgct 4680 
taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgcbbbcbc atagctcacg 4740 
cbgbaggbab ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgbg bgcacgaacc 4800 
ccccgtbcag cccgaccgcb gcgccbbabc cggbaacbat cgbcbbgagb ccaacccggt 4860 
aagacacgac btabcgccac bggcagcagc cactggtaac aggabtagca gagcgaggta 4920 
tgbaggcggb gcbacagagb bcbbgaagtg gtggccbaac bacggctaca ctagaagaac 4980 
agbabbbggb abcbgcgcbc tgcbgaagcc agtbacctbc ggaaaaagag bbggbagcbc 504 0 
bbgabccggc aaacaaacca ccgctggbag' cggbggttbt tbtgttbgca agcagcagab 5100 
bacgcgcaga aaaaaaggab cbcaagaaga bccbbbgabc bbbbcbacgg ggbcbgacgc 5160 
tcagbggaac gaaaacbcac gtbaagggat tttggtcatg agabbabcaa aaaggabcbb 5220 
cacctagabc cbbtbaaabt aaaaabgaag bbtbaaabca abcbaaagba bababgagta 5280 
aactbggtcb gacagtbacc aatgcbtaab cagbgaggca ccbabctcag cgatcbgbcb 5340 
abbbcgtbca tccabagbtg ccbgacbcgg gggggggggg cgctgaggtc bgccbcgbga 5400 
agaaggbgbb gcbgacbcat accaggccbg aabcgcccca bcabccagcc agaaagtgag 5460 
ggagccacgg bbgabgagag cbtbgbtgba ggtggaccag bbggtgatbb tgaacbbbbg 5520 
cbttgccacg gaacggtctg cgtbgbcggg aagatgcgbg abctgatccb bcaacbcagc 5580 
aaaagbtcga bbtattcaac aaagccgccg tcccgtcaag bcagcgbaab gctcbgccag 5640 
bgtbacaacc aabtaaccaa bbcbgabbag aaaaacbcab cgagcabcaa abgaaacbgc 5700 
aabbtabbca babcaggabb abcaabacc'a babbbbbgaa aaagccgbbb cbgtaabgaa 5760 
ggagaaaact caccgaggca gbtccabagg atggcaagab ccbggbabcg gbcbgcgabb 5820 
ccgacbcgbc caacabcaab acaacctabb aabbbccccb cgtcaaaaab aaggbbatca 5880 
agbgagaaab caccabgagb gacgacbgaa bccggbgaga abggcaaaag cbtatgcabb 5940 
bcbbbccaga cbbgbbcaac aggccagcca bbacgcbcgb cabcaaaabc acbcgcatca 6000 
accaaaccgb babbcatbcg tgabbgcgcc bgagcgagac gaaatacgcg atcgcbgtba 6060 
aaaggacaab bacaaacagg aabcgaabgc aaccggcgca ggaacactgc cagcgcatca 6120 
acaabattbt caccbgaabc aggababbcb bcbaabaccb ggaabgcbgb bbbcccgggg 6180 
abcgcagtgg bgagbaacca tgcabcabca ggagbacgga taaaabgcbb gatggbcgga 6240 
agaggcabaa abbccgbcag ccagtbbagb cbgaccabcb cabcbgbaac abcabbggca 6300 
acgcbaccbb bgccabgbbb cagaaacaac bcbggcgcab cgggcbbccc abacaatcga 63 60 
bagabbgbcg caccbgabbg cccgacabba bcgcgagccc abbbataccc abataaabca 6420 
gcabccabgb bggaatbbaa bcgcggccbc gagcaagacg bbbcccgbtg aababggcbc 64 80 
abaacacccc bbgtabbacb gbbbabgtaa gcagacagbb bbabbgtbca bgabgababa 6540 
btbtbabcbb gbgcaabgba acabcagaga btbbgagaca caacgbggcb bbcccccccc 6600 
ccccabtabb gaagcabtba bcagggbtab bgbctcabga gcggabacab abbbgaabgb 6660 
abbbagaaaa abaaacaaat aggggbbccg cgcacabbbc cccgaaaagb gccaccbgac 6720 
gbcbaagaaa ccabtabbat cabgacatba accbabaaaa abaggcgbab cacgaggccc 6780 
bbbcgbc ~ ~~ fi7p7 
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<223> plasmid pVR1012x/s containing HIV genes 
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tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgacgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 2040 
cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtgaacgt gaccgagaac ttcgacatgt ggaagaacga catggtggag 2220 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 2280 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcat ggagaagggc gagatcaaga actgcagctt caacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag tacgccttct tctacaagct ggacatcatc 2460 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgtgatcacc 2520 
caggcctgcc ccaaggtgag cttcgagccc atccccaacc actactgcgc ccccgccggc 2580 
ttcaccatcc tgaagtgcaa caacaagacc ttcaacggca ccggcccctg caccaacgtg 2640 
agcaccgtgc agtgcaccca cggcatccgc cccgtggtga gcacccagct gctgctgaac 2700 
ggcagcctgg ccgaggagga ggtggtgatc cgcagcgtga acttcaccga caacgccaag 2760 
accatcatcg tgcagctgaa caccagcgtg gagatcaact gcacccgccc caacaacaac 2820 
acccgcaagc gaattcgcat ccagcgcggc cccggccgcg ccttcgtgac catcggcaag 2880 
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atcggcaaca tgcgccaggc ccactgcaac atcagccgcg ccaagtggaa caacaccctg 2940 
aagcagatcg acagcaagct gcgcgagcag bbcggcaaca acaagaccat catcttcaag 3000 
cagagcagcg gcggcgaccc cgagatcgtg acccacagct bcaacbgcgg cggcgagbbc 3060 
tbcbactgca acagcaccca gcbgbbcaac agcacctggb tcaacagcac ctggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accatcaccc tgccctgccg catcaagcag 3180 
atcatcaaca tgtggcagaa ggtgggcaag gccatgtacg ccccccccat cagcggccag 3240 
atccgctgca gcagcaacat caccggcctg ctgctgaccc gcgacggcgg caacagcaac 3300 
aacgagagcg agatcttccg cctgggcggc ggcgacatgc gcgacaactg gcgcagcgag 3360 
ctgtacaagt acaaggbggb gaagatcgdg cccctgggcg tggcccccac caaggccaag 3420 
cttaccgfcgc aggcccgcca gctgctgagc ggcatcgbgc agcagcagaa caacctgctg 3480 
cgcgccatcg aggcccagca gcacctgctg cagctgaccg tgtggggcat caagcagctg 3540 
caggcccgca tcctggccgt ggagcgctac ctgaaggacc agcagctcga gtgaggatcc 3 600 
agatctgctg tgccttctag ttgccagcca bcbgbbgbbb gcccctcccc cgtgccttcc 3660 
ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg 3720 
cattgtctga gbaggbgbca ttctattctg gggggtgggg tggggcagga cagcaagggg 3780 
gaggattggg aagacaabag caggcatgct ggggatgcgg tgggcbcbat gggtacccag 3840 
gtgctgaaga attgacccgg ttcctcctgg gccagaaaga agcaggcaca tccccttctc 3900 
tgtgacacac cctgtccacg ccccbggttc ttagttccag ccccactcat aggacactca 3960 
tagctcagga gggctccgcc ttcaatccca cccgctaaag tacttggagc ggtctctccc 4020 
tccctcatca gcccaccaaa ccaaacctag cctccaagag tgggaagaaa ttaaagcaag 4 080 
ataggctatt aagtgcagag ggagagaaaa bgcctccaac abgbgaggaa gtaatgagag 4140 
aaatcataga. attttaaggc catcatggcc ttaatcttcc gcttcctcgc tcactgactc 4200 
gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cacbcaaagg cggtaabacg 4260 
gttatccaca gaatcagggg abaacgcagg aaagaacatg tgagcaaaag gccagcaaaa 4320 
ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga 4380 
cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gacbataaag 4440 
ataccaggcg ttbccccctg gaagcbcccb cgbgcgctcb cctgbbccga cccbgccgct 4500 
baccggabac cbgtccgccb ttctcccttc gggaagcgbg gcgcttbctc abagcbcacg 4560 
cbgbaggbab cbcagbbcgg tgtaggbcgb tcgctccaag cbgggctgtg bgcacgaacc 4620 
ccccgbtcag cccgaccgcb gcgcctbatc cggbaacbat cgbctbgagb ccaacccggb 4680 
aagacacgac tbatcgccac tggcagcagc cactggtaac aggattagca gagcgaggba 4740 
tgbaggcggt gctacagagb tcbbgaagbg gbggcctaac bacggcbaca ctagaagaac 4800 
agtattbggb atctgcgcbc tgctgaagcc agbbaccttc ggaaaaagag bbggtagcbc 4860 
ttgabccggc aaacaaacca Ccgctggtag cggbggbttt bttgbbbgca agcagcagab 4920 
tacgcgcaga aaaaaaggab ctcaagaaga bccbttgatc bbbbcbacgg ggbctgacgc 4980 
bcagtggaac gaaaacbcac gbbaagggab bbbggtcabg agabtabcaa aaaggatctt 5040 
cacctagatc cttttaaabb aaaaabgaag tbbbaaatca atctaaagba tabatgagba 5100 
aacbtggbcb gacagttacc aabgcbbaab cagbgaggca ccbabcbcag cgabcbgbcb 5160 
abtbcgttca tccatagbtg ccbgacbcgg gggggggggg cgctgaggbc tgcctcgtga 5220 
agaaggbgtt gctgactcab accaggcctg aabcgcccca bcabccagcc agaaagbgag 5280 
ggagccacgg tbgatgagag cbbbgbbgba ggbggaccag bbggbgabbb bgaacbbbbg 5340 
cbtbgccacg gaacggtcbg cgbtgbcggg aagatgcgbg abcbgabcct bcaacbcagc 54 00 
aaaagttcga bbtabtcaac aaagccgccg bcccgbcaag tcagcgbaab gcbcbgccag 5460 
tgbbacaacc aattaaccaa btcfcgattag aaaaacbcab cgagcabcaa abgaaacbgc 5520 
aabbtatbca batcaggabb abcaabacca tabbtbbgaa aaagccgbbb cbgtaatgaa 5580 
ggagaaaacb caccgaggca gtbccabagg abggcaagab ccbggbabcg gbctgcgatt 5640 
ccgacbcgbc caacabcaab acaaccbatb aabbtccccb cgbcaaaaab aaggbtabca 5700 
agbgagaaab caccatgagb gacgacbgaa tccggbgaga abggcaaaag cbbatgcabb 5760 
tcbbbccaga cbbgbbcaac aggccagcca tbacgcbcgb cabcaaaabc acbcgcabca 5820 
accaaaccgt tattcabbcg tgatbgcgcc bgagcgagac gaaabacgcg abcgcbgbba 5880 
aaaggacaab bacaaacagg aabcgaabgc aaccggcgca ggaacactgc cagcgcabca 5940 
acaababbtb caccbgaatc aggababbcb bcbaatacct ggaabgctgb bbtcccgggg 6000 
abcgcagbgg bgagbaacca tgcabcabca ggagtacgga taaaabgcbb gabggbcgga 6060 
agaggcabaa attccgbcag ccagbbbagb ctgaccatct cabcbgbaac abcabbggca 6120 
acgcbacctt bgccabgbbb cagaaacaac bcbggcgcab cgggcbbccc abacaabcga 6180 
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bagabbgbcg caccbgatbg cccgacatta tcgcgagccc atttataccc ababaaabca 6240 
gcatccatgt tggaatttaa bcgcggcctc gagcaagacg tttcccgttg aatatggctc 6300 
ataacacccc ttgtattact gbtbabgtaa gcagacagtt ttattgttca bgatgatata 6360 
tttttatctt gtgcaatgta acatcagaga ttttgagaca caacgtggcb btcccccccc 6420 
ccccatbabt gaagcattta tcagggtbab tgtctcatga gcggabacab abtbgaabgt 6480 
abbbagaaaa abaaacaaab aggggfcbccg cgcacattbc cccgaaaagt gccaccbgac 6540 
gbcbaagaaa ccabbatbab catgacabba accbataaaa abaggcgbab cacgaggccc 6600 
bbbcgbc " " 6607 



<210> 9 
<211> 7042 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 9 

tcgcgcgtbb cggbgabgac ggbgaaaacc tctgacacab gcagcbcccg gagacggtca 60 
cagcbtgtcb gtaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggtg 12 0 
tbggcgggtg tcggggctgg cttaacbabg cggcabcaga gcagabtgta ctgagagbgc 180 
accababgcg gbgbgaaata ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 24 0 
cbatbggcca bbgcabacgt bgbabccata tcabaababg bacabbbaba bbggctcatg 300 
tccaacabba ccgccabgbb gacabtgatb atbgacbagb babbaabagb aabcaabtac 360 
ggggbcabba gbbcabagcc catababgga gtbccgcgbt acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabtgacg tcaabaabga cgbatgtbcc 480 
catagbaacg ccaabaggga cbbtccabbg acgbcaatgg gbggagbabt bacggbaaac 540 
bgcccacbbg gcagbacabc aagtgbatca babgccaagb acgcccccba bbgacgbcaa 600 
tgacggbaaa bggcccgccb ggcabbatgc ccagbacabg accbbabggg acbbbccbac 660 
bbggcagtac abcbacgbab bagbcabcgc babtaccabg gtgatgcggb ttbggcagba 720 
cabcaabggg cgbggatagc ggbttgacbc acggggatbt ccaagbcbcc accccabbga 780 
cgbcaabggg agbbbgbbbt ggcaccaaaa bcaacgggac bbbccaaaab gbcgtaacaa 84 0 
cbccgcccca bbgacgcaaa bgggcggtag gcgbgbacgg bgggaggtcb abataagcag 900 
agctcgbbba gbgaaccgbc agabcgcctg gagacgccat ccacgcbgtb btgacctcca 960 
bagaagacac cgggaccgat ccagccbccg cggccgggaa cggtgcabtg gaacgcggab 1020 
bccccgbgcc aagagtgacg baagtaccgc cbabagactc babaggcaca ccccbttggc 1080 
bctbatgcab gcbabacbgb bbbtggcbbg gggccbabac acccccgctb ccbbatgcba 1140 
baggbgabgg batagcbtag ccbataggbg bgggbbabbg accatbatbg accacbcccc 1200 
babbggbgac gabacbtbcc abbacbaabc cabaacabgg ctcbtbgcca caacbatcbc 1260 
batbggcbab abgccaabac bcbgbccbbc agagacbgac acggacbcbg babtbbtaca 1320 
ggabggggbc ccabbbabta bbbacaaatb cacabataca acaacgccgb cccccgbgcc 1380 
cgcagtbbbb abbaaacata gcgbgggatc bccacgcgaa bcbcgggbac gbgbbccgga 1440 
cabgggcbcb tcbccggtag cggcggagcb bccacabccg agcccbggbc ccatgcctcc 150 0 
agcggcbcab ggbcgcbcgg cagcbccbtg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggtagg gtatgbgbct 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gctgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbtgbb gbabtcbgab aagagbcaga ggtaacbccc 1740 
gbbgcggbgc bgbbaacggb ggagggcagt gbagbcbgag cagbacbcgb tgcbgccgcg 1800 
cgcgccacca gacabaatag cbgacagacb aacagacbgb tccbtbccab gggbcbbbbc 1860 
bgcagbcacc gtcgbcgaca cgbgtgabca gababcgcgg ccgctcbaga caccabgcgc 1920 
gbgaaggaga agbaccagca ccbgbggcgc bggggcbggc gcbggggcac cabgcbgcbg 1980 
ggcabgcbga bgabcbgcag cgccaccgag aagcbgbggg bgaccgbgba cbacggcgbg 2040 
cccgbgtgga aggaggccac caccacccbg ctcbgcgcca gcgacgccaa ggcctacgac 2100 
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accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtgaacgt gaccgagaac ttcgacatgt ggaagaacga catggtggag 2220 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 2280 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcat ggagaagggc gagatcaaga actgcagctt caacatcagc 24 00 
accagcatcc gcggcaaggt gcagaaggag tacgccttct tctacaagct ggacatcatc 2460 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgtgatcacc 2520 
caggcctgcc ccaaggtgag cttcgagccc atccccaacc actactgcgc ccccgccggc 2580 
ttcaccatcc tgaagtgcaa caacaagacc ttcaacggca ccggcccctg caccaacgtg 2640 
agcaccgtgc agtgcaccca cggcatccgc cccgtggtga gcacccagct gctgctgaac 2700 
ggcagcctgg ccgaggagga ggtggtgatc cgcagcgtga acttcaccga caacgccaag 2760 
accatcatcg tgcagctgaa caccagcgtg gagatcaact gcacccgccc caacaacaac 2820 
acccgcaagc gaattcgcat ccagcgcggc cccggccgcg ccttcgtgac catcggcaag 2880 
atcggcaaca tgcgccaggc ccactgcaac atcagccgcg ccaagtggaa caacaccctg 294 0 
aagcagatcg acagcaagct gcgcgagcag ttcggcaaca acaagaccat catcttcaag 3000 
cagagcagcg gcggcgaccc cgagatcgtg acccacagct tcaactgcgg cggcgagttc 3 060 
ttctactgca acagcaccca gctgttcaac agcacctggt tcaacagcac ctggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accatcaccc tgccctgccg catcaagcag 3180 
atcatcaaca tgtggcagaa ggtgggcaag gccatgtacg ccccccccat cagcggccag 3240 
atccgctgca gcagcaacat caccggcctg ctgctgaccc gcgacggcgg caacagcaac 33 00 
aacgagagcg agatcttccg cctgggcggc ggcgacatgc gcgacaactg gcgcagcgag 3360 
ctgtacaagt acaaggtggt gaagatcgag cccctgggcg tggcccccac caaggccaag 34 2 0 
cgccgcgtgg tgcagcgcga gaagcgcgcc gtgggcatcg gcgccctgtt cctgggcttc 34 80 
ctgggcgccg ccggcagcac catgggcgcc gccagcatga ccctgaccgt gcaggcccgc 3540 
cagctgctga gcggcatcgt gcagcagcag aacaacctgc tgcgcgccat cgaggcccag 3600 
cagcacctgc tgcagctgac cgtgtggggc atcaagcagc tgcaggcccg catcctggcc 3 660 
gtggagcgct acctgaagga ccagcagctg ctgggcatct ggggctgcag cggcaagctt 3720 
atctgcacca ccgccgtgcc ctggaacgcc agctggagca acaagagcct ggagcagatc 3780 
tggaaccaca ccacctggat ggagtgggac cgcgagatca acaactacac cagcctgatc 3840 
cacagcctga tcgaggagag ccagaaccag caggagaaga acgagcagga gctgctggag 39 00 
ctggacaagt gggccagcct gtggaactgg ttcaacatca ccaactggct gtggtacatc 3960 
aagctgttca tcatgatcgt gggcggcctg gtgggcctgc gcatcgtgtt cgccgtgctg 4020 
agcatctgag gatccagatc tgctgtgcct tctagttgcc agccatctgt tgtttgcccc 4 080 
tcccccgtgc cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat 4140 
gaggaaattg catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg 4200 
caggacagca agggggagga ttgggaagac aatagcaggc atgctgggga tgcggtgggc 4260 
tctatgggta cccaggtgct gaagaattga cccggttcct cctgggccag aaagaagcag 4320 
gcacatcccc ttctctgtga cacaccctgt ccacgcccct ggttcttagt tccagcccca 43 80 
ctcataggac actcatagct caggagggct ccgccttcaa tcccacccgc taaagtactt 4440 
ggagcggtct ctccctccct catcagccca ccaaaccaaa cctagcctcc aagagtggga 4500 
agaaattaaa gcaagatagg ctattaagtg cagagggaga gaaaatgcct ccaacatgtg 4560 
aggaagtaat gagagaaatc atagaatttt aaggccatca tggccttaat cttccgcttc 4620 
ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc 4680 
aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 4740 
aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 4800 
gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 4860 
gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 4920 
tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ' 4 980 
ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 5040 
ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 5100 
tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 5160 
tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 5220 
ctacactaga agaacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 5280 
aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 534 0 
ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 5400 
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tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggabtbbgg tcatgagatt 5460 
atcaaaaagg abcbbcaccb agatcctttt aaattaaaaa tgaagtttta aatcaatcta 5520 
aagtatatat gagtaaactb ggtctgacag ttaccaatgc ttaatcagtg aggcacctat 5580 
ctcagcgatc tgtctatttc gttcatccat agbbgccbga ctcggggggg gggggcgctg 5640 
aggtctgcct cgbgaagaag gtgttgctga ctcataccag gcctgaatcg ccccatcatc 5700 
cagccagaaa gbgagggagc cacggttgat gagagctttg ttgtaggtgg accagttggt 5760 
gabbbbgaac bbtbgcbbbg ccacggaacg gtctgcgttg tcgggaagat gcgtgatcbg 5820 
atccttcaac tcagcaaaag tbcgabbtab bcaacaaagc cgccgbcccg tcaagtcagc 5880 
gtaatgctct gccaghgtta caaccaatta accaabtctg ahtagaaaaa ctcatcgagc 5940 
atcaaatgaa acbgcaabtb attcatatca ggabbabcaa taccatattt ttgaaaaagc 6000 
cgfctbcbgba abgaaggaga aaactcaccg aggcagttcc ataggatggc aagabccbgg 6060 
tatcggtctg cgattccgac tcgtccaaca tcaatacaac ctattaattt cccctcgtca 6120 
aaaataaggt tatcaagtga gaaatcacca tgagtgacga ctgaatccgg tgagaatggc 6180 
aaaagcttat gcabbtcbbb ccagacttgt tcaacaggcc agccattacg ctcgtcatca 6240 
aaatcactcg catcaaccaa accgtbabtc atbcgbgabb gcgcctgagc gagacgaaab 6300 
acgcgabcgc tgttaaaagg acaattacaa acaggaatcg aatgcaaccg gcgcaggaac 6360 
actgccagcg catcaacaat attttcacct gaatcaggat attcttctaa baccbggaab 6420 
gcbgbbbbcc cggggatcgc agtggtgagt aaccatgcat catcaggagt acggataaaa 6480 
tgcttgatgg tcggaagagg cataaattcc gtcagccagt btagbcbgac catctcatct 6540 
gtaacatcat tggcaacgct acctttgcca tgtttcagaa acaactctgg cgcatcgggc 6600 
ttcccataca atcgatagat tgtcgcacct gattgcccga cattatcgcg agcccattta 6660 
bacccatata aatcagcatc catgttggaa tttaatcgcg gcctcgagca agacgtttcc 6720 
cgbbgaabab ggctcataac accccttgta ttactgttta tgtaagcaga cagbtbbatt 6780 
gttcatgatg atatattttt atcttgtgca atgtaacatc agagatbttg agacacaacg 6840 
bggcbtbccc ccccccccca bbabbgaagc atttatcagg gbbattgtcb catgagcgga 6900 
bacababbbg aatgtabbba gaaaaataaa caaatagggg ttccgcgcac atttccccga 6960 
aaagtgccac cbgacgtcta agaaaccatt atbatcatga cabtaaccta baaaaabagg 7020 
cgbatcacga ggcccbbbcg be 7042 



<210> 10 
<211> 6859 
<212> DNA 

<213> Arbificial Sequence 

■ 

<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 10 

tegegegtbt eggbgabgae ggbgaaaacc tcbgacacat gcagctcccg gagaeggtea 60 
cagcbbgbcb gtaageggab geegggagea gacaagcccg teagggegeg teagegggbg 120 

tbggcgggtg beggggebgg ebbaacbabg eggcabcaga gcagabbgta cbgagagbgc 180 

accatabgcg gbgbgaaaba ccgcacagab gegbaaggag aaaabacege abcagabbgg 240 

cbabtggcca bbgcabacgt bgbabecaba bcataababg bacabtbaba bbggcbcabg 300 

becaacabba ccgccabgbt gacabbgabb abbgacbagb babbaabagt aabcaabtac 360 

ggggbcabba gbbcabagee cababatgga gbbccgcgtb acabaacbta eggbaaatgg 420 

cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaabaabga egbabgbtec 480 

cabagbaacg ccaabaggga ebbbecabbg aegtcaabgg gbggagbabb tacggbaaac 540 

tgcccactbg gcagbacabc aagbgbabca babgecaagb acgcccccba bbgacgbcaa 600 

bgacggbaaa bggcccgcct ggcatbabgc ccagbacabg accbbabggg actbtccbac 660 

bbggcagbac abebaegbat bagbcabege babbaccabg gbgabgcggb tttggcagba 720 

cabcaabggg egbggabage ggbbbgaebe aeggggabtb ccaagbcbcc accccabbga 780 

cgbcaatggg agbbbgbbbt ggcaccaaaa teaaegggae bbbecaaaab gbcgbaacaa 840 

cbccgcccca bbgacgcaaa bgggeggtag gcgbgbacgg bgggaggbcb ababaagcag 900 
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agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgab ccagcctccg cggccgggaa cggtgcabtg gaacgcggat 1020 
bccccgbgcc aagagtgacg taagtaccgc ctatagactc tataggcaca ccccbtbggc 1080 
bcbbabgcab gctatactgt tbbtggcbbg gggcctatac acccccgctt ccttatgcta 1140 
ta 99tgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc abbactaabc cataacatgg ctctttgcca caacbatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tbbacaaatb cacatabaca acaacgccgb cccccgtgcc 1380 
cgcagttttt attaaacata gcgbgggabc bccacgcgaa tctcgggbac gbgtbccgga 1440 
catgggcbct tctccggtag cggcggagcb fcccacatccg agcccbggbc ccatgccbcc 1500 
agcggctcab ggbcgctcgg cagcbccttg cbcctaacag tggaggccag acbtaggcac 1560 
agcacaatgc ccaccaccac cagfcgtgccg cacaaggccg tggcggbagg gbatgtgtcb 1620 
gaaaatgagc gbggagabbg ggcbcgcacg gctgacgcag abggaagact baaggcagcg 1680 
gcagaagaag abgcaggcag ctgagtbgbb gbattctgat aagagbcaga ggbaacbccc 174 0 
gttgcggbgc bgttaacggt ggagggcagb gbagtcbgag cagtactcgt bgctgccgcg 1800 
cgcgccacca gacabaatag ctgacagacb aacagacbgb bccbtbccat gggtcttbbc 1860 
tgcagtcacc gbcgbcgaca cgtgbgatca gabatcgcgg ccgctctaga caccatgcgc 1920 
gfcgaaggaga agtaccagca cctgtggcgc bggggcbggc gctggggcac catgctgctg 1980 
ggcabgctga tgatctgcag cgccaccgag aagctgtggg bgaccgbgba cbacggcgtg 2040 
cccgbgtgga aggaggccac caccacccbg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgbg ggccacccac gccbgcgbgc ccaccgaccc caacccccag 2160 
gaggbggtgc bggbgaacgb gaccgagaac ttcgacatgt ggaagaacga cabggtggag 2220 
cagabgcacg aggacatcab cagccbgbgg gaccagagcc bgaagccctg cgtgaagctg 22 80 
acccccctgt gcgtgagccb gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcabgatcab ggagaagggc gagatcaaga actgcagcbb caacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag tacgccttct bctacaagct ggacatcatc 24 60 
cccabcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgtgatcacc 2520 
caggccbgcc ccaaggtgag cbtcgagccc atccccaacc acbacbgcgc ccccgccggc 2580 
ttcaccatcc bgaagtgcaa caacaagacc ttcaacggca ccggccccbg caccaacgtg 2640 
agcaccgtgc agtgcaccca cggcatccgc cccgtggbga gcacccagcb gctgcbgaac 2700 
ggcagcctgg ccgaggagga ggtggtgabc cgcagcgbga acttcaccga caacgccaag 2760 
accabcatcg bgcagcbgaa caccagcgbg gagabcaacb gcacccgccc caacaacaac 2820 
acccgcaagc gaabtcgcat ccagcgcggc cccggccgcg cctbcgtgac catcggcaag 2 8 BO 
atcggcaaca tgcgccaggc ccactgcaac atcagccgcg ccaagbggaa caacaccctg 2940 
aagcagabcg acagcaagct gcgcgagcag bbcggcaaca acaagaccab cabcttcaag 3000 
cagagcagcg gcggcgaccc cgagabcgtg acccacagct bcaactgcgg cggcgagbtc 3060 
btctacbgca acagcaccca gcbgttcaac agcacctggt bcaacagcac cbggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accabcaccc bgccctgccg cabcaagcag 3180 
abcatcaaca bgbggcagaa ggtgggcaag gccatgtacg ccccccccab cagcggccag 3240 
atccgcbgca gcagcaacat caccggccbg ctgcbgaccc gcgacggcgg caacagcaac 3300 
aacgagagcg agabcbbccg ccbgggcggc ggcgacabgc gcgacaacbg gcgcagcgag 33 60 
ctgbacaagt acaaggbggt gaagatcgag ccccbgggcg bggcccccac caaggccaag 3420 
cbtaccgtgc aggcccgcca gcbgctgagc ggcabcgbgc agcagcagaa caaccbgcbg 34 80 
cgcgccabcg aggcccagca gcacctgcbg cagcbgaccg bgtggggcat caagcagcbg 3540 
caggcccgca bccbggccgt ggagcgcbac ctgaaggacc agcagctgga gcagabcbgg 3600 
aaccacacca ccbggabgga gbgggaccgc gagabcaaca actacaccag cctgabccac 3660 
agccbgabcg aggagagcca gaaccagcag gagaagaacg agcaggagcb gcbggagctg 3720 
gacaagtggg ccagcctgtg gaactggbtc aacabcacca actggcbgtg gbacabcaag 3780 
ctgbbcabca bgabcgbggg cggcctggbg ggccbgcgca tcgbgbbcgc cgbgcbgagc 3840 
abctgaggab ccagabcbgc bgbgcctbcb agbtgccagc cabcbgbtgb tbgccccbcc 3900 
cccgbgccbt ccbtgaccct ggaaggtgcc acbcccacbg bccbbbccba abaaaabgag 3960 
gaaafctgcat cgcabbgbct gagtaggbgt catbcbabbc tggggggbgg ggbggggcag 4 020 
gacagcaagg gggaggabtg ggaagacaat agcaggcatg cbggggabgc ggbgggcbct 4080 
atgggtaccc aggbgctgaa gaattgaccc ggbbccbcct gggccagaaa gaagcaggca 4140 
cabcccctbc bcbgbgacac acccbgtcca cgccccbggb bctbagbbcc agccccacbc 4200 
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ataggacact cabagctcag gagggctccg ccttcaatcc cacccgctaa agbacttgga 4260 
gcggbcbcbc cctccctcat cagcccacca aaccaaacct agcctccaag agtgggaaga 4320 
aattaaagca agataggcta tbaagtgcag agggagagaa aabgccbcca acatgtgagg 4380 
aagtaatgag agaaatcaba gaattttaag gccatcatgg ccttaatctt ccgcttcctc 4440 
gcbcacbgac tcgctgcgct cggbcgtbcg gcbgcggcga gcggtatcag ctcactcaaa 4500 
ggcgghaata cggttatcca cagaatcagg ggataacgca ggaaagaaca bgbgagcaaa 4 560 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 4 620 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 4680 
aggacbabaa agataccagg cgtttccccc tggaagctcc chcgtgcgct ctcctgttcc 4740 
gaccctgccg cttaccggat accbgtccgc ctttctccct tcgggaagcg tggcgctttc 4 800 
tcabagcbca cgctgtaggt abcbcagbbc ggtgtaggtc gttcgctcca agctgggctg 4 860 
bgbgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga 4920 
gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta acaggattag 4 980 
cagagcgagg tatgtaggcg gbgcbacaga gttcttgaag tggtggccta actacggcta 5040 
cactagaaga acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag 5100 
agbbggbagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 5160 
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcbbbbcbac 522 0 
ggggtctgac gctcagtgga acgaaaactc acgttaaggg atttbggbca tgagatbabc 5280 
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag 5340 
babababgag taaactbggt ctgacagtba ccaatgctta atcagtgagg cacctatctc 5400 
agcgatctgt ctattbcgtt catccatagt tgcctgactc gggggggggg ggcgctgagg 5460 
tctgcctcgt gaagaaggtg ttgctgactc ataccaggcc tgaatcgccc catcatccag 5520 
ccagaaagtg agggagccac ggbbgabgag agcbbbgtbg taggtggacc agttggtgat 5580 
tttgaactbb tgcbbbgcca cggaacggfcc bgcgttgtcg ggaagatgcg tgatctgatc 5 640 
cttcaactca gcaaaagttc gatttattca acaaagccgc cgtcccgtca agtcagcgta 5700 
atgcbctgcc agtgttacaa ccaattaacc aattctgatt agaaaaactc abcgagcabc 5760 
aaatgaaact gcaatttatt catatcagga ttatcaatac catatttttg aaaaagccgt 5820 
ttctgtaatg aaggagaaaa ctcaccgagg cagttccata ggatggcaag atcctggtat 5 880 
cggbcbgcga ttccgactcg tccaacatca atacaaccta bbaatbbccc ctcgtcaaaa 5940 
ataaggttat caagtgagaa atcaccatga gtgacgactg aatccggtga gaatggcaaa 6000 
agcttatgca tttctttcca gacttgttca acaggccagc cattacgctc gtcatcaaaa 6060 
tcactcgcat caaccaaacc gttattcatt cgtgattgcg cctgagcgag acgaaatacg 6120 
cgatcgctgt taaaaggaca attacaaaca ggaatcgaat gcaaccggcg caggaacact 6180 
gccagcgcat caacaatatt ttcacctgaa tcaggatatt cttctaatac ctggaatgct 6240 
gtbttcccgg ggatcgcagt ggtgagtaac catgcabcab caggagbacg gabaaaatgc 6300 
bbgatggtcg gaagaggcat aaatbccgbc agccagbbba gtctgaccab cbcabcbgta 6360 
acatcabtgg caacgcbacc bbbgccabgb bbcagaaaca acbcbggcgc abcgggctbc 6420 
ccabacaatc gabagabbgt cgcaccbgat bgcccgacab batcgcgagc ccabtbatac 64 80 
ccababaaat cagcatccat gbtggaabbb aabcgcggcc bcgagcaaga cgtbtcccgb 654 0 
bgaabatggc bcataacacc ccbtgbabba cbgbttabgb aagcagacag btbtabbgbb 6600 
cabgabgaba babtbtbabc bbgtgcaabg taacatcaga gabtbbgaga cacaacgtgg 6660 
cbbbcccccc ccccccatba tbgaagcabt tabcagggbb abtgbctcab gagcggatac 6720 
ababbbgaab gbatbtagaa aaabaaacaa abaggggbbc cgcgcacabb bccccgaaaa 6780 
gbgccaccbg acgtcbaaga aaccatbabb abcatgacab baaccbabaa aaabaggcgb 6840 
abcacgaggc ccbtbcgbc 6859 



<210> 11 
<2U> 6965 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR!012x/s conbaining HIV genes 
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<400> 11 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagctfcgtct gbaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg bcggggcbgg cbtaactatg cggcatcaga gcagattgta ctgagagtgc 1B0 
accatatgcg gbgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt abbgacbagb tattaatagt aatcaatbac 360 
ggggtcatta gttcatagcc catatatgga gbbccgcgbb acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgbcaatgg gtggagtatt fcacggtaaac 540 
tgcccacttg gcagbacabc aagbgbabca babgccaagb acgcccccba ttgacgtcaa 600 
bgacggbaaa tggcccgcct ggcatbabgc ccagtacatg accttatggg actttcctac 660 
bbggcagbac atctacgtat tagtcatcgc battaccabg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggbbbgacbc acggggattt ccaagtctcc accccattga 780 
cgtcaabggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat ghcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt btgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
bctbabgcat gctatactgt ttttggcttg gggcctatac acccccgcbt ccbbabgcba 1140 
baggtgatgg tatagcttag cctataggbg tgggttattg accattattg . accactcccc 1200 
tattggtgac gabacttbcc attactaatc cataacabgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg babbtbtaca 1320 
ggatggggtc ccatbtatta bbtacaaabb cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
cabgggcbcb tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcab ggtcgctcgg cagcbccbtg cbccbaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagtbgtb gbabbcbgab aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcbbbbc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga acaccatgcg 1920 
cgtgaaggag aagtaccagc acctgtggcg ctggggctgg cgctggggca ccatgctgct 1980 
gggcatgctg atgatctgca gcgccaccga gaagctgtgg gtgaccgtgt actacggcgb 2040 
gcccgtgtgg aaggaggcca ccaccaccct gctctgcgcc agcgacgcca aggcctacga 2100 
caccgaggtg cacaacgtgt gggccaccca cgcctgcgtg cccaccgacc ccaaccccca 2160 
ggaggtggtg ctggtgaacg tgaccgagaa cttcgacatg tggaagaacg acabggbgga 2220 
gcagatgcac gaggacatca tcagcctgtg ggaccagagc ctgaagccct gcgtgaagct 2280 
gacccccctg tgcgtgagcc tgaagtgcac cgacctgaag aacgacacca acaccaacag 2340 
cagcagcggc cgcatgatca tggagaaggg cgagatcaag aactgcagct tcaacatcag 24 00 
caccagcatc cgcggcaagg tgcagaagga gtacgccttc ttctacaagc tggacahcat 2460 
ccccatcgac aacgacacca ccagctacag cctgaccagc tgcaacacca gcgtgatcac 2520 
ccaggcctgc cccaaggtga gcttcgagcc catccccaac cactactgcg cccccgccgg 2580 
cttcaccatc ctgaagtgca aggacaagaa gbtcaacggc aagggcccct gcaccaacgt 2640 
gagcaccgbg cagbgcaccc acggcatccg ccccgbggbg agcacccagc bgctgcbgaa 2700 
cgaccgccbg gccgaggagg aggbggbgat ccgcagcgcc aacttcgccg acaacgccaa 2760 
ggbgabcabc gbgcagctga acgagagcgb ggagabcaac tgcacccgcc ccaacaacaa 2 820 
cacccgcaag agcabccaca bcggccccgg ccgcgccbbc tacaccaccg gcgagabcab 2880 
cggcgacabc cgccaggccc acbgcaaccb gagccgcgcc aagtggaacg acacccbgaa 2940 
caagatcgtg abcaagctgc gcgagcagbb cggcaacaag accabcgtgb bcaagcacag 3000 
cagcggcggc gaccccgaga bcgtgaccca cagcbbcaac tgcggcggcg agbbcbbcba 3060 
cbgcaacagc acccagcbgb bcaacagcac ctggbbcaac agcaccbgga gcaccgaggg 3120 
cagcaacaac accgagggca gcgacaccab cacccbgccc bgccgcabca agcagabcab 3180 
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caacatgtgg cagaaggbgg gcaaggccat 
ctgcagcagc aacatcaccg gcctgctgct 
gagcgagatc ttccgcctgg gcggcggcga 
caagtacaag gtggtgaaga tcgagcccct 
cgbggtgcag cgcgagaagc gcgccgtggg 
cgccgccggc agcaccatgg gcgccgccag 
gctgagcggc atcgtgcagc agcagaacaa 
cctgctgcag cbgaccgtgt ggggcatcaa 
gcgctacctg aaggaccagc agctgctggg 
caccaccgcc gbgcccbgga acgccagctg 
ccacaccacc tggatggagt gggaccgcga 
cctgatcgag gagagccaga accagcagga 
caagtgggcc agccbgbgga actggttcaa 
atctgctgtg ccttctagtt gccagccatc 
gaccctggaa ggtgccactc ccactgtcct 
ttgtctgagt aggtgtcatt cbatbcbggg 
ggattgggaa gacaatagca ggcabgctgg 
gcbgaagaat: tgacccggtt cctcctgggc 
tgacacaccc tgtccacgcc cctggbbcbb 
gctcaggagg gctccgcctt caatcccacc 
cctcatcagc ccaccaaacc aaacctagcc 
aggctattaa gtgcagaggg agagaaaatg 
atcatagaat tttaaggcca tcatggcctt 
bgcgctcggt cgttcggctg cggcgagcgg 
tatccacaga atcaggggat aacgcaggaa 
ccaggaaccg taaaaaggcc gcgttgctgg 
agcatcacaa aaatcgacgc tcaagtcaga 
accaggcgtt tccccctgga agctccctcg 
ccggatacct gtccgccttt cbcccbbcgg 
gtaggtatct cagtbcggtg taggtcgttc 
ccgbtcagcc cgaccgctgc gcctbabccg 
gacacgactt atcgccactg gcagcagcca 
taggcggtgc tacagagttc ttgaagbggt 
tatttggtat ctgcgctctg ctgaagccag 
gatccggcaa acaaaccacc gctggtagcg 
cgcgcagaaa aaaaggabct caagaagatc 
agtggaacga aaactcacgt taagggatbb 
ccbagabccb tttaaattaa aaatgaagbt 
ctbggtctga cagttaccaa tgcttaatca 
ttcgttcatc cabagttgcc tgactcgggg 
aaggtgttgc tgactcatac caggcctgaa 
agccacggtt gatgagagcb ttgbbgbagg 
bbgccacgga acggbctgcg ttgbcgggaa 
aagbbcgatt tattcaacaa agccgccgbc 
tbacaaccaa btaaccaatb cbgabtagaa 
tfctabtcata bcaggatbat caabaccata 
agaaaacbca ccgaggcagt tccabaggab 
gacbcgbcca acatcaabac aacctabbaa 
bgagaaabca ccatgagbga cgacbgaabc 
bbbccagacb bgttcaacag gccagccatb 
caaaccgbba bbcabbcgbg atbgcgcctg 
aggacaatba caaacaggaa tcgaatgcaa 
aatatttbca cctgaatcag gatabbcttc 
cgcagtggtg agbaaccabg catcabcagg 
aggcataaat bccgbcagcc agtbbagbcb 



gbacgccccc cccatcagcg gccagatccg 3240 
gacccgcgac ggcggcaaca gcaacaacga 3300 
catgcgcgac aactggcgca gcgagctgba 3 3 60 
999cgtggcc cccaccaagg ccaagcgccg 3 4 20 
cabcggcgcc ctgtbccbgg gcttccbggg 3480 
cabgacccbg accgbgcagg cccgccagct 3540 
ccbgcbgcgc gccatcgagg cccagcagca 3600 
gcagctgcag gcccgcatcc bggccgbgga 366 0 
catcbggggc tgcagcggca agcttabctg 3720 
gagcaacaag agcctggagc agabcbggaa 3 780 
gatcaacaac bacaccagcc tgabccacag 3840 
gaagaacgag caggagcbgc tggagcbgga 390 0 
catcaccaac tggcbgtggb gaggatccag 3960 
bgbbgbbbgc ccctcccccg bgccbbccbt 4020 
bbccbaabaa aabgaggaaa btgcabcgca 4080 
gggtggggtg gggcaggaca gcaaggggga 4140 
ggatgcggbg ggctctabgg gbacccaggt 4200 
cagaaagaag caggcacabc ccctbctcbg 4260 
agttccagcc ccacbcatag gacacbcata 4320 
cgcbaaagba cbbggagcgg tcbctcccbc 4380 
bccaagagbg ggaagaaabb aaagcaagat 444 0 
cctccaacab gbgaggaagb aabgagagaa 4500 
aatcttccgc btcctcgctc acbgacbcgc 4560 
batcagcbca cbcaaaggcg gtaabacggt 4620 
agaacabgtg agcaaaaggc cagcaaaagg 4680 
cgbbbtbcca baggctccgc ccccctgacg 4740 
ggtggcgaaa cccgacagga ctabaaagat 4800 
bgcgcbcbcc bgbtccgacc ctgccgcbta 4860 
gaagcgbggc gcbttcbcat agcbcacgct 4920 
gctccaagcb gggcbgbgbg cacgaacccc 4980 
gtaacbabcg bcbbgagbcc aacccggbaa 5040 
cbggbaacag gabtagcaga gcgaggbatg 5100 
ggccbaacta cggctacacb agaagaacag 5160 
btaccttcgg aaaaagagbt ggbagcbctt 5220 
gtggtttbtb bgbtbgcaag cagcagabba 5280 
ctttgabcbb bbcbacgggg bcbgacgcbc 5340 
bggtcabgag atbatcaaaa aggatcbbca 5400 
tbaaatcaab cbaaagbaba babgagbaaa 5460 
gbgaggcacc tabcbcagcg abcbgtcbat 5520 
ggggggggcg cbgaggtcbg ccbcgtgaag 55 80 
tcgccccabc abccagccag aaagbgaggg 564 0 
bggaccagbb ggbgabbbbg aacbttbgcb 5700 
gabgcgbgab cbgabcctbc aacbcagcaa 5760 
ccgbcaagbc agcgbaabgc tcbgccagtg 5820 
aaacbcatcg agcabcaaab gaaacbgcaa 5880 
bbbtbgaaaa agccgttbct gbaabgaagg 5940 
ggcaagabcc tggtatcggb ctgcgabbcc 6000 
tbbccccbcg tcaaaaabaa ggbbabcaag 6060 
cggbgagaab ggcaaaagcb babgcabbbc 6120 
acgcbcgbca bcaaaatcac bcgcatcaac 6180 
agcgagacga aabacgcgab cgcbgbbaaa 624 0 
ccggcgcagg aacactgcca gcgcabcaac 6300 
taabaccbgg aabgctgtbb bcccggggab 6360 
agbacggata aaabgcbbga bggbcggaag 64 20 
gaccabcbca bcbgbaacat cabbggcaac 6480 
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gctacctttg ccatgtttca gaaacaactc tggcgcatcg ggcttcccat acaatcgata 6540 
gattgtcgca cctgattgcc cgacattatc gcgagcccat ttahacccat ataaatcagc 6600 
atccatgttg gaatttaatc gcggcctcga gcaagacgtt tcccgttgaa tatggctcat 6660 
aacaccccbt gtattactgt ttatgtaagc agacagtttt attgttcatg atgatatatt 6720 
tttatcttgt gcaatgtaac atcagagatt ttgagacaca acgtggcttt cccccccccc 6780 
ccattafctga agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat 6840 
ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt 6900 
ctaagaaacc athattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt 6960 
tcgbc S965 



<210> 12 
<211> 6784 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 12 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatca ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagcatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
cafc gggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagcfcccfctg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 162 0 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 18 00 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgfccgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 2040 
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cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtgaacgt gaccgagaac bbcgacabgt ggaagaacga catggtggag 2220 
cagatgcacg aggacatcat cagccbgbgg gaccagagcc tgaagccctg cgtgaagctg 22 80 
acccccctgt gcgbgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcabgabcab ggagaagggc gagatcaaga actgcagctt caacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag bacgccbbct tctacaagct ggacatcatc 2460 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgtgatcacc 2520 
caggcctgcc ccaaggtgag cttcgagccc atccccaacc actactgcgc ccccgccggc 2580 
ttcgccatcc tgaagtgcaa ggacaagaag ttcaacggca agggcccctg caccaacgtg 2640 
agcaccgtgc agtgcaccca cggcatccgc cccgtggtga gcacccagct gctggttacg 2700 
ggtaacctgg ccgaggagga ggtggtgatc cgcagcgcta acttcgccga caacgccaag 2760 
gtgabcabcg tgcagctgaa cgagagcgtg gagatcaact gcacccgccc caacaacaac 2820 
acccgcaaga gcatccacat cggccccggc cgcgccttct acaccaccgg cgagatcatc 2 8 80 
ggcgacatcc gccaggccca ctgcaaccbg agccgcgcca agtggaacga caccctgaac 2940 
aagatcgtga tcaagctgcg cgagcagttc ggcaacaaga ccabcgbgbb caagcacagc 3000 
agcggcggcg accccgagat cgbgacccac agcttcaact gcggcggcga gttcttctac 3060 
tgcaacagca cccagctgbb caacagcacc tggttcaaca gcacctggag caccgagggc 3120 
agcaacaaca ccgagggcag cgacaccatc accctgccct gccgcatcaa gcagatcatc 3180 
aacatgtggc agaaggtggg caaggccatg tacgcccccc ccatcagcgg ccagatccgc 3240 
tgcagcagca acatcaccgg cctgctgctg acccgcgacg gcggcaacag caacaacgag 3300 
agcgagatct bccgccbggg cggcggcgac atgcgcgaca actggcgcag cgagcbgbac 3360 
aagtacaagg tggtgaagat cgagccccbg ggcgtggccc ccaccaaggc caagcfctacc 342 0 
gtccaggccc gccagcbgcb gagcggcatc gtgcagcagc agaacaacct gctgcgcgcc 34 80 
atcgaggccc agcagcacct gctgcagctg accgtghggg gcatcaagca gctgcaggcc 3540 
cgcaccctgg ccgtggagcg ctacctgaag gaccagcagc tgctcgagca gatctggaac 3600 
cacaccacct ggatggagtg ggaccgcgag atcaacaact acaccagcct gatccacagc 3660 
ctgatcgagg agagccagaa ccagcacgag aagaacgagc aggagctgct ggagctggac 3720 
aagbgggcca gcctgbggaa ctggttcaac atcaccaact ggctgtggtg aggatccaga 3780 
tctgctgtgc cbtcbagbtg ccagccatct gttgtttgcc cctcccccgt gccttccttg 3840 
accctggaag gtgccactcc cactgtcctt tccbaabaaa atgaggaaat tgcatcgcat 3900 
tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag 3960 
gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggg tacccaggtg 4020 
ctgaagaatt gacccggttc ctcctgggcc agaaagaagc aggcacatcc ccttctctgt 4080 
gacacaccct gtccacgccc cbggttcbba gttccagccc cactcatagg acactcatag 4140 
ctcaggaggg ctccgccttc aatcccaccc gctaaagtac bbggagcggt ctcbcccbcc 4200 
ctcatcagcc caccaaacca aacctagcct ccaagagtgg gaagaaatta aagcaagata 4260 
ggctattaag tgcagaggga gagaaaatgc ctccaacatg tgaggaagta atgagagaaa 4320 
tcatagaatt ttaaggccat cabggccbta abcbbccgct tcctcgctca ctgactcgct 4380 
gcgctcggtc gttcggctgc ggcgagcggb atcagctcac tcaaaggcgg taatacggtt 4440 
atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 4500 
caggaaccgb aaaaaggccg cgttgctggc gbbbbtccat aggctccgcc cccctgacga 4560 
gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 4620 
ccaggcgttt ccccctggaa gcbccctcgb gcgctctccb gttccgaccc tgccgcbbac 4680 
cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 4740 
taggtatctc agttcggtgb aggbcgbbcg ctccaagctg ggctgtgtgc acgaaccccc 4800 
cgttcagccc gaccgctgcg ccttatccgg taactatcgt ctbgagtcca acccggtaag 4860 
acacgactta tcgccactgg cagcagccac tggtaacagg abbagcagag cgaggtatgt 4 920 
a 99cggtgct acagagttct tgaagtggtg gcctaactac ggcbacacba gaagaacagt 4 980 
atbbggbabc bgcgcbctgc bgaagccagt bacctbcgga aaaagagbtg gbagctcbbg 504 0 
atccggcaaa caaaccaccg ctggbagcgg bggbttbbtt gbbbgcaagc agcagabbac 5100 
gcgcagaaaa aaaggabcbc aagaagatcc btbgabcbbt tcbacggggb ctgacgcbca 5160 
gbggaacgaa aacbcacgbb aagggatbtb ggtcabgaga tbatcaaaaa ggatcttcac 5220 
cbagabcctb tbaaabtaaa aabgaagbbt taaabcaatc taaagtabab abgagtaaac 5280 
ttggbcbgac agbbaccaat gcttaabcag bgaggcacct abctcagcga bctgtctabb 5340 
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tcgttcatcc atagttgcct gactcggggg gggggggcgc tgaggtctgc ctcgtgaaga 5400 
aggtgttgct gactcatacc aggcctgaat cgccccatca tccagccaga aagtgaggga 5460 
gccacggttg atgagagctb tgttgtaggt ggaccagttg gtgattttga acbbbbgcbb 5520 
tgccacggaa cggtctgcgb bgbcgggaag atgcgtgatc tgatccttca actcagcaaa 5580 
agbbcgabbb attcaacaaa gccgccgtcc cgbcaagbca gcgtaatgct ctgccagtgh 5640 
tacaaccaat baaccaattc bgatbagaaa aactcatcga gcatcaaatg aaactgcaat 5700 
tbattcatat caggattatc aataccatat bbbbgaaaaa gccgtbtcbg taatgaagga 5760 
gaaaacbcac cgaggcagbb ccataggatg gcaagabcct ggtatcggbc bgcgabtccg 5820 
actcgbccaa catcaabaca accbabbaab bbccccbcgb caaaaabaag gbbabcaagb 5880 
gagaaabcac catgagbgac gacbgaabcc ggbgagaabg gcaaaagcbb abgcabbbcb 5940 
btccagacbb gbtcaaeagg ccagccabba cgcbcgtcat caaaatcacb cgcatcaacc 6000 
aaaccgttat bcabbcgbga tbgcgccbga gcgagacgaa abacgcgabc gcbgttaaaa 6060 
ggacaabbac aaacaggaab cgaabgcaac cggcgcagga acacbgccag cgcatcaaca 6120 
atabbttcac cbgaabcagg ababbcbbcb aataccbgga abgcbgbbbb cccggggatc 6180 
gcagbggbga gbaaccabgc abcabcagga gbacggabaa aabgcbbgab ggbcggaaga 6240 
ggcabaaabb ccgfccagcca gbbbagbcbg accabcbcat cbgbaacabc abbggcaacg 6300 
cbaccbbbgc catgbbbcag aaacaacbcb ggcgcabcgg gcbbcccata caabcgabag 6360 
atbgtcgcac cbgabbgccc gacabtatcg cgagcccabt tabacccaba taaabcagca 6420 
bccabgtbgg aatbbaabcg cggcctcgag caagacgbbt cccgttgaab abggcbcaba 6480 
acaccccbbg babbactgbt tabgbaagca gacagtbbba bbgbbcabga bgabababbt 6540 
bbabcbbgbg caabgtaaca tcagagabbb bgagacacaa cgbggcbbbc cccccccccc 6600 
cabbabbgaa gcabbbabca gggtbabbgb ctcabgagcg gabacababb bgaabgtabb 6660 
bagaaaaata aacaaabagg ggbtccgcgc acabbbcccc gaaaagbgcc accbgacgbc 6720 
taagaaacca bbabtabcat gacabbaacc babaaaaaba ggcgbabcac gaggcccbbt 6780 

cgbc ~ 6784 
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<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 13 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagctbgbcb gtaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
ttggcgggbg bcggggctgg ctbaacbabg cggcabcaga gcagabbgta ctgagagbgc 180 
accabatgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
cbabbggcca bbgcatacgb bgbabccaba bcabaatabg tacabtbata ttggcbcabg 300 
bccaacabba ccgccabgbb gacabbgabb abbgacbagb babtaatagb aabcaabbac 3 60 
ggggbcabba gbbcabagcc cabababgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabtgacg bcaabaabga cgtabgbbcc 480 
cabagbaacg ccaabaggga cbbbccabbg acgtcaabgg gtggagtabb bacggbaaac 540 
bgcccacbtg gcagbacabc aagbgbabca babgccaagb acgcccccba bbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabtabgc ccagbacabg accbbatggg acbbbccbac 660 
bbggcagbac abcbacgbab bagbcabcgc babbaccabg gbgabgcggb bbbggcagba 720 
cabcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagbcbcc accccabbga 780 
cgbcaabggg agbtbgbbbb ggcaccaaaa tcaacgggac bbbccaaaat gbcgbaacaa 840 
cbccgcccca tbgacgcaaa bgggcggbag gcgbgbacgg bgggaggtct ababaagcag 900 
agcbcgbbba gbgaaccgbc agabcgccbg gagacgccab ccacgctgtt tbgaccbcca 960 
bagaagacac cgggaccgab ccagcctccg cggccgggaa cggbgcabbg gaacgcggab 1020 
bccccgbgcc aagagtgacg baagbaccgc cbabagacbc babaggcaca ccccbbbggc 1080 



- 30 - 



WO 02/32943 PCT/US01/25721 

* 

tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
cattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
9gatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 ' 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga acaccatgcg 1920 
cgtgaaggag aagtaccagc acctgtggcg ctggggctgg cgctggggca ccatgctgct 1980 
gggcatgctg atgatctgca gcgccaccga gaagctgtgg gtgaccgtgt actacggcgt 204 0 
gcccgtgtgg aaggaggcca ccaccaccct gctctgcgcc agcgacgcca aggcctacga 2100 
caccgaggtg. cacaacgtgt gggccaccca cgcctgcgtg cccaccgacc ccaaccccca 2160 
ggaggtggtg ctggtgaacg tgaccgagaa cttcgacatg tggaagaacg acatggtgga 2220 
gcagatgcac gaggacatca tcagcctgtg ggaccagagc ctgaagccct gcgtgaagct 2280 
gacccccctg tgcgtgagcc tgaagtgcac cgacctgaag aacgacacca acaccaacag 2340 
cagcagcggc cgcatgatca tggagaaggg cgagatcaag aactgcagct tcaacatcag 2400 
caccagcatc cgcggcaagg tgcagaagga gtacgccttc ttctacaagc tggacatcat 2460 
ccccatcgac aacgacacca ccagctacag cctgaccagc tgcaacacca gcgtgatcac 2520 
ccaggcctgc cccaaggtga gcttcgagcc catccccaac cactactgcg cccccgccgg 2580 
cttcaccatc ctgaagtgca aggacaagaa gttcaacggc aagggcccct gcaccaacgt 2640 
gagcaccgtg cagtgcaccc acggcatccg ccccgtggtg agcacccagc tgctgctgaa 2700 
cgaccgcctg gccgaggagg aggtggtgat ccgcagcgcc aacttcgccg acaacgccaa 2760 
ggtgatcatc gtgcagctga acgagagcgt ggagatcaac tgcacccgcc ccaacaacaa 2 820 
cacccgcaag agcatccaca tcggccccgg ccgcgccttc tacaccaccg gcgagatcat 2880 
cggcgacatc cgccaggccc actgcaacct gagccgcgcc aagtggaacg acaccctgaa 2 94 0 
caagatcgtg atcaagctgc gcgagcagtt cggcaacaag accatcgtgt tcaagcacag 3000 
cagcggcggc gaccccgaga tcgtgaccca cagcttcaac tgcggcggcg agttcttcta 3060 
ctgcaacagc acccagctgt tcaacagcac ctggttcaac agcacctgga gcaccgaggg 3120 
cagcaacaac accgagggca gcgacaccat caccctgccc tgccgcatca agcagatcat 3180 
caacatgtgg cagaaggtgg gcaaggccat gtacgccccc cccatcagcg gccagatccg 3240 
ctgcagcagc aacatcaccg gcctgctgct gacccgcgac ggcggcaaca gcaacaacga 3300 
gagcgagatc ttccgcctgg gcggcggcga catgcgcgac aactggcgca gcgagctgta 3360 
caagtacaag gtggtgaaga tcgagcccct gggcgtggcc cccaccaagg ccaagcgccg 3420 
cgtggtgcag cgcgagaagc gcgccgtggg catcggcgcc ctgttcctgg gcttcctggg 3480 
cgccgccggc agcaccatgg gcgccgccag catgaccctg accgtgcagg cccgccagct 3540 
gctgagcggc atcgtgcagc agcagaacaa cctgctgcgc gccatcgagg cccagcagca 3600 
cctgctgcag ctgaccgtgt ggggcatcaa gcagctgcag gcccgcatcc tggccgtgga 3660 
gcgctacctg aaggaccagc agctgctggg catctggggc tgcagcggca agcttatctg 3720 
caccaccgcc gtgccctgga acgccagctg gagcaacaag agcctggagc agatctggaa 3780 
ccacaccacc tggatggagt gggaccgcga gatcaacaac tacaccagcc tgatccacag 3840 
cctgatcgag gagagccaga accagcagga gaagaacgag caggagctgc tggagctgga 3900 
caagtgggcc agcctgtgga actggttcaa catcaccaac tggctgtggt acatcaagct 3960 
gttcatcatg atcgtgggcg gcctggtggg cctgcgcatc gtgttcgccg tgctgagcat 4020 
ctgaggatcc agatctgctg tgccttctag ttgccagcca tctgttgttt gcccctcccc 4080 
cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga 4140 
aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga 4200 
cagcaagggg gaggattggg aagacaatag caggcatgct ggggatgcgg tgggctctat 4260 
gggtacccag gtgctgaaga attgacccgg ttcctcctgg gccagaaaga agcaggcaca 4320 
tccccttctc tgtgacacac cctgtccacg cccctggttc ttagttccag ccccactcat 4380 
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aggacactca tagctcagga gggcbccgcc ttcaatccca cccgctaaag tacttggagc 4440 
ggbcbcbccc bcccbcabca gcccaccaaa ccaaacctag cctccaagag tgggaagaaa 4500 
ttaaagcaag ataggctatt aagtgcagag ggagagaaaa bgcctccaac atgtgaggaa 4560 
gtaatgagag aaatcataga abbbbaaggc catcatggcc bbaabcbtcc gcbbccbcgc 4620 
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtabcagcb cactcaaagg 4680 
cggbaabacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 4740 
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgbbgcb ggcgbbbbbc cataggctcc 4800 
gcccccctga cgagcahcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 4860 
gacfcataaag abaccaggcg bbbccccctg gaagctccct cgtgcgctct ccbgbbccga 4920 
cccbgccgcb baccggabac cbgtccgccb bbcbcccbbc gggaagcgbg gcgctbtctc 4980 
abagcbcacg cbgbaggbab cbcagbbcgg bgtaggbcgb bcgctccaag ctgggctgbg 5040 
bgcacgaacc ccccgbbcag cccgaccgcb gcgccbbabc cggbaactab cgbcbtgagb 5100 
ccaacccggb aagacacgac bbabcgccac bggcagcagc cacbggbaac aggabtagca 5160 
gagcgaggta bgtaggcggb gcbacagagb bcbbgaagbg gtggccbaac bacggcbaca 5220 
cbagaagaac agtabbbggt abcbgcgctc bgcbgaagcc agbbaccbbc ggaaaaagag 5280 
bbggtagcbc bbgabccggc aaacaaacca ccgctggbag cggtggbbbb bbbgtbbgca 5340 
agcagcagab bacgcgcaga aaaaaaggab cbcaagaaga bcctbbgatc ttbbctacgg 5400 
ggbcbgacgc bcagbggaac gaaaacbcac gbbaagggat bttggbcatg agabtabcaa 5460 
aaaggatctb caccbagabc cbbbbaaabb aaaaabgaag bbbtaaabca abctaaagba 5520 
batabgagta aacbbggtcb gacagbtacc aatgcbbaab cagbgaggca ccbatcbcag 5580 
cgabctgbcb abtbcgbtca bccabagbbg ccbgacbcgg gggggggggg cgcbgaggbc 5640 
bgccbcgbga agaaggbgtt gcbgacbcab accaggccbg aabcgcccca bcatccagcc 5700 
agaaagbgag ggagccacgg btgabgagag cbbbgttgba ggbggaccag bbggtgabtt 5760 
bgaactbbtg cbbbgccacg gaacggbcbg cgbbgtcggg aagabgcgtg abcbgabccb 5820 
bcaacbcagc aaaagbbcga tbbabbcaac aaagccgccg bcccgbcaag tcagcgbaab 5880 
gcbcbgccag tgbbacaacc aabtaaccaa bbcbgabbag aaaaacbcab cgagcabcaa 5940 
abgaaacbgc aabtbabbca babcaggabt abcaabacca babbbbbgaa aaagccgbbb 6000 
cbgbaabgaa ggagaaaacb caccgaggca gbtccabagg abggcaagab ccbggbatcg 6060 
gbcbgcgabb ccgactcgbc caacabcaab acaacctabb aabbbccccb cgbcaaaaat 6120 
aaggtbabca agbgagaaab caccabgagb gacgacbgaa tccggtgaga atggcaaaag 6180 
cbbabgcabt bcbbbccaga cbbgbbcaac aggccagcca bbacgctcgt cabcaaaabc 6240 
acbcgcatca accaaaccgb babbcabbcg bgabbgcgcc bgagcgagac gaaabacgcg 6300 
abcgctgbba aaaggacaab bacaaacagg aabcgaabgc aaccggcgca ggaacacbgc 6360 
cagcgcatca acaababbbb caccbgaabc aggatabtcb bcbaabaccb ggaatgcbgt 6420 
bbbcccgggg abcgcagbgg bgagbaacca bgcabcabca ggagbacgga baaaabgctt 6480 
gatggbcgga agaggcabaa abtccgbcag ccagbbbagb cbgaccabcb cabcbgtaac 6540 
abcabbggca acgcbacctb bgccabgbbb cagaaacaac bcbggcgcab cgggcbbccc 6600 
abacaabcga bagabbgbcg caccbgabbg cccgacabba bcgcgagccc abbbabaccc 6660 
ababaaabca gcabccabgb bggaatbbaa bcgcggccbc gagcaagacg bbbcccgbtg 6720 
aababggcbc abaacacccc tbgbatbacb gtbbabgtaa gcagacagtt bbatbgbbca 6780 
bgabgababa bbbbbabctb gbgcaabgba acabcagaga bbbbgagaca caacgbggcb 6840 
bbcccccccc ccccabbabb gaagcabbba bcagggbbab . bgbcbcabga gcggabacab 6900 
abbbgaabgt atbbagaaaa abaaacaaab aggggbbccg cgcacabbbc cccgaaaagb 6960 
gccaccbgac gtcbaagaaa ccabbabbat cabgacabta accbabaaaa abaggcgbab 7020 
cacgaggccc btbcgbc ~ 7037 
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tcgcgcgttt cggtgatgac ggtgaaaacc 
cagcbbgtcb gtaagcggab gccgggagca 
bbggcgggfcg bcggggctgg cbbaacbabg 
accatatgcg gtgtgaaata ccgcacagat 
ctattggcca ttgcatacgt tgtatccata 
bccaacabba ccgccatgtt gacattgatt 
ggggbcabba gbbcabagcc catafcabgga 
cccgcctggc tgaccgccca acgacccccg 
cabagtaacg ccaataggga ctttccattg 
bgcccacbbg gcagtacatc aagtgtatca 
tgacggtaaa tggcccgcct ggcattatgc 
ttggcagtac abctacgbab tagtcatcgc 
catcaatggg cgtggatagc ggtttgacbc 
cgtcaatggg agttbgttbb ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agcbcgbbba gtgaaccgtc agatcgcctg 
tagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagtgacg taagtaccgc 
bcbbabgcab gctatactgt bbbbggctbg 
taggtgatgg tatagcttag cctataggtg 
babtggbgac gatactttcc abbactaabc 
tattggctat atgccaatac tctgtccttc 
ggatggggtc ccatttatta tttacaaatt 
cgcagtfcttt attaaacata gcgbgggatc 
cabgggcbcb tcbccggtag cggcggagcb 
agcggcbcab ggtcgcbcgg cagctccbtg 
agcacaabgc ccaccaccac cagbgtgccg 
gaaaabgagc gbggagatbg ggcbcgcacg 
gcagaagaag abgcaggcag cbgagbbgbb 
gbbgcggbgc bgtbaacggt ggagggcagb 
cgcgccacca gacabaabag ctgacagacb 
tgcagtcacc gtcgtcgaca cgtgtgabca 
gbgaaggaga agtaccagca cctgbggcgc 
ggcabgcbga bgabcbgcag cgccaccgag 
cccgbgtgga aggaggccac caccacccbg 
accgaggbgc acaacgbgbg ggccacccac 
gaggbggbgc bggbgaacgb gaccgagaac 
cagabgcacg aggacatcab cagccbgbgg 
accccccbgb gcgbgagccb gaagtgcacc 
agcagcggcc gcabgabcat ggagaagggc 
accagcafccc gcggcaaggb gcagaaggag 
cccabcgaca acgacaccac cagctacagc 
caggccbgcc ccaaggbgag cbbcgagccc 
tbcgccatcc bgaagbgcaa ggacaagaag 
agcaccgtgc agbgcaccca cggcatccgc 
ggbaacctgg ccgaggagga ggbggtgatc 
gbgatcatcg bgcagcbgaa cgagagcgbg 
acccgcaaga gcabccacab cggccccggc 
ggcgacabcc gccaggccca cbgcaaccbg 
aagabcgtga bcaagctgcg cgagcagttc 
agcggcggcg accccgagab cgtgacccac 
bgcaacagca cccagcbgtb caacagcacc 
agcaacaaca ccgagggcag cgacaccatc 



bctgacacat gcagcbcccg gagacggtca 60 
gacaagcccg bcagggcgcg tcagcgggtg 120 
cggcabcaga gcagatbgba cbgagagtgc 180 
gcgtaaggag aaaataccgc abcagabtgg . 24.0 
bcataababg tacatttata tbggcbcabg 300 
atbgactagt babtaatagt aabcaabbac 360 
gbbccgcgbt acabaacbba cggbaaabgg 420 
cccabbgacg bcaabaabga cgbabgbbcc 480 
acgbcaabgg gbggagtatb bacggbaaac 540 
batgccaagt acgcccccta tbgacgbcaa 600 
ccagtacabg acctbabggg actbbccbac 660 
batbaccabg gbgatgcggb bbtggcagba 720 
acggggatbb ccaagbcbcc accccabbga 780 
bcaacgggac bbtccaaaab gbcgbaacaa 840 
gcgbgbacgg bgggaggbcb atabaagcag 900 
gagacgccat ccacgctgtt tbgaccbcca 960 
cggccgggaa cggbgcattg gaacgcggab 1020 
cbabagactc tataggcaca cccctbbggc 1080 
gggcctabac acccccgcbb ccbbabgcba 1140 
bgggbtabbg accattabbg accacbcccc 1200 
cabaacabgg cbcbbtgcca caactatcbc 1260 
agagacbgac acggacbcbg babbbbtaca 1320 
cacabataca acaacgccgb cccccgtgcc 1380 
bccacgcgaa tctcgggtac gtgtbccgga 14'40 
bccacabccg agccctggbc ccabgcctcc 1500 
cbcctaacag bggaggccag acbtaggcac 1560 
cacaaggccg tggcggtagg gbabgtgbcb 162 0 
gcbgacgcag abggaagacb taaggcagcg 1680 
gbabtctgat aagagtcaga ggbaacbccc 1740 
gbagtcbgag cagtactcgt bgctgccgcg 1800 
aacagacbgt bcctbtccab gggtcbbbtc 1860 
gababcgcgg ccgcbctaga caccabgcgc 1920 
bggggctggc gcbggggcac cabgcbgcbg 1980 
aagctgtggg bgaccgbgba ctacggcgbg 2040 
cbcbgcgcca gcgacgccaa ggcctacgac 2100 
gccbgcgbgc ccaccgaccc caacccccag 2160 
bbcgacabgb ggaagaacga cabggbggag 2220 
gaccagagcc tgaagcccbg cgbgaagcbg 22 80 
gaccbgaaga acgacaccaa caccaacagc 2340 
gagabcaaga actgcagcbb caacabcagc 2400 
fcacgccbtct bctacaagcb ggacatcatc 2460 
cbgaccagcb gcaacaccag cgtgabcacc 2520 
abccccaacc acbacbgcgc ccccgccggc 2580 
bbcaacggca agggccccbg caccaacgtg 2640 
cccgbggbga gcacccagct gcbggbbacg 2700 
cgcagcgcta actbcgccga caacgccaag 2760 
gagabcaacb gcacccgccc caacaacaac 2820 
cgcgccbbct acaccaccgg cgagatcabc 2880 
agccgcgcca agtggaacga cacccbgaac 2940 
ggcaacaaga ccabcgtgbb caagcacagc 3000 
agcbbcaacb gcggcggcga . gbbcbbcbac 3060 
bggtbcaaca gcaccbggag caccgagggc 3120 
acccbgcccb gccgcabcaa gcagabcabc 3180 
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aacatgtggc agaaggtggg caaggccabg tacgcccccc ccabcagcgg ccagatccgc 3240 
tgcagcagca acatcaccgg cctgctgctg acccgcgacg gcggcaacag caacaacgag 3300 
agcgagatct tccgcctggg cggcggcgac atgcgcgaca actggcgcag cgagctgtac 3360 
aagtacaagg tggtgaagat cgagcccctg ggcgtggccc ccaccaaggc caagcttacc 3420 
gtccaggccc gccagctgcb gagcggcatc gtgcagcagc agaacaacct gctgcgcgcc 34 80 
atcgaggccc agcagcacct gctgcagctg accgbgbggg gcatcaagca gctgcaggcc 3540 
cgcacccbgg ccgtggagcg ctacctgaag gaccagcagc tgctcgagca gatctggaac 3600 
cacaccacct ggatggagtg ggaccgcgag atcaacaact acaccagcct gatccacagc 3660 
ctgatcgagg agagccagaa ccagcacgag aagaacgagc aggagctgct ggagctggac 3720 
aagbgggcca gcctgtggaa cbggbtcaac atcaccaact ggcbgbggba catcaagctg 3780 
bbcabcabga tcgtgggcgg cctggtgggc cbgcgcabcg bgbbcgccgb gctgagcatc 3840 
bgaggabcca gabcbgctgb gccttctagt tgccagccat cbgbtgbbbg cccctccccc 3 900 
gtgccttcct bgacccbgga aggtgccact cccactgtcc bbbccbaaba aaatgaggaa 3960 
abbgcabcgc abbgbcbgag taggtgtcat tctattctgg ggggtggggt ggggcaggac 4020 
agcaaggggg aggattggga agacaatagc aggcabgcbg gggabgcggb gggctctatg 4080 
ggtacccagg tgctgaagaa bbgacccggb tccbccbggg ccagaaagaa gcaggcacat 4140 
ccccbbcbcb gtgacacacc ctgtccacgc ccctggttct tagttccagc cccactcata 4200 
ggacactcat agctcaggag ggctccgcct tcaatcccac ccgctaaagt acttggagcg 4260 
gtctctccct ccctcatcag cccaccaaac caaacctagc ctccaagagt gggaagaaat 4320 
taaagcaaga taggctatta agtgcagagg gagagaaaat gccbccaaca bgbgaggaag 4380 
taatgagaga aatcatagaa ttttaaggcc atcatggcct taatcttccg cttcctcgct 4440 
cactgactcg ctgcgctcgg tcgbtcggct gcggcgagcg gtatcagctc actcaaaggc 4500 
ggtaatacgg bbabccacag aatcagggga fcaacgcagga aagaacatgt gagcaaaagg 4560 
ccagcaaaag gccaggaacc gtaaaaaggc cgcgbbgcbg gcgttfcttcc ataggcbccg 4620 
cccccctgac gagcabcaca aaaabcgacg cbcaagtcag aggbggcgaa acccgacagg 4680 
actabaaaga baccaggcgb tbcccccbgg aagcbcccbc gbgcgcbcbc ctgbbccgac 4740 
ccbgccgcbb accggabacc bgbccgccbb bcbcccbtcg ggaagcgbgg cgcbbbcbca 4800 
tagcbcacgc bgbaggbabc tcagbbcggb gbaggbcgbt cgcbccaagc bgggcbgbgb 4860 
gcacgaaccc cccgbbcagc ccgaccgcbg cgccbbabcc ggbaacbabc gtcbbgagtc 4920 
caacccggba agacacgacb tabcgccacb ggcagcagcc acbggbaaca ggabtagcag 4980 
agcgaggtab gbaggcggbg cbacagagbb cbbgaagbgg bggccbaacb acggcbacac 5040 
tagaagaaca gbabbbggba bcbgcgctcb gcbgaagcca gbbaccbbcg gaaaaagagb 5100 
bggfcagctcb bgabccggca aacaaaccac cgctggbagc ggbggbbbbb btgbbbgcaa 5160 
gcagcagabb acgcgcagaa aaaaaggabc bcaagaagab cctbbgabcb tttctacggg 5220 
gbcbgacgcb cagbggaacg aaaacbcacg bbaagggatt bbggbcabga gabbabcaaa 5280 
aaggatctbc accbagatcc tbbbaaabba aaaabgaagt bbbaaabcaa bcbaaagbab 5340 
ababgagtaa acbbggbcbg acagbbacca abgcbbaabc agtgaggcac ctabcbcagc 5400 
gabcbgbcba bbbcgbbcab ccabagbtgc cbgacbcggg gggggggggc gcbgaggtcb 54 60 
gccbcgbgaa gaaggbgbbg cbgactcaba ccaggccbga abcgccccab cabccagcca 552 0 
gaaagbgagg gagccacggb tgabgagagc bbbgbbgbag gbggaccagt bggtgabbtb 5580 
gaacbbbtgc bbbgccacgg aacggbctgc gbbgtcggga agabgcgbga bcbgabcctb 564 0 
caacbcagca aaagbbcgab bbabbcaaca aagccgccgb cccgbcaagb cagcgbaatg 5700 
cbcbgccagb gbbacaacca abbaaccaab bcbgabbaga aaaacbcabc gagcabcaaa 5760 
bgaaacbgca abbbabbcab abcaggatba bcaabaccab abbbbbgaaa aagccgbbbc 5820 
bgbaabgaag gagaaaacbc accgaggcag btccabagga bggcaagabc ctggbabcgg 5880 
bcbgcgabbc cgacbcgbcc aacabcaaba caaccbabba abbbccccbc gtcaaaaata 5940 
aggbbabcaa gbgagaaabc accabgagbg acgacbgaab ccggbgagaa bggcaaaagc 6000 
bbabgcabbb cbbtccagac bbgbbcaaca ggccagccab bacgcbcgbc atcaaaabca 6060 
cbcgcabcaa ccaaaccgbb abbcabbcgb gabbgcgccb gagcgagacg aaabacgcga 6120 
bcgcbgbtaa aaggacaabb acaaacagga abcgaabgca accggcgcag gaacactgcc 6180 
agcgcabcaa caaCabbbtc accbgaatca ggatatbcbb cbaabaccbg gaabgctgtb 6240 
btcccgggga bcgcagtggb gagbaaccab gcabcabcag gagtacggab aaaatgcbtg 6300 
abggbcggaa gaggcabaaa tbccgbcagc cagbbbagbc bgaccabcbc abcbgtaaca 6360 
bcabbggcaa cgctaccbbb gccabgbbbc agaaacaacb cbggcgcabc gggcbbccca 6420 
bacaabcgab agabbgbcgc accbgabtgc ccgacabbab cgcgagccca bbbabaccca 6480 
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tataaatcag catccatgtt ggaatttaat 
atatggctca taacacccct tgbattacbg 
gatgatatat ttttatcttg tgcaatgtaa 
tccccccccc cccabtattg aagcatttat 
tbbgaatgba tttagaaaaa taaacaaata 
ccacctgacg tctaagaaac cattatfcatc 
acgaggccct ttcgtc 



cgcggcctcg agcaagacgt bbcccgttga 6540 
tttatgtaag cagacagbtt tattgfctcat 6600 
catcagagab btbgagacac aacgtggctt 6660 
cagggbtabt gbctcatgag cggatacaba 6720 
999gttccgc gcacatbbcc ccgaaaagbg 6780 
atgacatbaa ccbataaaaa baggcgbatc 6840 

6856 



<210> 15 
<211> 6505 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 15 

bcgcgcgbbb cggbgatgac ggtgaaaacc bcbgacacab gcagctcccg gagacggbca 60 
cagcttgbct gtaagcggat gccgggagca gacaagcccg bcagggcgcg tcagcgggtg 120 
ttggcgggtg bcggggctgg cbtaactatg cggcatcaga gcagatbgta. ctgagagbgc 180 
accatatgcg gtgtgaaata ccgcacagab gcgbaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcabacgt tgbabccata tcataatatg tacabbbaba bbggctcabg 300 
tccaacatta ccgccatgbt gacattgabt abbgactagt tattaatagt aatcaattac 360 
ggggtcatta gbbcabagcc catabatgga gttccgcgtt acataactba cggtaaabgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtabgbtcc 480 
cabagbaacg ccaataggga cttbccatbg acgtcaatgg gtggagtatt tacggbaaac 540 
tgcccacttg gcagtacabc aagbgtabca tabgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgccb ggcatbatgc ccagtacatg accttatggg acbbbcctac 660 
ttggcagtac atcbacgtat bagtcatcgc tabtaccatg gtgatgcggt tttggcagba 720 
catcaatggg cgbggatagc ggttbgactc acggggabbt ccaagtctcc accccabbga 780 
cgtcaatggg agtbtgtttt ggcaccaaaa bcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca btgacgcaaa tgggcggtag gcgtgtacgg tgggaggtcb atabaagcag 900 
agctcgttba gtgaaccgbc agatcgcctg gagacgccat ccacgctgtt ttgaccbcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg baagtaccgc cbatagactc bataggcaca ccccttbggc 1080 
bctbatgcab gctatactgt ttbbggcttg gggcctatac acccccgcbb ccbtatgcba 1140 
baggtgatgg tatagcbtag ccbabaggtg bgggtbattg accattattg accactcccc 12 00 
tatbggtgac gatactttcc attactaatc cataacatgg cbctttgcca caactabcbc 1260 
tattggctat atgccaatac tctgbccttc agagactgac acggacbcbg tattttbaca 1320 
ggatggggtc ccatttatta ttbacaaatt cacatataca acaacgccgt cccccgbgcc 13 80 
cgcagtttbt attaaacata gcgbgggatc tccacgcgaa bctcgggtac gtgttccgga 1440 
catgggctcb bctccggbag cggcggagcb bccacatccg agcccbggbc ccatgcctcc 1500 
agcggctcat ggtcgcbcgg cagcbccbtg ctcctaacag bggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagbgtgccg cacaaggccg bggcggtagg gtatgtgtct 1620 
gaaaatgagc gbggagabtg ggcbcgcacg gctgacgcag atggaagact baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgtb gtattctgab aagagtcaga ggbaacbccc 1740 
gtbgcggtgc tgbbaacggt ggagggcagt gtagtctgag cagbacbcgt bgcbgccgcg 1800 
cgcgccacca gacabaabag cbgacagact aacagacbgb tccbbbccat gggbcttbtc 1860 
bgcagbcacc gbcgtcgaca cgbgtgabca gababcgcgg ccgcbcbaga caccabgcgc 1920 
gbgaaggaga agtaccagca cctgtggcgc bggggctggc gcbggggcac catgcbgcbg 1980 
ggcabgcbga bgabcbgcag cgccaccgag gcbagcagcc tgaagbgcac cgaccbgaag 2040 
aacgacacca acaccaacag cagcagcggc cgcabgabca tggagaaggg cgagabcaag 2100 
aacbgcagcb tcaacabcag caccagcatc cgcggcaagg bgcagaagga gtacgccbtc 2160 
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ttctacaagc tggacatcat ccccatcgac aacgacacca ccagctacag cctgaccagc 2220 
tgcaacacca gcgtgatcac ccaggcctgc cccaaggtga gcttcgagcc catccccaac 2280 
cactactgcg cccccgccgg cttcgccatc ctgaagtgca aggacaagaa gttcaacggc 2340 
aagggcccct gcaccaacgt gagcaccgtg cagtgcaccc acggcatccg ccccgtggtg 24 00 
agcacccagc tgctgctgaa cggtagcctg gccgaggagg aggtggtgat ccgcagcgct 2460 
aacttcgccg acaacgccaa ggtgatcatc gtgcagctga acgagagcgt ggagatcaac 2520 
tgcacccgcc ccaacaacaa cacccgcaag agcatccaca tcggccccgg ccgcgccttc 2580 
tacaccaccg gcgagatcat cggcgacatc cgccaggccc actgcaacct gagccgcgcc 264 0 
aagtggaacg acaccctgaa caagatcgtg atcaagctgc gcgagcagtt cggcaacaag 2700 
accatcgtgt tcaagcacag cagcggcggc gaccccgaga tcgtgaccca cagcttcaac 2760 
tgcggcggcg agttcttcta ctgcaacagc acccagctgt tcaacagcac ctggttcaac 2820 
agcacctgga gcaccgaggg cagcaacaac accgagggca gcgacaccat caccctgccc 2880 
tgccgcatca agcagatcat caacatgtgg cagaaggtgg gcaaggccat gtacgccccc 2940 
cccatcagcg gccagatccg ctgcagcagc aacatcaccg gcctgctgct gacccgcgac 3000 
ggcggcaaca gcaacaacga gagcgagatc ttccgtccgg gcggcggcga catgcgcgac 3060 
aactggcgca gcgagctgta caagtacaag gtggtgaaga tcgagcccct gggcgtggcc 3120 
cccaccaagg ccaagcttac cgtccaggcc cgccagctgc tgagcggcat cgtgcagcag 3180 
cagaacaacc tgctgcgcgc catcgaggcc cagcagcacc tgctgcagct gaccgtgtgg 324 0 
ggcatcaagc agctgcaggc ccgcaccctg gccgtggagc gctacctgaa ggaccagcag 3300 
cagctcgagc agatctggaa ccacaccacc tggatggagt gggaccgcga gatcaacaac 3360 
tacaccagcc tgatccacag cctgatcgag gagagccaga accagcagga gaagaacgag 3420 
caggagctgc tggagctgga caagtgggcc agcctgtgga actggttcaa catcaccaac 3480 
tggctgtggt gaggatccag atctgctgtg ccttctagtt gccagccatc tgttgtttgc 3540 
ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 3600 
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 3660 
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggatgcggtg 3720 
ggctctatgg gtacccaggt gctgaagaat tgacccggtt cctcctgggc cagaaagaag 3780 
caggcacatc cccttctctg tgacacaccc tgtccacgcc cctggttctt agttccagcc 3840 
ccactcatag gacactcata gctcaggagg gctccgcctt caatcccacc cgctaaagta 3900 
cttggagcgg tctctccctc cctcatcagc ccaccaaacc aaacctagcc tccaagagtg 3960 
ggaagaaatt aaagcaagat aggctattaa gtgcagaggg agagaaaatg cctccaacat 4020 
gtgaggaagt aatgagagaa atcatagaat tttaaggcca tcatggcctt aatcttccgc 40 80 
ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca 4140 
ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg 4200 
agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca 4260 
taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa 4320 
cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc 43 80 
tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc 4440 
gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct 4500 
gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg 4560 
tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag 4620 
gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta 4680 
cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg 4740 
aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt 4800 
tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt 4860 
ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag 4920 
attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat 4980 
ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc 5040 
tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactcgggg ggggggggcg 5100 
ctgaggtctg cctcgtgaag aaggtgttgc tgactcatac caggcctgaa tcgccccatc 5160 
atccagccag aaagtgaggg agccacggtt gatgagagct ttgttgtagg tggaccagtt 5220 
ggtgattttg aacttttgct ttgccacgga acggtctgcg ttgtcgggaa gatgcgtgat 5280 
ctgatccttc aactcagcaa aagttcgatt tattcaacaa agccgccgtc ccgtcaagtc 5340 
agcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 54 00 
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 5460 
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agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 5520 

tggtatcggt ctgcgattcc gactcgtcca acatcaatac aaccfcattaa tttcccctcg 5580 

tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaabc cggtgagaat 5640 

ggcaaaagct tatgcatttc bttccagact tgttcaacag gccagccatt acgctcgtca 5700 

tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 5760 

aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcgaatgcaa ccggcgcagg 5820 

aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taabacctgg 5880 

aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 5940 

aaatgcttga tggtcggaag aggcabaaat tccgtcagcc agtttagtcb gaccabctca 6000 

bctgbaacab cabbggcaac gctaccttbg ccatgtttca gaaacaactc tggcgcatcg 6060 

ggcbbcccab acaabcgaba gatbgbcgca cctgattgcc cgacattabc gcgagcccat 6120 

btatacccat ataaatcagc abccabgttg gaatttaatc gcggcctcga gcaagacgbt 6180 

tcccgttgaa tatggctcafc aacacccctt gtattactgt ttatgtaagc agacagtbbb 6240 

abtgttcatg atgababatt tttatcttgt gcaatgbaac atcagagatb tbgagacaca 6300 

acgtggcttt cccccccccc ccabbattga agcabtbabc agggbtabtg tctcatgagc 63 60 

ggatacatat bbgaabgbab btagaaaaat aaacaaatag gggttccgcg cacatbtccc 6420 

cgaaaagtgc cacctgacgb cbaagaaacc attattatca bgacatbaac ctabaaaaat 6480 

aggcgtatca cgaggccctb bcgtc 6505 



<210> 16 
<211> 6505 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 16 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggbg tcggggctgg cttaactatg cggcatcaga gcagatbgta ctgagagtgc 180 
accatabgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagatbgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata btggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcabta gttcatagcc cabatatgga gbtccgcgtt acataactta cggbaaabgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gbggagtatb tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
bgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 72 0 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tbbccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttabgcta 1140 
taggtg'atgg tatagcttag cctabaggbg tgggttattg accattatbg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 13 80 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgtbccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggbc ccatgcctcc 1500 
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agcggcbcab ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 15 60 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gbabgbgbcb 1620 
gaaaabgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbtgbb gbabbctgab aagagtcaga ggtaactccc 1740 
gttgcggtgc bgbbaacggb ggagggcagt gbagbctgag cagtacbcgt bgcbgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggbcbbbtc 1860 
tgcagtcacc gtcgtcgaca cgtgbgabca gababcgcgg ccgctctaga caccatgcgc 1920 
gfcgaaggaga agtaccagca cctgtggcgc tggggcbggc gctggggcac catgctgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagcbgbggg bgaccgbgba ctacggcgtg 2040 
cccgtgbgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gccbgcgbgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtgaacgt gaccgagaac ttcgacabgt ggaagaacga cabggtggag 2220 
cagatgcacg aggacabcab cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 2280 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcat ggagaagggc gagatcaaga acbgcagcbt caacatcagc 24 00 
accagcatcc gcggcaaggt gcagaaggag tacgcctbct tctacaagct ggacatcatc 2460 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcaacaccgc tagcatcaac 2520 
tgcacccgcc ccaacaacaa cacccgcaag agcatccaca tcggccccgg ccgcgccttc 2580 
tacaccaccg gcgagatcat cggcgacatc cgccaggccc actgcaacct gagccgcgcc 264 0 
aagtggaacg acaccctgaa caagabcgbg atcaagctgc gcgagcagtt cggcaacaag 2700 
accatcgtgt tcaagcacag cagcggcggc gaccccgaga tcgtgaccca cagcttcaac 2 760 
tgcggcggcg agttcttcta ctgcaacagc acccagctgt tcaacagcac ctggttcaac 2820 
agcacctgga gcaccgaggg cagcaacaac accgagggca gcgacaccat caccctgccc 2880 
tgccgcatca agcagatcat caacatgtgg cagaaggtgg gcaaggccat gtacgccccc 2940 
cccatcagcg gccagatccg cbgcagcagc aacatcaccg gcctgctgct gacccgcgac 3 000 
ggcggcaaca gcaacaacga gagcgagatc ttccgtccgg gcggcggcga cabgcgcgac 3060 
aacbggcgca gcgagctgta caagtacaag gbggbgaaga bcgagcccct gggcgbggcc 3120 
cccaccaagg ccaagcbbac cgbccaggcc cgccagcbgc bgagcggcab cgbgcagcag 3180 
cagaacaacc bgcbgcgcgc cabcgaggcc cagcagcacc tgcbgcagcb gaccgbgtgg 324 0 
ggcabcaagc agcbgcaggc ccgcacccbg gccgbggagc gcbaccbgaa ggaccagcag 3300 
cagctcgagc agabctggaa ccacaccacc bggabggagb gggaccgcga gabcaacaac 3360 
bacaccagcc bgatccacag ccbgatcgag gagagccaga accagcagga gaagaacgag 3420 
caggagcbgc bggagcbgga caagbgggcc agccbgbgga acbggbtcaa catcaccaac 34 80 
bggctgbggb gaggabccag atcbgcbgbg ccbbcbagbb gccagccabc bgtbgtbtgc 3540 
cccbcccccg bgccbbccbb gacccbggaa ggbgccacbc ccacbgbccb bbccbaataa 3600 
aatgaggaaa btgcabcgca btgbcbgagb aggbgbcabb cbabbcbggg gggbggggbg 3660 
gggcaggaca gcaaggggga ggabbgggaa gacaabagca ggcabgctgg ggabgcggbg 3720 
ggcbcbatgg gbacccaggb gcbgaagaab bgacccggtb ccbccbgggc cagaaagaag 3780 
caggcacabc cccbbcbctg bgacacaccc bgbccacgcc ccbggtbcbb agbbccagcc 384.0 
ccacbcabag gacacbcata gcbcaggagg gcbccgcctb caabcccacc cgcbaaagba 3900 
cbbggagcgg bcbcbcccbc ccbcabcagc ccaccaaacc aaaccbagcc tccaagagbg 3960 
ggaagaaabt aaagcaagab aggcbatbaa gbgcagaggg agagaaaabg ccbccaacab 4020 
gbgaggaagb aabgagagaa abcabagaab bbbaaggcca bcabggccbb aabcbbccgc 4080 
bbccbcgcbc acbgacbcgc bgcgctcggb cgbtcggcbg cggcgagcgg tabcagcbca 4140 
cbcaaaggcg gbaabacggb batccacaga abcaggggab aacgcaggaa agaacabgbg 4200 
agcaaaaggc cagcaaaagg ccaggaaccg baaaaaggcc gcgbbgcbgg cgbbbbbcca 4260 
baggcbccgc cccccbgacg agcabcacaa aaabcgacgc bcaagbcaga ggbggcgaaa 4320 
cccgacagga ctabaaagab accaggcgbb bcccccbgga agcbcccbcg bgcgcbctcc 4380 
bgbbccgacc ctgccgcbha ccggabacct gbccgccbtb cbcccbbcgg gaagcgbggc 444 0 
gcbbtcbcab agcbcacgcb gbaggbabct cagbbcggbg taggbcgbbc gcbccaagcb 4500 
gggcbgbgbg cacgaacccc ccgbbcagcc cgaccgcbgc gccbbabccg gbaacbabcg 4560 
bcbbgagbcc aacccggbaa gacacgacbb atcgccacbg gcagcagcca cbggbaacag 4620 
gabbagcaga gcgaggtabg baggcggbgc bacagagbbc bbgaagbggt ggccbaacta 4680 
cggcbacacb agaagaacag batbbggbab cbgcgcbcbg cbgaagccag bbacctbcgg 4740 
aaaaagagbb ggbagctcbb gatccggcaa acaaaccacc gcbggbagcg gbggbbbttb 4 800 
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bgbbbgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc cbbbgabctb 4860 
ttctacgggg bctgacgcbc agtggaacga aaacbcacgt taagggattt bggbcabgag 4920 
abtabcaaaa aggatcttca cctagatcct tttaaattaa aaabgaagtt ttaaatcaat 4980 
cbaaagtaba batgagbaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc 5040 
tatctcagcg abcbgbcbat ttcgttcatc catagttgcc tgactcgggg ggggggggcg 5100 
cbgaggbcbg cctcgtgaag aaggtgbbgc tgactcatac caggcctgaa bcgccccatc 5160 
ahccagccag aaagtgaggg agccacggtt gatgagagct ttgttgtagg tggaccagtt 5220 
ggbgabbbbg aacbbbbgct ttgccacgga acggtctgcg tbgbcgggaa gabgcgbgab 52B0 
cbgabccbbc aactcagcaa aagttcgatt babbcaacaa agccgccgtc ccgtcaagtc 5340 
agcgtaatgc bcbgccagbg ttacaaccaa ttaaccaatt cbgabbagaa aaactcatcg 5400 
agcatcaaat gaaactgcaa tttabtcata tcaggattat caabaccaba tttttgaaaa 5460 
agccgbbbcb gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 5520 
bggbatcggb ctgcgattcc gactcgtcca acatcaatac aaccbabbaa tbtcccctcg 5580 
bcaaaaataa ggbbabcaag tgagaaatca ccatgagtga cgacbgaatc cggtgagaat 564 0 
ggcaaaagct babgcabbbc tttccagact tgbbcaacag gccagccatt acgcbcgbca 5700 
bcaaaabcac tcgcatcaac caaaccgtta bbcatbcgbg abtgcgcctg agcgagacga 5760 
aabacgcgab cgcbgbbaaa aggacaabba caaacaggaa bcgaabgcaa ccggcgcagg 5820 
aacacbgcca gcgcabcaac aabatbbtca ccbgaabcag gababbcbtc baabaccbgg 5880 
aabgcbgbbb bcccggggab cgcagbggbg agtaaccabg cabcabcagg agbacggaba 5940 
aaabgcbtga bggbcggaag aggcabaaat tccgbcagcc agttbagbct gaccabcbca 6000 
tcbgtaacab cattggcaac gcbacctbtg ccabgtbbca gaaacaactc bggcgcatcg 6060 
ggcbbcccat acaabcgaba gabbgbcgca cctgatbgcc cgacabbabc gcgagcccab 6120 
tbatacccab abaaabcagc atccabgbbg gaabtbaabc gcggcctcga gcaagacgbb 6180 
bcccgbbgaa batggcbcab aacaccccbb gbabbacbgt bbatgbaagc agacagbbfcb 6240 
abtgbbcabg abgabababb tbbabcbbgt gcaatgbaac atcagagabb btgagacaca 6300 
acgtggcbbt cccccccccc ccatbatbga agcabbtatc agggbbabbg bctcabgagc 6360 
ggatacabat bbgaabgtab bbagaaaaab aaacaaatag gggbbccgcg cacatttccc 6420 
cgaaaagbgc caccbgacgt cbaagaaacc abtabbatca bgacattaac cbataaaaab 6480 
aggcgbabca cgaggcccbb bcgbc 6505 



<210> 17 
<211> 6646 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 17 

tcgcgcgtbb cggbgabgac ggtgaaaacc bctgacacab gcagcbcccg gagacggbca 60 
cagcbbgbct gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
tbggcgggbg bcggggcbgg cbbaacbabg cggcatcaga gcagabtgba cbgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
cbabbggcca bbgcabacgb tgbabccaba bcabaababg bacabbtata bbggcbcatg 300 
tccaacabba ccgccabgbb gacabbgabt abtgacbagb babbaatagb aabcaabbac 360 
ggggbcabta gtbcabagcc cababatgga gttccgcgbb acabaactba cggbaaabgg 420 
cccgcctggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbatgbbcc 4 80 
catagtaacg ccaabaggga cbbtccabbg acgbcaatgg gtggagtatb bacggbaaac 5.40 
bgcccacbbg gcagbacabc aagbgbabca batgccaagb acgcccccba bbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbtatggg acbbtccbac 660 
tbggcagbac abcbacgbab bagbcabcgc batbaccabg gbgabgcggb bbtggcagba 720 
cabcaabggg cgbggabagc ggbbbgacbc acggggatbb ccaagtcbcc accccatbga 780 
cgbcaatggg agbbbgbtbb ggcaccaaaa bcaacgggac bttccaaaab gbcgtaacaa 840 
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ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agcbcgbbta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctabagactc babaggcaca cccctttggc 1080 
bctbabgcat gctatactgt bbbbggcbbg gggccbabac acccccgctt ccttatgcta 1140 
baggbgabgg tatagcttag ccbataggbg tgggttattg accattattg accactcccc 1200 
tattggtgac gabacbbbcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat abgccaatac bcbgtccbbc agagacbgac acggacbcbg babbbbbaca 1320 
ggabggggbc ccabtbabba ttbacaaabb cacabataca acaacgccgb cccccgbgcc 1380 
cgcagtbtbb abbaaacata gcgtgggabc tccacgcgaa bctcgggbac gbgbtccgga 1440 
cabgggcbcb bcbccggbag cggcggagcb tccacabccg agcccbggbc ccabgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbccbbg ctccbaacag bggaggccag actbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggtagg gbabgbgbcb 1620 
gaaaabgagc gtggagabbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgtb gtabbcbgab aagagtcaga ggbaacbccc 1740 
gbbgcggbgc bgbbaacggb ggagggcagt gtagbcbgag cagbactcgb tgcbgccgcg 1800 
cgcgccacca gacabaabag cbgacagacb aacagacbgb bccbbtccab gggbctbtbc 1860 
bgcagbcacc gbcgbcgaca cgbgbgatca gababcgcgg ccgcbcbaga caccabgcgc 1920 
gbgaaggaga agtaccagca ccbgbggcgc bggggcbggc gcbggggcac cabgcbgcbg 1980 
ggcatgcbga bgatcbgcag cgccaccgag aagcbgbggg bgaccgbgba cbacggcgbg 2040 
cccgtgbgga aggaggccac caccacccbg ctcbgcgcca gcgacgccaa ggcctacgac 2100 
accgaggbgc acaacgbgbg ggccacccac gccbgcgbgc ccaccgaccc caacccccag 2160 
gaggbggtgc bggtgaacgt gaccgagaac bbcgacabgb ggaagaacga cabggbggag 2220 
cagabgcacg aggacabcab cagccbgtgg gaccagagcc bgaagcccbg cgbgaagctg 2280 
accccccbgb gcgbgagccb gaagbgcacc gaccbgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgabcab ggagaagggc gagabcaaga acbgcagcbb caacabcagc 2400 
accagcatcc gcggcaaggt gcagaaggag bacgcctbct bcbacaagcb ggacabcatc 24 60 
cccabcgaca acgacaccac cagcbacagc ctgaccagct gcaacaccag cgbgatcacc 2520 
caggccbgcc ccaaggbgag cbbcgagccc atccccaacc acbactgcgc ccccgccggc 25 8 0 
bbcgccabcc bgaagbgcaa ggacaagaag bbcaacggca agggccccbg caccaacgbg 2640 
agcaccgbgc agtgcaccca cggcabccgc cccgbggbga gcacccagcb gcbgcbgaac 2700 
ggbagccbgg ccgaggagga ggtggbgabc cgcagcgcta acbbcgccga caacgccaag 2760 
gbgabcabcg bgcagcbgaa cgagagcgtg gagabcaact gcacccgccc caacaacaac 2820 
acccgcaaga gcabccacab cggccccggc cgcgccbbcb acaccaccgg cgagabcabc 2880 
ggcgacabcc gccaggccca cbgcaaccbg gcbagcbbct acbgcaacag cacccagcbg 2940 
bbcaacagca ccbggbbcaa cagcaccbgg agcaccgagg gcagcaacaa caccgagggc 3000. 
agcgacacca bcaccctgcc cbgccgcabc aagcagabca bcaacabgbg gcagaaggbg 3060 
ggcaaggcca bgbacgcccc ccccabcagc ggccagabcc gctgcagcag caacabcacc 3120 
ggccbgcbgc bgacccgcga cggcggcaac agcaacaacg agagcgagab cbbccgbccg 3180 
ggcggcggcg acabgcgcga caacbggcgc agcgagcbgt acaagbacaa ggbggbgaag 3240 
abcgagcccc bgggcgtggc ccccaccaag gccaagcbba ccgbccaggc ccgccagcbg 3300 
cbgagcggca bcgbgcagca gcagaacaac cbgcbgcgcg ccabcgaggc ccagcagcac 3360 
cbgcbgcagc bgaccgbgbg gggcabcaag cagctgcagg cccgcacccb ggccgbggag 3420 
cgctaccbga aggaccagca gcagctcgag cagatcbgga accacaccac ctggabggag 3480 
bgggaccgcg agabcaacaa cbacaccagc ctgatccaca gccbgabcga ggagagccag 3540 
aaccagcagg agaagaacga gcaggagctg cbggagcbgg acaagbgggc cagccbgbgg 3600 
aacbggbbca acabcaccaa cbggctgbgg bgaggabcca gabcbgcbgb gccbbcbagb 3660 
bgccagccat cbgbbgbbbg ccccbccccc gbgcctbcct tgacccbgga aggbgccacb 3720 
cccacbgbcc bbbccbaaba aaabgaggaa abtgcatcgc attgbcbgag baggbgbcab 3780 
bcbabtcbgg ggggtggggt ggggcaggac agcaaggggg aggabbggga agacaatagc 3840 
aggcabgcbg gggabgcggt gggcbcbabg ggbacccagg bgcbgaagaa btgacccggb 3900 
bccbccbggg ccagaaagaa gcaggcacab cccctbcbct gbgacacacc cbgbccacgc 3960 
cccbggbbcb bagbbccagc cccacbcaba ggacacbcab agcbcaggag ggcbccgccb 4020 
bcaabcccac ccgcbaaagb acbbggagcg gbcbctccct cccbcabcag cccaccaaac 4080 
caaaccbagc cbccaagagt gggaagaaab baaagcaaga baggcbabba agbgcagagg 4140 
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gagagaaaat gcctccaaca bgbgaggaag taatgagaga aatcatagaa ttttaaggcc 42 00 
abcabggcct taabcbtccg cttcctcgct cacbgacbcg cbgcgctcgg tcgttcggct 4260 
gcggcgagcg gbabcagcbc actcaaaggc ggtaatacgg ttatccacag aabcagggga 4320 
taacgcagga aagaacabgb gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 4380 
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 4440 
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt bbcccccbgg 4500 
aagcbcccbc gtgcgctctc ctgttccgac cctgccgcbt accggatacc bgbccgccbb 4560 
tctcccttcg ggaagcgtgg cgcbtbcbca tagctcacgc tgtaggtatc tcagttcggt 4620 
gbaggbcgbb cgcbccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 4680 
cgcctbatcc ggtaactatc gtcttgaghc caacccggta agacacgact babcgccacb 4740 
ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 4 800 
cttgaagtgg bggccbaacb acggctacac tagaagaaca gbabbtggba tctgcgctct 4860 
gctgaagcca gttaccttcg gaaaaagagt tggtagctcb bgabccggca aacaaaccac 4920 
cgctggtagc ggtggttttt ttgtttgcaa gcagcagabb acgcgcagaa aaaaaggatc 49B0 
tcaagaagat ccbbbgabcb tttctacggg gtctgacgct cagtggaacg aaaactcacg 5040 
ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 5100 
aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 5160 
atgcttaatc agbgaggcac ctatctcagc gatctgtcta tbbcgbbcab ccatagttgc 5220 
ctgacbcggg gggggggggc gcbgaggtct gccbcgbgaa gaaggbgbbg cbgacbcaba 5280 
ccaggcctga atcgccccab cabccagcca gaaagbgagg gagccacggb bgabgagagc 5340 
tbtgbbgtag gbggaccagb bggbgattbb gaacbbbbgc btbgccacgg aacggtctgc 5400 
gbtgtcggga agabgcgbga tctgabcctb caacbcagca aaagbtcgab ttattcaaca 5460 
aagccgccgb cccgtcaagt cagcgbaabg cbctgccagb gbtacaacca attaaccaab 5520 
tctgabbaga aaaactcabc gagcabcaaa bgaaacbgca abtbabtcat abcaggabba 5580 
bcaabaccab afctbtbgaaa aagccgttbc bgbaabgaag gagaaaacbc accgaggcag 5640 
bbccabagga tggcaagabc cbggbatcgg bcbgcgabtc cgacbcgbcc aacabcaaba 5700 
caaccbabba abbbccccbc gbcaaaaaba aggtbabcaa gbgagaaatc accabgagbg 5760 
acgacbgaab ccggbgagaa bggcaaaagc bbatgcabtb ctbbccagac bbgbbcaaca 5820 
ggccagccat bacgcbcgbc abcaaaabca cbcgcatcaa ccaaaccgbt abbcabtcgb 5880 
gabbgcgccb gagcgagacg aaatacgcga tcgcbgbbaa aaggacaabb acaaacagga 5940 
abcgaabgca accggcgcag gaacacbgcc agcgcatcaa caababbbbc accbgaabca 6000 
ggabattcbb cbaabaccbg gaabgcbgbb bbcccgggga bcgcagbggb gagbaaccab 6060 
gcabcabcag gagbacggab aaaabgcbbg abggbcggaa gaggcataaa tbccgbcagc 6120 
cagbbbagbc bgaccabcbc abcbgtaaca bcattggcaa cgctaccbbt gccatgbbtc 6180 
agaaacaacb cbggcgcabc gggcbtccca bacaabcgat agatbgtcgc accbgabtgc 6240 
ccgacabbat cgcgagccca bbbabaccca babaaabcag catccabgbb ggaabbbaab 6300 
cgcggccbcg agcaagacgb bbcccgtbga abatggcbca baacaccccb tgbabbacbg 6360 
bbbatgbaag cagacagbbb babbgbbcab gabgababab bbtbabcbtg tgcaabgtaa 6420 
cabcagagat bbbgagacac aacgtggcbb bccccccccc cccabbabbg aagcabbtab 6480 
cagggbbabt gbcbcatgag cggatacaba bbbgaatgba tbtagaaaaa taaacaaaba 6540 
ggggtbccgc gcacabbbcc ccgaaaagbg ccaccbgacg bctaagaaac catbatbabc 6600 
abgacabbaa ccbabaaaaa taggcgtabc acgaggcccb bbcgbc 6646 



<210> 18 
<211> 6670 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVTU012x/s conbaining HIV genes 
<400> 18 

bcgcgcgbbb cggtgabgac ggtgaaaacc bcbgacacat gcagcbcccg gagacggtca 60 
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cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgbaaggag aaaataccgc atcagattgg 240 
cbatbggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaatbac 360 
ggggtcabba gttcatagcc cabatabgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgbcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggbag gcgtgbacgg tgggaggbct abataagcag 900 
agcbcgtbta gbgaaccgtc agatcgccbg gagacgccab ccacgctgbt ttgaccbcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggab 102 0 
tccccgtgcc aagagtgacg taagtaccgc chatagactc tataggcaca ccccbttggc 1080 
tcbtatgcat gctabactgt ttttggcttg gggcctabac acccccgctt ccttatgcta 1140 
ta 99tgabgg batagcbbag cctataggtg bgggttattg accabtattg accactcccc 1200 
tabtggbgac gabactbbcc abbacbaabc cataacatgg ctctttgcca caactatctc 1260 
battggctat atgccaabac bcbgtcctbc agagacbgac acggacbctg babbtbbaca 1320 
ggatggggtc ccabttatta bbbacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagbttbt afcbaaacaba gcgbgggabc tccacgcgaa tchcgggbac gbgbtccgga 1440 
cabgggctct bcbccggtag cggcggagct bccacatccg agccctggtc ccatgccbcc 1500 
agcggctcat ggbcgcbcgg cagcbccttg cbcctaacag bggaggccag acbtaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg bggcggtagg gbabgtgtcb 1620 
gaaaatgagc gbggagabbg ggcbcgcacg gcbgacgcag atggaagact baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagttgtt gbattcbgab aagagbcaga ggbaacbccc 1740 
gttgcggtgc bgbtaacggt ggagggcagb gbagbcbgag cagtacbcgt bgctgccgcg 1800 
cgcgccacca gacataatag ctgacagacb aacagacbgb bcctttccat gggtcbtbbc 1860 
tgcagbcacc gbcgtcgaca cgbgtgabca gababcgcgg ccgctcbaga caccatgcgc 1920 
gtgaaggaga agbaccagca cctgtggcgc tggggcbggc gcbggggcac cabgctgcbg 1980 
ggcabgctga tgatctgcag cgccaccgag aagctgtggg bgaccgbgta ctacggcgtg 204 0 
cccgtgtgga aggaggccac caccacccbg ctcbgcgcca gcgacgccaa ggcctacgac 2100 
accgaggbgc acaacgbgtg ggccacccac gcctgcgbgc ccaccgaccc caacccccag 2160 
gaggbggbgc tggtgaacgt gaccgagaac ttcgacatgb ggaagaacga catggtggag 2220 
cagatgcacg aggacabcab cagccbgtgg gaccagagcc bgaagcccbg cgbgaagctg 22 80 
acccccctgt gcgtgagccb gaagbgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgabcat ggagaagggc gagatcaaga actgcagcbt caacatcagc 2400 
accagcatcc gcggcaaggb gcagaaggag fcacgccttct tctacaagct ggacatcatc 2460 
cccabcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgbgatcacc 2520 
caggccbgcc ccaaggbgag cbbcgagccc atccccaacc acbacbgcgc ccccgccggc 2 580 
bbcgccabcc bgaagbgcaa ggacaagaag ttcaacggca agggccccbg caccaacgtg 2640 
agcaccgbgc agbgcaccca cggcatccgc cccgtggbga gcacccagcb gcbgcbgaac 2 700 
ggbagccbgg ccgaggagga ggbggtgabc cgcagcgcta acttcgccga caacgccaag 2760 
gbgatcabcg bgcagcbgaa cgagagcgbg gagatcaact gcacccgccc caacaacaac 2820 
acccgcaaga gcatccacab cggccccggc cgcgcctbcb acaccaccgg cgagatcatc 2880 
ggcgacabcc gccaggccca cbgcaacctg agccgcgcca agtggaacga cacccbgaac 2940 
aagabcgbga bcaagcbgcg cgagcagbbc ggcaacaaga ccabcgtgtt caagcacagc 3000 
agcggcggcg accccgagab cgbgacccac agcbbcaact gcggcggcga gbbcbtcbac 3 060 
tgcaacagca cccagcbgtb caacagcacc bggbbcaaca gcacctggag caccgagggc 3120 
agcaacaaca ccgagggcag cgacaccabc accctgcccb gccgcabcgc bagcagcaac 3180 
aacgagagcg agabcbbccg bccgggcggc ggcgacabgc gcgacaacbg gcgcagcgag 3240 
cbgbacaagb acaaggbggb gaagabcgag ccccbgggcg tggcccccac caaggccaag 3300 
cbbaccgbcc aggcccgcca gcbgcbgagc ggcabcgbgc agcagcagaa caaccbgcbg 3360 
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cgcgccatcg aggcccagca gcacctgctg 
caggcccgca ccctggccgt ggagcgctac 
bggaaccaca ccacctggat ggagtgggac 
cacagccbga tcgaggagag ccagaaccag 
ctggacaagt gggccagcct gtggaactgg 
tccagatctg ctgtgccbbc tagttgccag 
tccttgaccc tggaaggtgc cactcccact 
tcgcattgtc tgagbaggtg tcattctatt 
ggggaggatt gggaagacaa tagcaggcat 
caggbgcbga agaattgacc cggtbccbcc 
ctctgtgaca caccctgtcc acgcccctgg 
tcatagctca ggagggctcc gccttcaatc 
ccctccctca tcagcccacc aaaccaaacc 
aagabaggcb attaagtgca gagggagaga 
gagaaatcab agaattttaa ggccatcatg 
ctcgctgcgc tcggtcgttc ggctgcggcg 
acggbtabcc acagaatcag gggataacgc 
aaaggccagg aaccgtaaaa aggccgcgtt 
tgacgagcat cacaaaaatc gacgctcaag 
aagataccag gcgtttcccc ctggaagctc 
gcttaccgga tacctgtccg ccbbbcbccc 
acgctgtagg tatctcagtt cggtgtaggt 
accccccgtt cagcccgacc gctgcgcctt 
ggtaagacac gacbbabcgc cactggcagc 
gtatgtaggc ggtgctacag agtbctbgaa 
aacagbabbb ggtatctgcg ctctgctgaa 
ctcbbgatcc ggcaaacaaa ccaccgctgg 
gathacgcgc agaaaaaaag gatctcaaga 
cgcbcagbgg aacgaaaact cacgttaagg 
cttcacctag atccttttaa attaaaaatg 
gtaaacttgg tctgacagtt accaatgctt 
tctatttcgt bcabccabag tbgcctgacb 
tgaagaaggt gttgctgact cataccaggc 
gagggagcca cggttgatga gagctttgtt 
ttgctttgcc acggaacggt ctgcgttgtc 
agcaaaagtt cgatttattc aacaaagccg 
cagtgbbaca accaattaac caattctgat 
bgcaatbbab tcatatcagg attatcaata 
gaaggagaaa actcaccgag gcagttccat 
attccgactc gtccaacatc aatacaacct 
tcaagbgaga aabcaccabg agtgacgact 
abbbcbbbcc agacttgttc aacaggccag 
tcaaccaaac cgttattcat tcgbgabbgc 
ttaaaaggac aabbacaaac aggaatcgaa 
tcaacaatat tttcacctga atcaggatat 
gggatcgcag tggtgagtaa ccatgcatca 
ggaagaggca baaabbccgb cagccagttt 
gcaacgctac cttfcgccatg bbbcagaaac 
cgatagattg bcgcaccbga htgcccgaca 
tcagcatcca tgttggaafct taatcgcggc 
ctcataacac cccbbgbabt actgtttatg 
abatbbbbab cbbgbgcaat gtaacatcag 
cccccccatt abbgaagcab ttatcagggt 
bgbabbbaga aaaabaaaca aabaggggbt 
gacgtctaag aaaccatbab tatcatgaca 



cagctgaccg tgbggggcah caagcagctg 3420 
ctgaaggacc agcagcagct cgagcagatc 3480 
cgcgagatca acaactacac cagcctgatc 3540 
caggagaaga acgagcagga gctgctggag 3600 
ttcaacatca ccaactggct gtggtgagga 3660 
ccatctgttg bbbgccccbc ccccgbgccb 3720 
gtcctttcct aahaaaabga ggaaattgca 3780 
ctggggggtg gggtggggca ggacagcaag 3840 

gctggggatg cggtgggctc tatgggbacc 3900 
tgggccagaa agaagcaggc acatcccctt 3960 
ttcttagttc cagccccact cataggacac 4020 
ccacccgcta aagtacttgg agcggtctct 4080 
tagcctccaa gagtgggaag aaattaaagc 4140 
aaatgcctcc aacatgtgag gaagtaatga 4200 
gccttaatct tccgcttcct cgctcactga 4260 
agcggtatca gctcactcaa aggcggtaat 4320 
aggaaagaac atgtgagcaa aaggccagca 43 8 0 
gctggcgttt ttccataggc tccgcccccc 4440 
tcagaggtgg cgaaacccga caggactata 4500 
ccfccgtgcgc tctcctgttc cgaccctgcc 4560 
ttcgggaagc gbggcgcttt cbcabagcbc 4620 
cgbtcgcbcc aagcbgggct gbgbgcacga 4 68 0 
abccggbaac batcgtctbg agtccaaccc 4740 
agccacbggb aacaggatba gcagagcgag 4 800 
gbggbggcct aacbacggct acacbagaag 4 860 
gccagtbacc tbcggaaaaa gagbbggbag 4920 
bagcggbggt bbbbbbgbbb gcaagcagca 4980 
agabccbbbg abcbbtbcba cggggbcbga 5040 
gabbbbggbc abgagatbab caaaaaggat 5100 
aagbbbbaaa bcaabctaaa gbatabatga 5160 
aabcagbgag gcacctabcb cagcgabcbg 5220 

cggggggggg gggcgcbgag gtcbgcctcg 5280 

cbgaabcgcc ccabcabcca gccagaaagb 5340 
gbaggbggac cagbtggtga tttbgaacbb 5400 
gggaagabgc gbgabctgab ccttcaacbc 5460 
ccgtcccgbc aagtcagcgb aatgcbcbgc 552 0 
tagaaaaact cabcgagcab caaabgaaac 5580 
ccatabbbbt gaaaaagccg bttcbgbaab 5640 
aggatggcaa gabcctggba bcggbcbgcg 5700 
abbaabtbcc ccbcgtcaaa aabaaggbba 5760 
gaabccggbg agaabggcaa aagctbabgc 5820 
ccabtacgcb cgtcatcaaa abcactcgca 5880 
gccbgagcga gacgaaatac gcgabcgcbg 5940 
bgcaaccggc gcaggaacac bgccagcgca 6000 
bcbtctaaba ccbggaabgc bgbbbbcccg 6060 
bcaggagbac ggabaaaabg cbtgabggtc 6120 
agbcbgacca tcbcabcbgb aacabcabtg 6180 
aacbcbggcg cabcgggctb cccabacaab 6240 
bbabcgcgag cccabbbaba cccabataaa 6300 
cbcgagcaag acgbbtcccg bbgaatabgg 6360 
baagcagaca gbbbbatbgt bcabgabgab 6420 
agabbbbgag acacaacgbg gcbtbccccc 6480 
babbgbcbca bgagcggaba catabbbgaa 6540 
ccgcgcacab btccccgaaa agbgccaccb '6600 
bbaaccbata aaaabaggcg babcacgagg 6660 
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tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gbaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accabatgcg gtgtgaaaba ccgcacagat gcgtaaggag aaaataccgc abcagabbgg 240 
ctattggcca bbgcabacgt tgtatccata bcataatatg tacatttata ttggctcatg 300 
bccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaatbac 360 
ggggtcatta gttcatagcc catatatgga gbbccgcgbb acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgbcaatgg gbggagbatt tacggtaaac 540 
tgcccacttg gcagbacabc . aagtgbatca batgccaagt acgcccccta ttgacgtcaa 600 
bgacggbaaa bggcccgcct ggcattatgc ccagtacatg acctbabggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc babbaccabg gtgatgcggt bbbggcagta 720 
cabcaabggg cgtggatagc ggtttgactc acggggattt ccaagtcbcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac bbtccaaaab gtcgtaacaa 840 
ctccgcccca btgacgcaaa tgggcggtag gcgtgtacgg tgggaggtcb atabaagcag 900 
agctcgttta gtgaaccgbc agatcgcctg gagacgccat ccacgcbgtt tbgacctcca 960 
tagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcattg gaacgcggab 102 0 
tccccgtgcc aagagtgacg baagbaccgc ctatagactc tataggcaca cccctbtggc 1080 
bcbtabgcab gctatactgb ttbbggcttg gggcctatac acccccgccb ccbtatgcba 1140 
taggtgatgg tatagcttag cctabaggtg bgggtbabtg accabbattg accactcccc 1200 
battggbgac gabacbttcc atbactaabc cataacabgg ctctbtgcca caactabcbc 1260 
tabbggctab abgccaatac bcbgbccbbc agagacbgac acggactctg tabtttbaca 1320 
ggatggggbc ccabtbatba btbacaaatb cacatataca acaacgccgb cccccgbgcc 1380 
cgcagbbbtb atbaaacaba gcgtgggatc bccacgcgaa tctcgggtac gtgbtccgga 1440 
cabgggctcb bcbccggbag cggcggagcb bccacatccg agccctggbc ccabgccbcc 1500 
agcggctcab ggbcgcbcgg cagctccbbg cbccbaacag bggaggccag actbaggcac 1560 
agcacaatgc ccaccaccac cagtgbgccg cacaaggccg tggcggbagg gtatgbgbcb 1620 
gaaaabgagc gtggagabbg ggctcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbtgbb gtabbctgat aagagtcaga ggtaactccc 1740 
gtbgcggtgc bgbbaacggb ggagggcagb gtagbctgag cagbacbcgt tgctgccgcg 1800 
cgcgccacca gacabaabag cbgacagact aacagactgb tcctbbccab gggbcbbttc 1860 
bgcagtcacc gbcgbcgaca cgbgbgatca gatabcgcgg ccgcbcbaga caccatgcgc 1920 
gbgaaggaga agbaccagca cctgtggcgc tggggcbggc gctggggcac catgcbgctg 1980 
ggcatgcbga bgatcbgcag cgccaccgag aagcbgbggg tgaccgbgba ctacggcgtg 2040 
cccgbgbgga aggaggccac caccacccbg cbctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgbg ggccacccac gcctgcgbgc ccaccgaccc caacccccag 2160 
gaggtggtgc bggtgaacgt gaccgagaac ttcgacabgt ggaagaacga catggtggag 2220 
cagatgcacg aggacabcab cagcctgtgg gaccagagcc bgaagccctg cgbgaagcbg 2280 
acccccctgb gcgtgagccb gaagbgcacc gaccbgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgabcat ggagaagggc gagabcaaga actgcagctt caacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag bacgccbbct tctacaagcb ggacabcabc 2460 
cccabcgaca acgacaccac cagcbacagc ctgaccagcb gcaacaccag cgbgatcacc 2520 
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caggcctgcc ccaaggtgag cttcgagccc 
bbcgccabcc tgaagtgcaa ggacaagaag 
agcaccgtgc agtgcaccca cggcatccgc 
ggbagccbgg ccgaggagga ggbggbgabc 
gbgatcabcg hgcagctgaa cgagagcgtg 
acccgcaaga gcabccacab cggccccggc 
ggcgacatcc gccaggccca ctgcaacctg 
aagatcgtga tcaagctgcg cgagcagttc 
agcggcggcg accccgagat cgtgacccac 
tgcaacagca cccagctgtt caacagcacc 
agcaacaaca ccgagggcag cgacaccatc 
aacatgtggc agaaggtggg caaggccatg 
bgcagcagca acatcaccgg ccbgcbgcbg 
agcgagatcb tccgtccggg cggcggcgcb 
ctgctgagcg gcabcgbgca gcagcagaac 
cacctgctgc agctgaccgt gtggggcatc 
gagcgctacc tgaaggacca gcagcagctc 
gagtgggacc gcgagatcaa caactacacc 
cagaaccagc aggagaagaa cgagcaggag 
bggaactggb bcaacabcac caactggctg 
agttgccagc catctgttgt ttgcccctcc 
actcccactg tcctttccta ataaaatgag 
cattctattc tggggggtgg ggtggggcag 
agcaggcatg ctggggatgc ggtgggctct 
ggtbccbcct gggccagaaa gaagcaggca 
cgcccctggt tcttagttcc agccccactc 
ccbbcaatcc cacccgctaa agtacttgga 
aaccaaacct agcctccaag agtgggaaga 
agggagagaa aatgcctcca acatgbgagg 
gccabcabgg ccbbaabcbb ccgcbbccbc 
gctgcggcga gcggtatcag ctcactcaaa 
ggataacgca ggaaagaaca tgtgagcaaa 
ggccgcgttg cbggcgtttt bccabaggcb 
acgcbcaagt cagaggtggc gaaacccgac 
tggaagcbcc cbcgbgcgcb cbcctgttcc 
ctbbctcccb bcgggaagcg bggcgcttbc 
ggbgbaggbc gbbcgcbcca agcbgggcbg 
ctgcgccbba bccggbaacb abcgbcbtga 
acbggcagca gccacbggta acaggabtag 
gbbcbtgaag tggtggccta acbacggcba 
tcbgctgaag ccagbbaccb bcggaaaaag 
caccgcbggt agcggbggtb tbbbbgbbbg 
abcbcaagaa gabccbbbga bcbbtbcbac 
acgbbaaggg abbtbggbca bgagabbabc 
tbaaaaatga agttbbaaab caabctaaag 
ccaabgcbba abcagbgagg caccbabcbc 
bgccbgacbc gggggggggg ggcgctgagg 
abaccaggcc bgaabcgccc cabcatccag 
agctbbgtbg baggbggacc agbbggbgab 
bgcgbbgtcg ggaagabgcg bgatctgatc 
acaaagccgc cgbcccgbca agtcagcgba 
aabbcbgabb agaaaaacbc abcgagcabc 
bbabcaabac cabatbtbtg aaaaagccgb 
cagbbccaba ggabggcaag atccbggbab 
abacaaccba bbaabbbccc cbcgbcaaaa 



abccccaacc acbacbgcgc ccccgccggc 2580 
tbcaacggca agggcccctg caccaacgbg 2640 
cccgbggbga gcacccagcb gctgcbgaac 2700 
cgcagcgcba acbbcgccga caacgccaag 2760 
gagabcaact gcacccgccc caacaacaac 2820 
cgcgccbtcb acaccaccgg cgagabcabc 2880 
agccgcgcca agbggaacga cacccbgaac 2940 
ggcaacaaga ccabcgbgbb caagcacagc 3000 
agcbbcaact gcggcggcga gbbcbbctac 3060 
bggttcaaca gcaccbggag caccgagggc 3120 
acccbgcccb gccgcabcaa gcagabcabc 3180 
bacgcccccc ccatcagcgg ccagatccgc 3240 
acccgcgacg gcggcaacag caacaacgag 3300 
agcgccaagc btaccgbcca ggcccgccag 3360 
aaccbgcbgc gcgccatcga ggcccagcag 3420 
aagcagcbgc aggcccgcac ccbggccgbg 3480 
gagcagabcb ggaaccacac caccbggabg 3540 
agccbgatcc acagcctgab cgaggagagc 3600 
cbgcbggagc bggacaagtg ggccagcctg 3 660 
tggbgaggab ccagabcbgc tgbgccbbcb 3720 
cccgbgccbb ccbbgacccb ggaaggtgcc 3780 
gaaabbgcab cgcabbgbcb gagbaggbgb 3840 
gacagcaagg gggaggabbg ggaagacaat 3900 
abgggbaccc aggtgcbgaa gaabbgaccc 3960 
cabccccbbc bcbgbgacac acccbgbcca 4020 
abaggacacb cabagctcag gagggcbccg 4080 
gcggbcbcbc ccbcccbcat cagcccacca 4140 
aabbaaagca agabaggcba bbaagtgcag 4200 
aagtaatgag agaaabcata gaatbtbaag 4260 
gctcacbgac bcgctgcgcb cggtcgbbcg 4320 
99 c ggbaaba cggbtabcca cagaabcagg 4380 
aggccagcaa aaggccagga accgbaaaaa 4440 
ccgccccccb gacgagcabc acaaaaabcg 4500 
aggactataa agataccagg cgtbbccccc 4560 
gaccctgccg cbtaccggab accbgbccgc 4620 
bcatagcbca cgcbgbaggb atcbcagbbc 4680 
bgbgcacgaa ccccccgtbc agcccgaccg 4740 
gbccaacccg gbaagacacg acbbabcgcc 4800 
cagagcgagg babgbaggcg gbgcbacaga 4860 
cactagaaga acagbabtbg gtatcbgcgc 4 920 
a 9btggbagc bcbbgabccg gcaaacaaac 4980 
caagcagcag abbacgcgca gaaaaaaagg 5040 
ggggbcbgac gcbcagtgga acgaaaacbc 5100 
aaaaaggatc bbcaccbaga bccbbbbaaa 5160 
babababgag baaacbbggb cbgacagbba 5220 
agcgatcbgb cbabbbcgbb cabccabagb 5280 
bcbgccbcgt gaagaaggbg bbgcbgacbc 5340 
ccagaaagbg agggagccac ggbbgabgag 54 00 
bbtgaacbbb bgcbbbgcca cggaacggbc 5460 
cbbcaacbca gcaaaagbbc gabbbabbca 5520 
abgctcbgcc agbgbbacaa ccaabbaacc 5580 
aaatgaaacb gcaabbbabb cababcagga 5640 
ttctgbaabg aaggagaaaa cbcaccgagg 5700 
cggbcbgcga bbccgacbcg bccaacabca 5760 
ataaggbbab caagbgagaa abcaccabga 5820 
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gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca gacttgttca 5880 
acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc gttattcatt 5940 
cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca attacaaaca 6000 
ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt ttcacctgaa 6060 
tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt ggtgagtaac 6120 
catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat aaattccgtc 6180 
agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc tttgccatgt 6240 
ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt cgcacctgat 6300 
tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat gttggaattt 6360 
aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc ccttgtatta 6420 
ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc ttgtgcaatg 6480 
taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta ttgaagcatt 6540 
tatcagggtt attgtctcat gagcggatac atatttgaab gtatttagaa aaataaacaa 6600 
ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga aaccathatt 6660 
atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 6709 



<210> 20 
<211> 6736 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 20 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 162 0 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
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gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgbbaacggb ggagggcagb gtagbcbgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacabaatag cbgacagacb aacagactgt tcctttccat gggbcttbtc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga atgcgcgtga 1920 
aggagaagta ccagcacctg tggcgctggg gctggcgctg gggcaccatg ctgctgggca 1980 
tgcbgatgat ctgcagcgcc accgagaagc tgbgggtgac cgtgtactac ggcgtgcccg 2040 
tgtggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc tacgacaccg 2100 
aggtgcacaa cghgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
tggtgctggt gaacgtgacc gagaacttcg acatgtggaa gaacgacatg gtggagcaga 2220 
tgcacgagga cabcabcagc cbgtgggacc agagcctgaa gccctgcgtg aagctgaccc 2280 
ccctgtgcgt gagcctgaag tgcaccgacg ctagcaagaa ctgcagcttc aacatcagca 2340 
ccagcatccg cggcaagghg cagaaggagt acgccttctt cbacaagcbg gacatcatcc 2400 
ccatcgacaa cgacaccacc agctacagcc tgaccagctg caacaccagc gtgatcaccc 2460 
aggcctgccc caaggbgagc btcgagccca tccccaacca ctactgcgcc cccgccggct 2520 
tcgccatcct gaagtgcaag gacaagaagt tcaacggcaa gggcccctgc accaacgtga 2580 
gcaccgtgca gtgcacccac ggcatccgcc ccgtggtgag cacccagctg ctgctgaacg 2640 
gtagcctggc cgaggaggag gtggtgatcc gcagcgctaa cttcgccgac aacgccaagg 2700 
tgatcatcgt gcagctgaac gagagcgtgg agatcaactg cacccgcccc aacaacaaca 2760 
cccgcaagag catccacatc ggccccggcc gcgccbbcta caccaccggc gagabcatcg 2820 
gcgacabccg ccaggcccac tgcaacctga gccgcgccaa gtggaacgac acccbgaaca 2880 
agatcgtgat caagctgcgc gagcagtbcg gcaacaagac cabcgbgttc aagcacagca 2 94 0 
gcggcggcga ccccgagatc gtgacccaca gcttcaactg cggcggcgag ttcttctact 3000 
gcaacagcac ccagcbgbtc aacagcaccb ggttcaacag cacctggagc accgagggca 3060 
gcaacaacac cgagggcagc gacaccatca cccbgccctg ccgcabcaag cagabcabca 3120 
acabgtggca gaaggtgggc aaggccatgh acgccccccc catcagcggc cagabccgcb 3180 
gcagcagcaa catcaccggc ctgcbgctga cccgcgacgg cggcaacagc aacaacgaga 3240 
gcgagatctt ccgtccgggc ggcggcgaca bgcgcgacaa ctggcgcagc gagctgbaca 3300 
agtacaaggb ggtgaagatc gagcccctgg gcgbggcccc caccaaggcc aagcttaccg 3360 
tccaggcccg ccagctgctg agcggcatcg tgcagcagca gaacaaccbg ctgcgcgcca 3420 
tcgaggccca gcagcacctg ctgcagctga ccgtgtgggg cabcaagcag ctgcaggccc 3480 
gcacccbggc cgbggagcgc baccbgaagg accagcagca gcbcgagcag abctggaacc 3 54 0 
acaccacctg gabggagbgg gaccgcgaga tcaacaacba caccagccbg atccacagcc 3 600 
tgabcgagga gagccagaac cagcaggaga agaacgagca ggagcbgcbg gagctggaca 3 660 
agtgggccag ccbgbggaac bggbtcaaca bcaccaacbg gcbgbggtga ggatccagab 3720 
ctgcbgtgcc tbctagbbgc cagccatcbg tbgtbbgccc cbcccccgtg ccbbccbbga 3780 
cccbggaagg tgccacbccc acbgtccttt ccbaabaaaa bgaggaaabt gcatcgcabt 3840 
gbcbgagbag gbgtcabbct attctggggg gtggggbggg gcaggacagc aagggggagg 3900 
atbgggaaga caatagcagg catgctgggg abgcggbggg ctctabgggt acccaggbgc 3960 
tgaagaabtg acccggbtcc bcctgggcca gaaagaagca ggcacabccc cttctcbgbg 4020 
acacaccctg tccacgcccc tggbtctbag tbccagcccc acbcatagga cactcabagc 4080 
tcaggagggc tccgcctbca abcccacccg cbaaagbacb tggagcggbc tcbcccbccc 4140 
tcabcagccc accaaaccaa acctagcctc caagagbggg aagaaabbaa agcaagabag 4200 
gcbabbaagb gcagagggag agaaaatgcc tccaacatgt gaggaagtaa bgagagaaab 4260 
cabagaabbt taaggccabg abbtaaggcc abcabggccb taatcbbccg cttcctcgct 4320 
cactgacbcg cbgcgcbcgg bcgbtcggct gcggcgagcg gbatcagcbc acbcaaaggc 4380 
ggtaabacgg bbatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 444 0 
ccagcaaaag gccaggaacc gtaaaaaggc cgcgbtgcbg gcgtbbttcc abaggcbccg 4500 
cccccctgac gagcabcaca aaaabcgacg cbcaagbcag aggbggcgaa acccgacagg 4560 
acbabaaaga baccaggcgt bbcccccbgg aagcbccctc gbgcgcbcbc cbgbtccgac 4620 
ccbgccgcbb accggabacc bgbccgccbt bctcccbbcg ggaagcgbgg cgcbbbctca 4680 
tagcbcacgc bgbaggbabc tcagbbcggb gbaggbcgtt cgcbccaagc bgggcbgtgb 4740 
gcacgaaccc cccgbtcagc ccgaccgctg cgccbbabcc ggtaacbabc gbctbgagbc 4 800 
caacccggba agacacgact batcgccacb ggcagcagcc actggtaaca ggabbagcag 4 860 
agcgaggbab gbaggcggbg cbacagagbb cbbgaagbgg bggccbaacb acggcbacac 4920 
tagaagaaca gbatbbggba bctgcgcbcb gcbgaagcca gbtaccbtcg gaaaaagagb 4980 
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tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 5040 
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 5100 
gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 5160 
aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 5220 
atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc 5280 
gatctgtcta tttcgttcat ccatagttgc ctgactcggg gggggggggc gctgaggtct 5340 
gcctcgtgaa gaaggtgttg ctgactcata ccaggcctga atcgccccat catccagcca 5400 
gaaagtgagg gagccacggt tgatgagagc tttgttgtag gtggaccagt tggtgatttt 5460 
gaacttttgc tttgccacgg aacggtctgc gttgtcggga agatgcgtga tctgatcctt 5520 
caactcagca aaagttcgat ttattcaaca aagccgccgt cccgtcaagt cagcgtaatg 5580 
ctctgccagt gttacaacca attaaccaat tctgattaga aaaactcatc gagcatcaaa 5640 
tgaaactgca atttattcat atcaggatta tcaataccat atttttgaaa aagccgtttc 5700 
tgtaatgaag gagaaaactc accgaggcag ttccatagga hggcaagatc ctggtatcgg 5760 
tctgcgattc cgactcgtcc aacatcaata caacctatta atttcccctc gtcaaaaata 5820 
aggttatcaa gtgagaaatc accatgagtg acgactgaat ccggtgagaa tggcaaaagc 58 80 
ttatgcattt ctttccagac ttgttcaaca ggccagccat tacgctcgtc atcaaaatca 5940 
ctcgcatcaa ccaaaccgtt attcattcgt gattgcgcct gagcgagacg aaabacgcga 6000 
tcgctgttaa aaggacaatt acaaacagga atcgaatgca accggcgcag gaacactgcc 6060 
agcgcatcaa caatatfcttc acctgaatca ggatattctt ctaatacctg gaatgctgtt 6120 
ttcccgggga tcgcagtggt gagtaaccat gcatcatcag gagtacggafc aaaatgchtg 6180 
atggtcggaa gaggcataaa ttccgtcagc cagtttagtc tgaccatctc atctgtaaca 6240 
tcattggcaa cgctaccttt gccafcgtttc agaaacaact ctggcgcatc gggcttccca 6300 
tacaahcgat agattgtcgc acctgattgc ccgacattat cgcgagccca tttataccca 6360 
tataaatcag catccafcgtt ggaatttaat cgcggcctcg agcaagacgt ttcccghtga 6420 
atatggctca taacacccct tgtattactg tttabgtaag cagacagttt hattgttcat 6480 
gatgatatat tfctfcatcttg tgcaatgtaa catcagagat tttgagacac aacgtggctt 6540 
tccccccccc cccattattg aagcatttat cagggttatt gtctcatgag cggatacata 6600 
tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg 6660 
ccacctgacg tctaagaaac cattattatc atgacattaa cctataaaaa taggcgtatc 6720 
acgaggccct tfccgtc 6736 



<210> 21 
<211> 6697 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 21 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
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cgtcaatggg agbttgbbbt ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agctcgttta gbgaaccgbc agatcgcctg 
tagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagtgacg taagtaccgc 
bcbtabgcat gctatactgt bbbbggcbbg 
taggtgatgg babagcttag ccbataggtg 
tattggtgac gatactttcc attactaatc 
tathggctat atgccaatac bcbgbccbbc 
99atggggtc ccatttatta tttacaaatt 
cgcagbbttb abbaaacafca gcgtgggatc 
catgggctcb tctccggfcag cggcggagct 
agcggctcat ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gbggagatbg ggctcgcacg 
gcagaagaag abgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag cbgacagacb 
tgcagtcacc gtcgtcgaca cgtgtgatca 
aggagaagta ccagcacctg tggcgctggg 
bgcbgabgab ctgcagcgcc accgagaagc 
tgtggaagga ggccaccacc accctgctct 
aggtgcacaa cgtgtgggcc acccacgcct 
bggbgcbggb gaacgtgacc gagaacttcg 
tgcacgagga catcatcagc cbgbgggacc 
ccctgtgcgt gagcctgaag bgcaccgacc 
gcggccgcab gatcatggag aagggcgaga 
gcaacaccag cgtgatcacc caggcctgcc 
actactgcgc ccccgccggc ttcgccatcc 
agggcccctg caccaacgtg agcaccgtgc 
gcacccagct gctgctgaac ggtagcctgg 
acttcgccga caacgccaag gbgabcatcg 
gcacccgccc caacaacaac acccgcaaga 
acaccaccgg cgagatcatc ggcgacatcc 
agtggaacga caccctgaac aagabcgbga 
ccatcgtgtt caagcacagc agcggcggcg 
gcggcggcga gttcttctac bgcaacagca 
gcacctggag caccgagggc agcaacaaca 
gccgcatcaa gcagatcatc aacatgtggc 
ccatcagcgg ccagatccgc bgcagcagca 
gcggcaacag caacaacgag agcgagabcb 
actggcgcag cgagctgtac aagtacaagg 
ccaccaaggc caagcbbacc gtccaggccc 
agaacaacct gctgcgcgcc atcgaggccc 
gcabcaagca gcbgcaggcc cgcacccbgg 
agctcgagca gafcctggaac cacaccaccb 
acaccagccb gabccacagc ctgatcgagg 
aggagctgcb ggagcbggac aagbgggcca 
ggcbgbggtg aggatccaga tcbgcbgtgc 
ccbcccccgb gccbtccbbg accctggaag 
abgaggaaab tgcatcgcab tgbcbgagba 
ggcaggacag caagggggag gabbgggaag 
gctcbabggg tacccaggbg cbgaagaabb 
aggcacabcc cctbcbcbgb gacacacccb 
cacbcabagg acactcabag ctcaggaggg 



tcaacgggac tttccaaaat gtcgtaacaa 840 
gcgtgtacgg tgggaggtct atataagcag 900 
gagacgccat ccacgcbgtb btgaccbcca 960 
cggccgggaa cggbgcattg gaacgcggab 1020 
cbabagacbc babaggcaca cccctbbggc 1080 
gggcctabac acccccgcbt ccttabgcba 1140 
bgggttattg accabbabbg accacbcccc 1200 
cataacabgg ctctttgcca caacbabcbc 1260 
agagactgac acggacbcbg batbbbbaca 1320 
cacabataca acaacgccgb cccccgbgcc 1380 
bccacgcgaa bcbcgggbac gbgbtccgga 14 4 0 
bccacabccg agcccbggtc ccatgcctcc 1500 
cbccbaacag tggaggccag acttaggcac 1560 
cacaaggccg tggcggtagg gbabgbgbct 1620 
gctgacgcag abggaagacb baaggcagcg 1680 
gbatbcbgab aagagbcaga ggbaacbccc 1740 
gbagbcbgag cagbactcgb bgcbgccgcg 1800 
aacagacbgb bcctbtccab gggtcbbbbc 1860 
gabatcgcgg ccgcbctaga atgcgcgbga 1920 
gcbggcgctg gggcaccabg ctgctgggca 1980 
bgbgggbgac cgbgbacbac ggcgbgcccg 2040 
gcgccagcga cgccaaggcc tacgacaccg 2100 
gcgbgcccac cgaccccaac ccccaggagg 2160 
acabgbggaa gaacgacabg gbggagcaga 2220 
agagccbgaa gcccbgcgbg aagctgaccc 2280 
tgaagaacga caccaacacc aacagcagca 2340 
bcaagaacbg cagcbbcgcb agcaccagcb 2400 
ccaaggtgag cbbcgagccc abccccaacc 2460 
bgaagbgcaa ggacaagaag bbcaacggca 2520 
agtgcaccca cggcabccgc cccgtggbga 2580 
ccgaggagga ggtggtgabc cgcagcgcba 2640 
bgcagctgaa cgagagcgbg gagabcaacb 2 700 
gcatccacab cggccccggc cgcgccbbcb 2760 
gccaggccca ctgcaaccbg agccgcgcca 2820 
bcaagcbgcg cgagcagbbc ggcaacaaga 2 880 
accccgagab cgbgacccac agcbtcaacb 2940 
cccagcbgtb caacagcacc bggbtcaaca 3000 
ccgagggcag cgacaccabc acccbgcccb 3060 
agaaggbggg caaggccatg bacgcccccc 3120 
acatcaccgg ccbgcbgcbg acccgcgacg 3180 
bccgtccggg cggcggcgac abgcgcgaca 3240 
bggbgaagab cgagccccbg ggcgbggccc 3300 
gccagcbgcb gagcggcabc gbgcagcagc 3360 
agcagcaccb gcbgcagcbg accgbgbggg 3420 
ccgbggagcg cbaccbgaag gaccagcagc 3480 
ggabggagtg ggaccgcgag abcaacaacb 3540 
agagccagaa ccagcaggag aagaacgagc 3 600 
gcctgbggaa cbggbtcaac abcaccaacb 3660 
cbbcbagbbg ccagccatcb gbbgtbtgcc 3720 
gbgccacbcc cactgbccbb bcctaabaaa 3780 
ggtgbcabbc babbcbgggg ggbggggbgg 3 840 
acaabagcag gcabgcbggg gabgcggbgg 3900 
gacccggbbc cbccbgggcc agaaagaagc 3960 
gbccacgccc cbggbbctba gbbccagccc 4020 
cbccgccbbc aabcccaccc gcbaaagtac 4080 
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ktggagcggt ctctccctcc ctcatcagcc 
gaagaaatta aagcaagafca ggctattaag 
bgaggaagba abgagagaaa tcatagaafcb 
bbaabcbbcc gcttcctcgc tcactgactc 
ggbabcagct cactcaaagg cggtaatacg 
aaagaacatg tgagcaaaag gccagcaaaa 
ggcgbbbbbc cataggctcc gcccccctga 
gaggtggcga aacccgacag gactataaag 
cgtgcgctct cctgttccga ccctgccgcfc 
gggaagcgtg gcgcbbbcbc atagctcacg 
tcgctccaag cbgggcbgtg tgcacgaacc 
cggtaactat cgbcbbgagb ccaacccggt 
cactggtaac aggabbagca gagcgaggta 
gtggcctaac bacggcbaca ctagaagaac 
agttaccttc ggaaaaagag tbggtagcbc 
c 99tggthtt btbgttbgca agcagcagat 
tcctttgatc ttttctacgg ggtctgacgc 
tttggtcatg agattatcaa aaaggatctt 
bbbtaaabca atcbaaagba tatatgagta 
cagtgaggca cctabctcag cgatctgtct 
gggggggggg cgctgaggtc tgcctcgbga 
aatcgcccca tcatccagcc agaaagtgag 
ggtggaccag bbggtgattb tgaacttttg 
aagatgcgtg abcbgabccb tcaactcagc 
tcccgbcaag tcagcgtaat gctchgccag 
aaaaactcat cgagcatcaa atgaaactgc 
tatttttgaa aaagccgttb ctgtaatgaa 
atggcaagat ccbggtatcg gtctgcgatt 
aatttcccct cgtcaaaaat aaggttatca 
tccggtgaga atggcaaaag cttatgcatt 
ttacgctcgt catcaaaatc actcgcatca 
tgagcgagac gaaatacgcg atcgctgtta 
aaccggcgca ggaacactgc cagcgcatca 
bcbaataccb ggaabgcbgb tttcccgggg 
ggagtacgga taaaatgctt gatggtcgga 
ctgaccatct catctgtaac atcathggca 
tctggcgcat cgggcbbccc abacaabcga 
tcgcgagccc atttataccc atabaaabca 
gagcaagacg bbbcccgbbg aabatggcbc 
gcagacagtt ttattgttca tgatgataba 
btbbgagaca caacgbggcb bbcccccccc 
bgbctcabga gcggabacab atbbgaatgb 
cgcacattbc cccgaaaagb gccaccbgac 
accbabaaaa abaggcgtab cacgaggccc 



caccaaacca aaccbagccb ccaagagtgg 4140 
bgcagaggga gagaaaabgc cbccaacabg 4200 
bbaaggccat gabbbaaggc cabcatggcc 4260 
gcbgcgcbcg gbcgbtcggc bgcggcgagc 4320 
gbbabccaca gaabcagggg ataacgcagg 4380 
ggccaggaac cgbaaaaagg ccgcgbtgcb 4440 
cgagcabcac aaaaabcgac gcbcaagbca 4500 
abaccaggcg bbtccccctg gaagcbcccb 4560 
baccggabac cbgbccgccb btcbccctbc 4620 
ctgbaggbat ctcagbtcgg tgtaggbcgb 4680 
ccccgbbcag cccgaccgcb gcgccbbatc 4740 
aagacacgac btatcgccac bggcagcagc 4 800 
bgbaggcggb gctacagagb tctbgaagbg 4 860 
agtatttggt abcbgcgcbc tgcbgaagcc 4920 
btgatccggc aaacaaacca ccgcbggbag 4980 
bacgcgcaga aaaaaaggat ctcaagaaga 5040 
tcagbggaac gaaaacbcac gbbaagggab 5100 
caccbagatc cbtbbaaabt aaaaabgaag 5160 
aactbggtcb gacagbtacc aatgcbbaab 5220 
abtbcgbbca bccabagttg ccbgacbcgg 52 80 
agaaggbgtb gcbgacbcat accaggccbg 5340 
ggagccacgg tbgatgagag cbbbgbbgba 54 00 
cbbtgccacg gaacggbcbg cgtbgtcggg 5460 
aaaagbbcga ttbabbcaac aaagccgccg 5520 
tgbbacaacc aabbaaccaa bbctgabbag 5580 
aatbbabbca tabcaggabb abcaatacca 5640 
ggagaaaact caccgaggca gtbccabagg 5700 
ccgacbcgbc caacatcaab acaaccbatt 5760 
agtgagaaab caccatgagb gacgacbgaa 5 820 
bctbbccaga cbbgbtcaac aggccagcca 5880 
accaaaccgb tabbcabbcg tgatbgcgcc 5940 
aaaggacaab bacaaacagg aabcgaabgc 6000 
acaababbbt cacctgaabc aggababbcb 6060 
atcgcagtgg bgagbaacca bgcatcabca 6120 
agaggcabaa abbccgbcag ccagbtbagb 6180 
acgcbaccbb bgccabgbbb cagaaacaac 6240 
tagabbgbcg caccbgabbg cccgacabba 6300 
gcabccabgt bggaabbbaa bcgcggccbc 6360 
abaacacccc btgbabbacb gttbabgtaa 6420 
bbbbtatcbb gtgcaabgta acabcagaga 6480 
ccccabbabb gaagcabbta tcagggtbab 6540 
abbbagaaaa abaaacaaab aggggbtccg 6600 
gbcbaagaaa ccabbabbab cabgacabba 6660 
tbbcgbc 6697 



<210> 22 
<211> 6712 
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<220> 
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<400> 22 . 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagcbcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
bbggcgggbg tcggggctgg cbbaactabg cggcatcaga gcagatbgba ctgagagtgc 180 
accababgcg gbgbgaaaba ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca btgcabacgb bgbabccaba tcataatatg bacabbtaba ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgacfcagt babbaabagb aatcaattac 360 
ggggtcatta gttcatagcc catatatgga. gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaabaabga cgtatgttcc 480 
catagtaacg ccaataggga cbbbccatbg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccba ttgacgtcaa 600 
tgacggtaaa tggcccgccb ggcabtabgc ccagtacatg accbbabggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
cabcaabggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agbbtgbbbb ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa bgggcggbag gcgtgtacgg tgggaggtct ababaagcag 900 
agctcgttta gtgaaccgtc agabcgccbg gagacgccat ccacgcbgbb bbgaccbcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggbgcatbg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgb tbbbggcbbg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg babagcbbag cctataggtg bgggbbabbg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg cbcbbbgcca caactatchc 1260 
tattggctat atgccaabac tcbgtccbbc agagactgac acggacbcbg battttbaca 1320 
ggabggggbc ccabbbabta tbbacaaabt cacabataca acaacgccgb cccccgtgcc 1380 
cgcagbbbbb abbaaacaba gcgbgggatc bccacgcgaa tcbcgggtac gtgbbccgga 1440 
catgggcbcb bcbccggbag cggcggagcb bccacabccg agccctggbc ccabgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbccbbg cbccbaacag tggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgbcb 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagtbgbb gbabbcbgab aagagbcaga ggbaacbccc 1740 
gbbgcggbgc bgbbaacggb ggagggcagb gtagbcbgag cagbacbcgb bgcbgccgcg 1800 
cgcgccacca gacabaabag cbgacagacb aacagacbgb tccbbtccab gggbcbbbbc 1860 
bgcagbcacc gbcgbcgaca cgbgbgabca gababcgcgg ccgcbcbaga abgcgcgbga 192 0 
aggagaagba ccagcaccbg bggcgcbggg gcbggcgcbg gggcaccabg cbgcbgggca 1980 
tgcbgabgab cbgcagcgcc accgagaagc tgbgggbgac cgbgbacbac ggcgbgcccg 2040 
bgbggaagga ggccaccacc acccbgcbcb gcgccagcga cgccaaggcc bacgacaccg 2100 
aggbgcacaa cgbgbgggcc acccacgccb gcgbgcccac cgaccccaac ccccaggagg 2160 
tggbgcbggb gaacgbgacc gagaacbbcg acabgbggaa gaacgacabg gbggagcaga 2220 
bgcacgagga cabcabcagc cbgbgggacc agagcctgaa gcccbgcgbg aagcbgaccc 2280 
cccbgbgcgt gagccbgaag bgcaccgacc bgaagaacga caccaacacc aacagcagca 2340 
gcggccgcab gabcabggag aagggcgaga bcaagaacbg cagcbtcaac abcagcacca 2400 
gcabccgcgg caaggbgcag aaggagbacg ccbbcbtcba caagcbggac abcabcccca 2460 
bcgacaacga caccaccagc bacagccbga ccagcbgcaa caccagcgbg abcacccagg 2520 
cctgccccaa ggbgagcbbc gagcccabcc ccaaccacba cbgcgccccc gccggcbbcg 2580 
ccabccbgaa gbgcaaggac aagaagbbca acggcaaggg ccccbgcacc aacgbgagca 2640 
ccgbgcagbg cacccacggc abccgccccg bggbgagcac ccagctgcbg cbgaacggba 2700 
gccbggccga ggaggaggbg gbgabccgca gcgctaacbb cgccgacaac gccaaggbga 2760 
bcabcgbgca gcbgaacgag agcgbggaga bcaacbgcac ccgcgcbagc gcccacbgca 2820 
accbgagccg cgccaagbgg aacgacaccc tgaacaagab cgbgatcaag cbgcgcgagc 2880 
agbbcggcaa caagaccabc gbgbtcaagc acagcagcgg cggcgacccc gagabcgbga 2940 
cccacagcbb caacbgcggc ggcgagbbct bcbacbgcaa cagcacccag cbgtbcaaca 3000 
gcacctggbb caacagcacc bggagcaccg agggcagcaa caacaccgag ggcagcgaca 3 060 
ccabcacccb gcccbgccgc abcaagcaga bcabcaacab gbggcagaag gtgggcaagg 3120 
ccabgtacgc cccccccabc agcggccaga bccgcbgcag cagcaacabc accggcctgc 3180 
bgcbgacccg cgacggcggc aacagcaaca acgagagcga gatcbtccgb ccgggcggcg 3240 
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gcgacatgcg cgacaactgg cgcagcgagc bgbacaagba caaggtggtg aagatcgagc 3300 
cccbgggcgt ggcccccacc aaggccaagc thaccgtcca ggcccgccag ctgctgagcg 3360 
gcatcgtgca gcagcagaac aacctgctgc gcgccatcga ggcccagcag cacctgctgc 3420 
agctgaccgt gtggggcatc aagcagctgc aggcccgcac cctggccgtg gagcgctacc 3480 
tgaaggacca gcagcagctc gagcagatct ggaaccacac cacctggatg gagtgggacc 3540 
gcgagabcaa caachacacc agcctgatcc acagcctgat cgaggagagc cagaaccagc 3600 
aggagaagaa cgagcaggag ctgctggagc bggacaagbg ggccagcctg tggaactggt 3660 
tcaacatcac caactggctg tggtgaggat ccagatctgc bgbgccbbcb agttgccagc 3720 
cabcbgbbgb ttgcccctcc cccgtgcctt ccbbgacccb ggaaggtgcc actcccactg 3780 
bccbbbccta ataaaatgag gaaabbgcab cgcahtgtch gagtaggtgt cattctattc 3840 
tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg 3900 
cbggggabgc ggtgggctct atgggtaccc aggtgctgaa gaabbgaccc ggtbccbcct 3960 
gggccagaaa gaagcaggca cabccccbbc tctgtgacac accctgtcca cgcccctggh 4020 
tcttagttcc agccccactc ataggacact catagctcag gagggchccg ccttcaatcc 4080 
cacccgctaa agtacttgga gcggtctctc cctccctcat cagcccacca aaccaaacct 4140 
agcctccaag agtgggaaga aattaaagca agataggcta bbaagbgcag agggagagaa 4200 
aatgcctcca acatgtgagg aagtaatgag agaaatcaba gaabbbtaag gccatgattt 4260 
aaggccatca tggccttaat cttccgcttc ctcgctcact gactcgctgc gctcggtcgt 4320 
tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc 4380 
a ggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa 4440 
aaaggccgcg bbgcbggcgb ttttccatag gctccgcccc cctgacgagc atcacaaaaa 4500 
tcgacgctca agbcagaggb ggcgaaaccc gacaggacta taaagatacc aggcgbttcc 4560 
ccctggaagc tccctcgtgc gcbcbccbgb bccgaccctg ccgcttaccg gatacctgtc 4620 
cgccbbbcbc ccttcgggaa gcgtggcgct bbcbcabagc tcacgctgta ggtatctcag 4680 
ttcggtgtag gbcgbbcgcb ccaagcbggg ctgtgtgcac gaaccccccg thcagcccga 4740 
ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgactbabc 4800 
gccactggca gcagccactg gtaacaggat tagcagagcg aggbabgbag gcggbgcbac 4860 
agagttcttg aagbggbggc ctaactacgg cbacacbaga agaacagtab bbggbabcbg 4920 
cgcbcbgcbg aagccagbba ccbtcggaaa aagagbbggb agcbcbbgab ccggcaaaca 4980 
aaccaccgcb ggbagcggbg gbbbbbbbgb bbgcaagcag cagabbacgc gcagaaaaaa 5040 
aggatcbcaa gaagabccbb bgabcbbbbc bacggggbcb gacgcbcagb ggaacgaaaa 5100 
cbcacgtbaa gggabbtbgg tcabgagabb abcaaaaagg abcbbcaccb agatccbbtb 5160 
aaabbaaaaa bgaagbbbba aatcaabcba aagbababab gagbaaacbb ggbcbgacag 5220 
bbaccaabgc btaatcagbg aggcaccbat ctcagcgatc bgtcbabbbc gtbcabccab 5280 
agbbgccbga cbcggggggg gggggcgcbg aggbcbgccb cgbgaagaag gtgbbgcbga 5340 
ctcabaccag gccbgaabcg ccccabcabc cagccagaaa gbgagggagc cacggbtgab 5400 
9 a gagcbbbg bbgbaggbgg accagbbggb gabbtbgaac bbbbgcbbbg ccacggaacg 5460 
gtctgcgtbg bcgggaagab gcgbgabcbg abccbbcaac bcagcaaaag bbcgabbbab 5520 
bcaacaaagc cgccgbcccg bcaagtcagc gbaabgctcb gccagbgbba caaccaabba 5580 
accaatbcbg abbagaaaaa ctcabcgagc abcaaabgaa actgcaabbb abbcababca 5640 
ggabbabcaa baccababbb btgaaaaagc cgbbtcbgba atgaaggaga aaacbcaccg 5700 
aggcagbbcc abaggabggc aagabccbgg babcggbcbg cgatbccgac bcgbccaaca 5760 
bcaabacaac cbabbaabbb ccccbcgbca aaaabaaggb babcaagbga gaaabcacca 5820 
bgagbgacga cbgaabccgg bgagaabggc aaaagcbbab gcabbbcbtb ccagacbbgb 5880 
bcaacaggcc agccabbacg cbcgbcabca aaabcacbcg cabcaaccaa accgbbabbc 5940 
abbcgbgabb gcgccbgagc gagacgaaab acgcgabcgc bgbbaaaagg acaabbacaa 6000 
acaggaabcg aabgcaaccg gcgcaggaac acbgccagcg cabcaacaab abbttcaccb 6060 
gaabcaggab abbcbbcbaa baccbggaab gcbgbbbtcc cggggatcgc agbggbgagb 6120 
aaccabgcab cabcaggagb acggabaaaa bgcbbgabgg bcggaagagg cabaaabbcc 6180 
gbcagccagb bbagbcbgac cabcbcabcb gbaacabcab bggcaacgcb accbbbgcca 6240 
bgbbbcagaa acaacbcbgg cgcabcgggc bbcccabaca abcgabagab bgbcgcaccb 6300 
gabbgcccga catbabcgcg agcccatbba bacccababa aabcagcabc catgbtggaa 6360 
bbbaabcgcg gccbcgagca agacgbtbcc cgtbgaabab ggcbcataac accccbbgba 6420 
bbacbgbbba bgbaagcaga cagbbbtabb gbtcabgabg abababtbbb abcbbgbgca 6480 
abgbaacabc agagabbbbg agacacaacg bggcbbbccc ccccccccca bbabbgaagc 6540 
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tacatatttg aatgtattta gaaaaataaa 6600 
aaagtgccac ctgacgtcta agaaaccatt 6660 
cgbatcacga ggccctttcg tc 6712 



<210> 23 
<211> 6715 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 23 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atcfcacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgafc ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacafca gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga atgcgcgtga 1920 
aggagaagta ccagcacctg tggcgctggg gctggcgctg gggcaccatg ctgctgggca 1980 
tgctgatgat ctgcagcgcc accgagaagc tgtgggtgac cgtgtactac ggcgtgcccg 2 040 
tgtggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc tacgacaccg 2100 
aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
tggtgctggt gaacgtgacc gagaacttcg acatgtggaa gaacgacatg gtggagcaga 2220 
tgcacgagga catcatcagc ctgtgggacc agagcctgaa gccctgcgtg aagctgaccc 2280 
ccctgtgcgt gagcctgaag tgcaccgacc tgaagaacga caccaacacc aacagcagca 2 340 
gcggccgcat gafccatggag aagggcgaga tcaagaactg cagcttcaac atcagcacca 2400 
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gcatccgcgg caaggbgcag aaggagtacg ccttcttcta caagcfcggac atcatcccca 24 60 
bcgacaacga caccaccagc tacagcctga ccagctgcaa caccagcgtg atcacccagg 2520 
cctgccccaa ggtgagcttc gagcccatcc ccaaccacta cbgcgccccc gccggcttcg 2580 
ccatcctgaa gtgcaaggac aagaagttca acggcaaggg ccccbgcacc aacgtgagca 2640 
ccgtgcagtg cacccacggc abccgccccg tggtgagcac ccagcbgcbg cbgaacggba 2700 
gcctggccga ggaggaggtg gtgatccgca gcgctaactt cgccgacaac gccaaggtga 2760 
bcatcgbgca gctgaacgag agcgtggaga tcaactgcac ccgccccaac aacaacaccc 2820 
gcaagagcat ccacatcggc cccggccgcg ccttctacac caccggcgag atcabcggcg 2880 
acatccgcca ggcccactgc aacctgagcc gcgccaagtg gaacgacacc ctgaacaaga 294 0 
bcgbgabcaa gcbgcgcgag cagbbcggca acaagaccat cgtgttcaag cacagcagcg 3000 
gcggcgaccc cgagatcgtg acccacagct tcaactgcgg cggcgagttc bbcbacbgca 3060 
acagcgctag ccbgcccbgc cgcatcaagc agatcabcaa catgtggcag aaggtgggca 3120 
aggccatgta cgcccccccc atcagcggcc agatccgctg cagcagcaac atcaccggcc 3180 
tgctgctgac ccgcgacggc ggcaacagca acaacgagag cgagatcttc cgtccgggcg 324 0 
gcggcgacat gcgcgacaac tggcgcagcg agctgtacaa gtacaaggtg gtgaagatcg 3300 
agcccctggg cgtggccccc accaaggcca agcttaccgt ccaggcccgc cagctgctga 3360 
gcggcatcgt gcagcagcag aacaacctgc tgcgcgccat cgaggcccag cagcacctgc 34 20 
tgcagctgac cgtgtggggc atcaagcagc tgcaggcccg cacccbggcc gbggagcgcb 34 80 
acctgaagga ccagcagcag ctcgagcaga tctggaacca caccacctgg abggagbggg 3540 
accgcgagat caacaactac accagcctga tccacagcct gatcgaggag agccagaacc 3600 
agcaggagaa gaacgagcag gagctgctgg agctggacaa gtgggccagc ctgtggaact 3 660 
ggttcaacat caccaactgg ctgtggtgag gatccagatc tgctgtgcct tctagttgcc 3720 
agccatctgt tgtttgcccc tcccccgbgc cbtccbtgac cctggaaggt gccactccca 3780 
ctgtcctttc cbaabaaaab gaggaaattg catcgcattg tctgagtagg bgtcabbcba 3840 
bbcbgggggg tggggtgggg caggacagca agggggagga ttgggaagac aatagcaggc 3900 
atgctgggga bgcggbgggc tctatgggta cccaggtgct gaagaabbga cccggttcct 3960 
cctgggccag aaagaagcag gcacatcccc ttctctgtga cacacccbgt ccacgccccb 4020 
ggtbcbbagh bccagcccca cbcabaggac acbcabagcb caggagggct ccgccbtcaa 4080 
bcccacccgc baaagtacbb ggagcggbcb cbccctcccb cabcagccca ccaaaccaaa 414 0 
ccbagccbcc aagagbggga agaaabbaaa gcaagabagg cbabtaagbg cagagggaga 4 200 
gaaaatgcct ccaacatgbg aggaagbaab gagagaaabc abagaatbbt aaggccabga 4260 
bbbaaggcca bcatggcctt aabcbbccgc bbcctcgcbc acbgactcgc bgcgcbcggt 4320 
cgbbcggcbg cggcgagcgg tabcagcbca cbcaaaggcg gbaabacggt babccacaga 4380 
atcaggggat aacgcaggaa agaacabgbg agcaaaaggc cagcaaaagg ccaggaaccg 4440 
taaaaaggcc gcgtbgcbgg cgbtbtbcca taggcbccgc cccccbgacg agcabcacaa 4500 
aaabcgacgc tcaagbcaga ggbggcgaaa cccgacagga ctataaagat accaggcgbb 4560 
bcccccbgga agctccctcg bgcgcbcbcc tgbbccgacc cbgccgcbba ccggataccb 4620 
gbccgccbbt cbcccbtcgg gaagcgbggc gcbbbcbcab agcbcacgcb gbaggbabcb 4680 
cagbtcggbg baggbcgttc gcbccaagcb gggcbgbgbg cacgaacccc ccgtbcagcc 4740 
cgaccgcbgc gccbbabccg gbaacbabcg bctbgagtcc aacccggbaa gacacgacbb 4800 
abcgccactg gcagcagcca cbggtaacag gatbagcaga gcgaggtabg taggcggbgc 4860 
bacagagbbc tbgaagbggb ggcctaacta cggcbacacb agaagaacag tabbtggbab 4920 
cbgcgcbcbg cbgaagccag bbaccbbcgg aaaaagagbb ggbagctcbt gatccggcaa 4980 
acaaaccacc gcbggbagcg gbggbbtttt bgtbbgcaag cagcagabba cgcgcagaaa 5040 
aaaaggabcb caagaagabc cbbbgatctb bbcbacgggg bcbgacgcbc agtggaacga 5100 
aaacbcacgb baagggabbb bggbcatgag abbabcaaaa aggabctbca ccbagabccb 5160 
bbbaaabbaa aaabgaagtb bbaaabcaab cbaaagbaba babgagbaaa cbbggbcbga 5220 
cagbbaccaa tgcbbaabca gbgaggcacc babcbcagcg atctgbcbab tbcgtbcabc 5280 
cabagbbgcc tgacbcgggg ggggggggcg cbgaggbcbg ccbcgbgaag aaggtgbbgc 534 0 
bgacbcabac caggccbgaa bcgccccabc abccagccag aaagbgaggg agccacggbb 5400 
gabgagagcb tbgbbgbagg bggaccagbb ggbgabbbbg aacbbbbgcb bbgccacgga 5460 
acggbcbgcg ttgbcgggaa gabgcgbgab cbgabccbbc aacbcagcaa aagbbcgabb 5520 
babbcaacaa agccgccgbc ccgbcaagbc agcgbaabgc bctgccagtg bbacaaccaa 5580 
bbaaccaatb cbgabtagaa aaacbcabcg agcabcaaat gaaacbgcaa tbbabbcaba 5640 
bcaggabbat caataccaba bbbbbgaaaa agccgbbbct gbaabgaagg agaaaacbca 5700 
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ccgaggcagt tccataggat ggcaagatcc tggtatcggt ctgcgabbcc gactcgtcca 5760 

acatcaatac aacctattaa tttcccctcg tcaaaaabaa ggttatcaag tgagaaatca 5820 

ccatgagtga cgactgaatc cggtgagaat ggcaaaagct tatgcatttc tttccagact 5880 

tgttcaacag gccagccatt acgctcgtca tcaaaatcac tcgcatcaac caaaccgtta 5940 

htcattcgtg attgcgcctg agcgagacga aatacgcgat cgctgttaaa aggacaatta 6000 

caaacaggaa tcgaatgcaa ccggcgcagg aacactgcca gcgcatcaac aatattttca 6060 

cctgaatcag gatattcttc taatacctgg aatgctgbtt tcccggggat cgcagtggtg 6120 

agtaaccatg catcatcagg agtacggata aaatgcttga tggtcggaag aggcataaat 6180 

tccgtcagcc agtttagtct gaccatctca tctgtaacat cattggcaac gctacctttg 6240 

ccatgtttca gaaacaactc tggcgcatcg ggcttcccah acaafccgata gattgtcgca 6300 

cctgattgcc cgacattatc gcgagcccat ttatacccat ataaatcagc atccatgttg 6360 

gaatttaatc gcggcctcga gcaagacgtt tcccgttgaa tatggctcat aacacccctt 6420 

gtabtactgt ttatgtaagc agacagtbbb abbgbbcabg atgabababb bbbabctbgb 6480 

gcaabgbaac abcagagabb tbgagacaca acgbggctbb cccccccccc ccabbabbga 6540 

agcabbtatc agggttabbg bctcatgagc ggabacatat btgaatgbab tbagaaaaab 6600 

aaacaaabag gggttccgcg cacatbbccc cgaaaagbgc cacctgacgt ctaagaaacc 6660 

abtabbabca tgacabtaac ctataaaaat aggcgtabca cgaggcccbt tcgtc 6715 



<210> 24 
<211> 6619 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 24 

bcgcgcgtbb cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatabgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctabbggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggbcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgccbggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagba 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tabaggcaca ccccbbtggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attacbaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 13 80 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccabgccbcc 1500 
agcggcbcat ggtcgctcgg cagctccttg cbcctaacag tggaggccag acbtaggcac 1560 
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agcacaabgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gbabgbgtcb 1620 
gaaaatgagc gbggagabbg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagbcaga ggtaactccc 1740 
gbbgcggbgc tgttaacggt ggagggcagt gtagtctgag cagbacbcgb bgcbgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctbbccab gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgbgbgabca gatatcgcgg ccgcbcbaga abgcgcgbga 1920 
aggagaagta ccagcaccbg tggcgctggg gctggcgctg gggcaccatg ctgctgggca 198 0 
tgctgatgat ctgcagcgcc accgagaagc tgtgggtgac cgtgtactac ggcgtgcccg 2 04 0 
tgtggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc tacgacaccg 2100 
aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
tggtgctggt gaacgtgacc gagaacttcg acabgbggaa gaacgacatg gbggagcaga 2220 
tgcacgagga catcatcagc ctgtgggacc agagcctgaa gccctgcgtg aagctgaccc 2280 
ccctgtgcgt gagcctgaag tgcaccgacg ctagcaccag ctgcaacacc agcgtgatca 2340 
cccaggcctg ccccaaggtg agcttcgagc ccatccccaa ccactactgc gcccccgccg 2400 
gcttcgccat cctgaagbgc aaggacaaga agttcaacgg caagggcccc tgcaccaacg 24 60 
tgagcaccgt gcagtgcacc cacggcatcc gccccgtggb gagcacccag ctgctgctga 2520 
acggtagcct ggccgaggag gaggtggtga tccgcagcgc taacttcgcc gacaacgcca 2580 
aggbgabcab cgtgcagctg aacgagagcg tggagatcaa ctgcacccgc cccaacaaca 2640 
acacccgcaa gagcatccac atcggccccg gccgcgcctt ctacaccacc ggcgagatca 2700 
tcggcgacat ccgccaggcc cactgcaacc tgagccgcgc caagtggaac gacaccctga 2760 
acaagatcgt gatcaagctg cgcgagcagb bcggcaacaa gaccatcgtg bbcaagcaca 2820 
gcagcggcgg cgaccccgag atcgtgaccc acagcttcaa ctgcggcggc gagbbcbbcb 2 880 
actgcaacag cacccagctg ttcaacagca cctggbbcaa cagcacctgg agcaccgagg 2940 
gcagcaacaa caccgagggc agcgacacca tcaccctgcc ctgccgcatc aagcagatca 3000 
tcaacatgtg gcagaaggtg ggcaaggcca tgtacgcccc ccccabcagc ggccagabcc 3060 
gcbgcagcag caacatcacc ggcctgctgc tgacccgcga cggcggcaac agcaacaacg 3120 
agagcgagat cttccgtccg ggcggcggcg acatgcgcga caactggcgc agcgagctgt 3180 
acaagtacaa ggtggtgaag atcgagcccc tgggcgtggc ccccaccaag gccaagctta 3240 
ccgtccaggc ccgccagctg ctgagcggca tcgtgcagca gcagaacaac ctgctgcgcg 3300 
ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag cagctgcagg 3360 
cccgcaccct ggccgtggag cgcbaccbga aggaccagca gcagctcgag cagatctgga 3420 
accacaccac ctggatggag bgggaccgcg agatcaacaa ctacaccagc ctgatccaca 3480 
gcctgatcga ggagagccag aaccagcagg agaagaacga gcaggagctg ctggagctgg 3540 
acaagtgggc cagcctgtgg aactggttca acatcaccaa ctggctgtgg tgaggatcca 3600 
gatctgctgt gccbbctagb tgccagccat ctgttghttg cccctccccc gbgccbbccb 3660 
tgaccctgga aggtgccact cccacbgbcc tbbccbaata aaatgaggaa attgcatcgc 3720 
abbgbcbgag baggbgtcab bctabbctgg ggggbggggb ggggcaggac agcaaggggg 3780 
aggabbggga agacaatagc aggcabgctg gggabgcggb gggcbcbatg ggbacccagg 3840 
tgctgaagaa bbgacccggb bccbccbggg ccagaaagaa gcaggcacat ccccbbcbcb 3900 
gtgacacacc cbgbccacgc ccctggtbct bagbbccagc cccacbcaba ggacacbcab 3960 
agcbcaggag ggctccgccb bcaatcccac ccgcbaaagb acbbggagcg gbctcbccct 4020 
cccbcabcag cccaccaaac caaacctagc ctccaagagb gggaagaaab baaagcaaga 4080 
baggcbabba agtgcagagg gagagaaaab gccbccaaca bgbgaggaag baabgagaga 4140 
aabcabagaa bbbbaaggcc abgabbbaag gccabcabgg ccbtaatctb ccgcbbccbc 4200 
gcbcacbgac bcgcbgcgct cggbcgbbcg gcbgcggcga gcggbabcag cbcacbcaaa 4260 
ggcggtaaba cggbbabcca cagaabcagg ggabaacgca ggaaagaaca bgbgagcaaa 4320 
aggccagcaa aaggccagga accgbaaaaa ggccgcgbbg cbggcgbbtb bccabaggcb 4380 
ccgccccccb gacgagcabc acaaaaabcg acgcbcaagb cagaggbggc gaaacccgac 4440 
aggacbabaa agabaccagg cgbbtccccc bggaagcbcc cbcgbgcgcb cbccbgbbcc 4500 
gacccbgccg cbbaccggab accbgbccgc cbbbcbcccb bcgggaagcg bggcgcbbbc 4560 
bcabagcbca cgcbgbaggb abcbcagbbc ggbgbaggbc gbbcgcbcca agcbgggcbg 4620 
bgtgcacgaa ccccccgtbc agcccgaccg cbgcgccbba bccggbaacb abcgtcbbga 4680 
gbccaacccg gbaagacacg acbbabcgcc acbggcagca gccacbggba acaggabbag 4740 
cagagcgagg babgbaggcg gtgcbacaga gbtcbtgaag bggbggccba acbacggcba 4800 
cacbagaaga acagbabbbg gbatcbgcgc bcbgcbgaag ccagbbaccb bcggaaaaag 4860 
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agttggbagc bctbgabccg gcaaacaaac 
caagcagcag attacgcgca gaaaaaaagg 
ggggtctgac gcbcagbgga acgaaaactc 
aaaaaggatc ttcacctaga tccttttaaa 
tatatatgag taaacttggt cbgacagbta 
agcgatctgt cbabbbcgtb catccatagt 
tctgcctcgt gaagaaggtg ttgctgactc 
ccagaaagbg agggagccac ggtbgabgag 
tttgaactbt tgctttgcca cggaacggtc 
cttcaactca gcaaaagttc gatttattca 
atgctctgcc agtgttacaa ccaattaacc 
aaabgaaacb gcaatttatt catatcagga 
ttctgtaatg aaggagaaaa ctcaccgagg 
cggtctgcga bbccgacbcg tccaacabca 
ataaggbbat caagtgagaa atcaccatga 
agcttabgca tttctttcca gacbbgttca 
tcactcgcat caaccaaacc gttattcatt 
cgatcgctgb taaaaggaca attacaaaca 
gccagcgcat caacaatabt ttcacctgaa 
gtbttcccgg ggabcgcagb ggtgagtaac 
ttgatggtcg gaagaggcab aaattccgtc 
acatcatbgg caacgctacc tbbgccatgt 
ccatacaatc gatagattgt cgcacctgat 
ccatataaat cagcatccab gttggaattt 
tgaatatggc bcabaacacc ccttgtatta 
catgabgaba tattbbbabc bbgbgcaabg 
ctttcccccc ccccccabba btgaagcabb 
atabbbgaat gbatbbagaa aaataaacaa 
gtgccacctg acgbctaaga aaccatbatt 
atcacgaggc cctttcgtc 



caccgctggb agcggtggtt tttttgtttg 4920 

atctcaagaa gatcctttga bcbbbtcbac 4980 

acgttaaggg attttggtca tgagattatc 5040 

ttaaaaatga agbbbtaaat caatcbaaag 5100 

ccaatgctta atcagtgagg cacctatctc 5160 

tgcctgactc gggggggggg sgcgctgagg 5220 

ataccaggcc tgaatcgccc catcatccag 5280 
agctttgttg taggtggacc agttggtgat 5340 
tgcghtgtcg ggaagatgcg tgatctgatc 54 00 
acaaagccgc cgtcccgtca agtcagcgta 54 60 
aattctgatt agaaaaactc atcgagcatc 5520 
ttatcaatac catatttttg aaaaagccgt 5580 
cagttccata ggatggcaag atcctggtat 5640 
atacaaccfca ttaatttccc ctcgtcaaaa 5700 
gtgacgactg aatccggtga gaatggcaaa 5760 
acaggccagc cattacgctc gtcatcaaaa 5820 
cgtgattgcg cctgagcgag acgaaatacg 5880 
ggaatcgaat gcaaccggcg caggaacact 5940 
tcaggatatt cttctaatac ctggaatgct 6000 
catgcatcat caggagtacg gataaaatgc 6060 
agccagttta gtctgaccat ctcatctgta 6120 
ttcagaaaca actctggcgc atcgggcttc 6180 
tgcccgacat tatcgcgagc ccatttatac 6240 
aatcgcggcc tcgagcaaga cgtttcccgt 6300 
ctgtttatgt aagcagacag ttttattgtt 6360 
taacatcaga gattttgaga cacaacgtgg 6420 
tatcagggtt attgtctcafc gagcggatac 6480 
abaggggttc cgcgcacatt tccccgaaaa 6540 
atcafcgacat taacctataa aaataggcgb 6600 

6619 



<210> 25 
<211> 6655 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 25 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggbg 120 
bbggcgggtg tcggggcbgg cbbaactatg cggcatcaga gcagabbgba ctgagagtgc 180 
accatatgcg gtgtgaaaba ccgcacagat gcgtaaggag aaaabaccgc abcagabtgg 240 
cbattggcca ttgcatacgb tgtabccata bcabaatatg bacatttata ttggctcabg 300 
bccaacabta ccgccatgtb gacabbgabt atbgacbagb tatbaabagt aatcaabtac 360 
ggggbcatta gttcabagcc catabatgga gbbccgcgbt acabaactba cggtaaatgg 420 
cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaataatga cgtatgttcc 480 
cabagtaacg ccaabaggga cbbbccabtg acgbcaatgg gtggagtatb tacggtaaac 540 
bgcccacbbg-gcagbacatc aagtgtatca tatgccaagb acgcccccba ttgacgtcaa 600 
tgacggtaaa bggcccgcct ggcattatgc ccagtacabg acctbatggg actbtccbac 660 
ttggcagbac atcbacgtat bagbcatcgc babtaccabg gtgabgcggb bbtggcagta 720 
catcaabggg cgtggabagc ggbbbgacfcc acggggabtb ccaagtctcc accccatbga 780 
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cgtcaatggg agtbtgtbtb. ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct abataagcag 900 
agctcgttta gbgaaccgbc agatcgccbg gagacgccat ccacgcbgbt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt tbtbggcttg gggcctatac acccccgctt ccbbabgcba 1140 
taggtgatgg tatagcttag cctataggbg bgggttattg accattattg accactcccc 1200 
babbggbgac gatactttcc atbacbaabc cabaacabgg ctcbbbgcca caactatctc 1260 
babbggcbab atgccaabac bcbgtccttc agagacbgac acggactctg tattttbaca 1320 
ggabggggbc ccabbtabta ttbacaaatb cacabataca acaacgccgb cccccgbgcc 13 80 
cgcagtbbbb abtaaacata gcgbgggatc bccacgcgaa tcbcgggbac gbgbbccgga 1440 
catgggctcb bcbccggbag cggcggagct bccacatccg agccctggbc ccabgcctcc 1500 
agcggcbcat ggbcgcbcgg cagctcctbg ctccbaacag tggaggccag actbaggcac 1560 
agcacaabgc ccaccaccac cagbgtgccg cacaaggccg tggcggtagg gbabgbgtcb 1620 
gaaaabgagc gtggagabtg ggcbcgcacg gcbgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagtbgtb gbattctgat aagagbcaga ggtaacbccc 174 0 
gttgcggbgc tgttaacggt ggagggcagb gbagtcbgag cagtactcgb tgctgccgcg 1800 
cgcgccacca gacataabag cbgacagact aacagacbgb bcctttccab gggtcbbbbc 1860 
tgcagtcacc gtcgbcgaca cgbgtgabca gabatcgcgg ccgcbctaga abgcgcgtga 1920 
aggagaagba ccagcaccbg bggcgctggg gcbggcgcbg gggcaccatg cbgctgggca 1980 
bgcbgabgab ctgcagcgcc accgagaagc tgtgggbgac cgbgbacbac ggcgtgcccg 2040 
tgtggaagga ggccaccacc accctgctcb gcgccagcga cgccaaggcc bacgacaccg 2100 
aggbgcacaa cgbgbgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
tggtgcbggb gaacgtgacc gagaactbcg acatgtggaa gaacgacatg gbggagcaga 2220 
bgcacgagga cabcatcagc ctgbgggacc agagcctgaa gcccbgcgbg aagcbgaccc 2280 
cccbgbgcgb gagcctgaag bgcaccgacg cbagcaagaa ctgcagcfchc aacatcagca 2340 
ccagcabccg cggcaaggbg cagaaggagt acgcctbcbb ctacaagcbg gacabcatcc 2400 
ccabcgacaa cgacaccacc agcbacagcc bgaccagcbg caacaccagc gbgabcaccc 2460 
aggccbgccc caaggtgagc btcgagccca bccccaacca ctactgcgcc cccgccggcb 2520 
bcgccabcct gaagtgcaag gacaagaagt bcaacggcaa gggccccbgc accaacgtga 2580 
gcaccgbgca gtgcacccac ggcabccgcc ccgbggtgag cacccagctg cbgcbgaacg 264 0 
gbagccbggc cgaggaggag gtggbgabcc gcagcgctaa ctbcgccgac aacgccaagg 27 00 
tgabcabcgt gcagctgaac gagagcgtgg agatcaacbg cacccgcgct agcgcccacb 2760 
gcaaccbgag ccgcgccaag tggaacgaca cccbgaacaa gatcgtgatc aagcbgcgcg 2820 
agcagttcgg caacaagacc abcgbgttca agcacagcag cggcggcgac cccgagatcg 2880 
bgacccacag ctbcaacbgc ggcggcgagt bctbctacbg caacagcacc cagctgbbca 2940 
acagcaccbg gbbcaacagc accbggagca ccgagggcag caacaacacc gagggcagcg 3000 
acaccatcac cctgccctgc cgcabcaagc agatcatcaa catgbggcag aaggbgggca 3 060 
aggccatgta cgcccccccc abcagcggcc agabccgcbg cagcagcaac atcaccggcc 3120 
tgcbgcbgac ccgcgacggc ggcaacagca acaacgagag cgagatcbtc cgbccgggcg 3180 
gcggcgacat gcgcgacaac bggcgcagcg agcbgtacaa gbacaaggtg gbgaagabcg 3240 
agcccctggg cgtggccccc accaaggcca agcttaccgt ccaggcccgc cagctgcbga 3300 
gcggcabcgt gcagcagcag aacaaccbgc tgcgcgccab cgaggcccag cagcaccfcgc 3 3 60 
bgcagcbgac cgtgbggggc abcaagcagc bgcaggcccg caccctggcc gtggagcgct 3420 
accbgaagga ccagcagcag cbcgagcaga tctggaacca caccaccbgg abggagtggg 34 80 
accgcgagat caacaacbac accagccbga bccacagccb gabcgaggag agccagaacc 3540 
agcaggagaa gaacgagcag gagcbgcbgg agcbggacaa gbgggccagc ctgbggaact 3600 
ggbbcaacab caccaacbgg cbgtggbgag gabccagabc bgctgbgccb bctagbtgcc 3660 
agccatcbgb tgttbgcccc bcccccgbgc cbbccttgac cctggaaggb gccacbccca 3720 
ctgbcctttc cbaataaaat gaggaaattg catcgcatfcg hctgagtagg bgtcabtcba 3780 
btcbgggggg tggggtgggg caggacagca agggggagga bbgggaagac aabagcaggc 3 84 0 
atgcbgggga tgcggbgggc tcbabgggta cccaggbgct gaagaabbga cccggbtccb 3900 
cctgggccag aaagaagcag gcacabcccc bbctcbgbga cacacccbgb ccacgccccb 3960 
ggbtcbtagb bccagcccca cbcabaggac acbcabagct caggagggcb ccgccttcaa 4020 
tcccacccgc baaagbactb ggagcggbcb cbcccbccct cabcagccca ccaaaccaaa 4080 
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ccbagccbcc aagagtggga agaaattaaa gcaagabagg ctattaagfcg cagagggaga 4140 
gaaaabgccb ccaacabgbg aggaagtaat gagagaaatc atagaatttt aaggccatga 4200 
tttaaggcca tcatggcctt aabctbccgc ttcctcgctc acbgacbcgc bgcgcbcggb 4260 
cgbbcggcbg cggcgagcgg tabcagcbca ctcaaaggcg gfcaatacggt tatccacaga 4320 
abcaggggab aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 43 80 
taaaaaggcc gcgbbgcbgg cgtttttcca baggcbccgc ccccctgacg agcatcacaa 4440 
aaatcgacgc tcaagtcaga ggbggcgaaa cccgacagga ctataaagat accaggcgtt 4500 
bcccccbgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 4560 
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agcbcacgcb gbaggbabcb 4620 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 4 680 
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggbaa gacacgactt 4740 
atcgccactg gcagcagcca cbggbaacag gattagcaga gcgaggtatg taggcggtgc 4 800 
tacagagttc tbgaagbggb ggcctaacta cggctacact agaagaacag tatttggtat 4860 
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggbagcbcbb gabccggcaa 4920 
acaaaccacc gctggtagcg gbggtbbbtb bgttbgcaag cagcagatta cgcgcagaaa 4 980 
aaaaggatct caagaagabc ctttgatctt ttctacgggg tctgacgctc agtggaacga 5040 
aaactcacgt baagggabbb tggtcatgag abbatcaaaa aggabctbca cctagatcct 5100 
tttaaattaa aaabgaagtt ttaaatcaat ctaaagtata babgagbaaa cbbggbcbga 5160 
cagttaccaa bgcbbaabca gtgaggcacc babcbcagcg atctgtctat ttcgttcatc 5220 
cabagbtgcc tgactcgggg ggggggggcg ctgaggtctg cctcgtgaag aaggtgttgc 5280 
tgactcatac caggcctgaa tcgccccatc atccagccag aaagbgaggg agccacggtt 5340 
gatgagagct ttgttgtagg tggaccagtt ggtgattttg aacttttgct ttgccacgga 5400 
acggtctgcg ttgtcgggaa gatgcgtgat ctgatccttc aactcagcaa aagttcgatt 5460 
tattcaacaa agccgccgtc ccgtcaagtc agcgtaatgc tctgccagtg ttacaaccaa 5520 
ttaaccaatt ctgattagaa aaactcatcg agcatcaaat gaaactgcaa tttattcata 5580 
tcaggattat caataccata tttttgaaaa agccgtttct gtaatgaagg agaaaactca 5640 
ccgaggcagt tccataggat ggcaagatcc tggtatcggb ctgcgattcc gactcgtcca 5700 
acatcaatac aacctattaa fcttcccctcg tcaaaaataa ggttatcaag tgagaaatca 5760 
ccatgagtga cgactgaatc cggtgagaat ggcaaaagct tatgcatttc tttccagact 5820 
tgttcaacag gccagccatt acgctcgtca tcaaaabcac tcgcabcaac caaaccgtba 5880 
ttcattcgtg atbgcgcctg agcgagacga aatacgcgat cgcbgttaaa aggacaabba 5940 
caaacaggaa bcgaabgcaa ccggcgcagg aacacbgcca gcgcabcaac aababbbbca 6000 
cctgaabcag gababbctbc taabaccbgg aabgctgttb bcccggggab cgcagbggbg 6060 
agbaaccabg cabcabcagg agtacggata aaabgcbtga bggbcggaag aggcabaaat 6120 
bccgbcagcc agtbbagbct gaccatcbca tcbgbaacab catbggcaac gctaccbbbg 6180 
ccabgbbbca gaaacaactc bggcgcabcg ggcbbcccat acaatcgata gabtgbcgca 6240 
ccbgabbgcc cgacabbabc gcgagcccab btabacccab abaaabcagc abccatgbbg 6300 
gaabtbaabc gcggccbcga gcaagacgtb bcccgtbgaa babggcfccab aacaccccbb 6360 
gbabbacbgb bbabgbaagc agacagbbtb abtgbtcabg abgabababb bbbabcbbgb 6420 
gcaabgbaac atcagagabb btgagacaca acgbggctbb cccccccccc ccabbabbga 6480 
agcabbbabc agggbbabhg bcbcatgagc ggabacabab btgaabgbab bbagaaaaat 6540 
aaacaaabag gggbbccgcg cacabbbccc cgaaaagbgc caccbgacgb ctaagaaacc 6600 
atbabtabca bgacatbaac cbabaaaaab aggcgbabca cgaggccctb bcgbc 6655 



<210> 26 
<211> 6658 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 26 
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bcgcgcgbbb cggtgatgac ggbgaaaacc 
cagctbgbct gtaagcggat gccgggagca 
ttggcgggtg tcggggctgg cttaactatg 
accatatgcg gtgtgaaata ccgcacagat 
cbabbggcca btgcabacgb bgtabccata 
tccaacatta ccgccabgbb gacabhgatb 
ggggbcabba gttcatagcc catatatgga 
cccgcctggc tgaccgccca acgacccccg 
catagtaacg ccaataggga cbttccatbg 
tgcccacttg gcagtacatc aagtgtatca 
bgacggbaaa bggcccgcct ggcabbatgc 
btgfrcagbac atcbacgtab tagtcatcgc 
catcaatggg cgbggabagc ggbtbgacbc 
cgtcaatggg agtttgtbbb ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agctcgtbba gtgaaccgtc agatcgccbg 
tagaagacac cgggaccgab ccagcctccg 
tccccgbgcc aagagbgacg taagbaccgc 
tcbbabgcab gcbabactgt bbbtggcbtg 
taggtgatgg tatagcttag ccbabaggtg 
tabtggtgac gatactttcc atbactaatc 
battggctat abgccaatac tctgbccbtc 
ggatggggtc ccabtbatba bttacaaatb 
cgcagbtttt abbaaacata gcgtgggatc 
cabgggcbcb tcbccggbag cggcggagct 
agcggctcafc ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag atgcaggcag cbgagbtgtb 
gbtgcggbgc bgbbaacggb ggagggcagb 
cgcgccacca gacabaabag cbgacagact 
bgcagtcacc gbcgbcgaca cgtgtgabca 
aggagaagba ccagcaccbg tggcgctggg 
tgctgatgab cbgcagcgcc accgagaagc 
bgbggaagga ggccaccacc acccbgcbcb 
aggbgcacaa cgtgbgggcc acccacgccb 
bggbgctggt gaacgtgacc gagaactbcg 
bgcacgagga catcatcagc ctgtgggacc 
cccbgtgcgb gagcctgaag tgcaccgacg 
ccagcabccg cggcaaggbg cagaaggagt 
ccabcgacaa cgacaccacc agcbacagcc 
aggcctgccc caaggbgagc ttcgagccca 
tcgccabccb gaagbgcaag gacaagaagb 
gcaccgbgca gtgcacccac ggcatccgcc 
gtagccbggc cgaggaggag gbggtgatcc 
bgabcabcgb gcagctgaac gagagcgbgg 
cccgcaagag catccacabc ggccccggcc 
gcgacatccg ccaggcccac bgcaacctga 
agatcgtgafc caagctgcgc gagcagttcg 
gcggcggcga ccccgagatc gtgacccaca 
gcaacagcgc bagccbgccc bgccgcatca 
gcaaggccab gbacgccccc cccatcagcg 
gcctgctgcb gacccgcgac ggcggcaaca 
gcggcggcga cabgcgcgac aacbggcgca 
tcgagccccb gggcgtggcc cccaccaagg 



tctgacacat gcagcbcccg gagacggbca 60 
gacaagcccg bcagggcgcg bcagcgggbg 120 
cggcatcaga gcagatbgta ctgagagbgc 180 
gcgtaaggag aaaataccgc atcagatbgg 240 
tcabaababg bacattbata tbggcbcabg 300 
abbgactagb tabbaabagb aatcaabbac 360 
gttccgcgtb acataactba cggbaaabgg 420 
cccabbgacg bcaabaabga cgbatgbbcc 480 
acgbcaabgg gbggagtatb tacggbaaac 540 
batgccaagb acgcccccta bbgacgtcaa 600 
ccagbacabg accbtabggg acbbbccbac 660 
babbaccatg gbgatgcggt bttggcagba 720 
acggggabtt ccaagbcbcc accccabbga 780 
bcaacgggac bbtccaaaat gtcgtaacaa 840 
gcgbgtacgg fcgggaggbct atataagcag 900 
gagacgccat ccacgcbgtt ttgacctcca 960 
cggccgggaa cggtgcatbg gaacgcggab 1020 
cbabagactc babaggcaca ccccttbggc 1080 
gggccbabac acccccgcbb ccttabgcba 1140 
bgggbbabtg accatbabtg accactcccc 1200 
cataacabgg ctcbttgcca caacbabcbc 1260 
agagacbgac acggactcbg tatttbbaca 1320 
cacabataca acaacgccgb cccccgtgcc 1380 
bccacgcgaa bcbcgggtac gbgttccgga 1440 
bccacatccg agcccbggbc ccatgccbcc 1500 
ctccbaacag tggaggccag acttaggcac 1560 
cacaaggccg bggcggbagg gbatgtgtcb 1620 
gctgacgcag abggaagact taaggcagcg 1680 
gtabbcbgab aagagbcaga ggtaactccc 174 0 
gbagbcbgag cagbacbcgb bgctgccgcg 1800 
aacagactgt bccbtbccab gggtcbbbtc 1860 
gabatcgcgg ccgctcbaga atgcgcgbga 1920 
gctggcgctg gggcaccabg ctgctgggca 1980 
tgtgggtgac cgbgtacbac ggcgbgcccg 2040 
gcgccagcga cgccaaggcc bacgacaccg 2100 
gcgbgcccac cgaccccaac ccccaggagg 2160 
acabgbggaa gaacgacatg gtggagcaga 2220 
agagccbgaa gccctgcgtg aagcbgaccc 2280 
ctagcaagaa ctgcagcbtc aacabcagca 2340 
acgcctbctt ctacaagctg gacabcabcc 2400 
tgaccagcbg caacaccagc gtgabcaccc 2460 
tccccaacca ctacbgcgcc cccgccggcb 2520 
bcaacggcaa gggcccctgc accaacgtga 2580 
ccgtggbgag cacccagcbg cbgctgaacg 2640 
gcagcgctaa cttcgccgac aacgccaagg 27 00 
agabcaactg cacccgcccc aacaacaaca 2760 
gcgccbbcba caccaccggc gagabcatcg 2820 
gccgcgccaa gbggaacgac acccbgaaca 2880 
gcaacaagac cabcgtgttc aagcacagca 2940 
gctbcaacbg cggcggcgag ttcbbcbacb 3000 
agcagatcab caacabgbgg cagaaggbgg 3060 
gccagabccg cbgcagcagc aacabcaccg 3120 
gcaacaacga gagcgagatc btccgbccgg 3180 
gcgagcbgba caagbacaag gbggbgaaga 324 0 
ccaagcbbac cgbccaggcc cgccagctgc 3300 
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tgagcggcat cgtgcagcag cagaacaacc 
tgctgcagct gaccgtgtgg ggcatcaagc 
gctacctgaa ggaccagcag cagctcgagc 
gggaccgcga gatcaacaac bacaccagcc 
accagcagga gaagaacgag caggagctgc 
actggttcaa catcaccaac tggctgtggt 
gccagccatc tgttgtttgc ccctcccccg 
ccacbgbccb ttcctaataa aatgaggaaa 
cbabbcbggg gggtggggtg gggcaggaca 
ggcatgctgg ggatgcggtg ggcbcbatgg 
ccbccbgggc cagaaagaag caggcacabc 
cctggttctt agttccagcc ccactcatag 
caatcccacc cgctaaagta cttggagcgg 
aaacctagcc tccaagagtg ggaagaaatt 
agagaaaatg cctccaacat gtgaggaagt 
tgatttaagg ccatcatggc cttaatcttc 
ggtcgttcgg ctgcggcgag cggtatcagc 
agaatcaggg gataacgcag gaaagaacat 
ccgtaaaaag gccgcgttgc bggcgbbbbb 
caaaaatcga cgctcaagtc agaggtggcg 
gbbbcccccb ggaagcbccc tcgtgcgctc 
cctgtccgcc tttctccctt cgggaagcgt 
bcbcagbbcg gtgtaggtcg ttcgctccaa 
gcccgaccgc tgcgccttat ccggtaacta 
cbbatcgcca ctggcagcag ccactggtaa 
tgctacagag ttcttgaagt ggtggcctaa 
tatctgcgct ctgctgaagc cagbbacctb 
caaacaaacc accgcbggba gcggtggttt 
aaaaaaagga tctcaagaag atcctttgat 
cgaaaactca cgttaaggga btbbggtcab 
ccttttaaat taaaaatgaa gbbtbaaatc 
bgacagbbac caabgcbbaa bcagtgaggc 
atccatagtt gcctgacbcg gggggggggg 
bgcbgactca taccaggcct gaatcgcccc 
gttgabgaga gcbtbgtbgb aggtggacca 
ggaacggbcb gcgtbgtcgg gaagatgcgt 
abbtatbcaa caaagccgcc gtcccgtcaa 
caattaacca atbcbgabba gaaaaactca 
ababcaggab tatcaatacc atatttttga 
tcaccgaggc agtbccatag gabggcaaga 
ccaacabcaa bacaaccbab baatbtcccc 
tcaccabgag bgacgacbga abccggbgag 
acbbgbbcaa caggccagcc abtacgcbcg 
bbabbcabbc gbgabtgcgc cbgagcgaga 
tbacaaacag gaabcgaabg caaccggcgc 
bcaccbgaab caggababtc bbcbaabacc 
gbgagtaacc abgcatcabc aggagbacgg 
aatbccgbca gccagtbbag bcbgaccatc 
btgccabgbb tcagaaacaa cbcbggcgca 
gcaccbgabb gcccgacabb abcgcgagcc 
tbggaattba abcgcggccb cgagcaagac 
ctbgtatbac bgbbbabgta agcagacagt 
bgbgcaabgb aacabcagag atbbtgagac 
tgaagcabbb abcagggbba bbgbcbcabg 
aabaaacaaa taggggbbcc gcgcacabbt 



bgcbgcgcgc cabcgaggcc cagcagcacc 3360 
agcbgcaggc ccgcacccbg gccgtggagc 3420 
agabcbggaa ccacaccacc bggabggagb 3480 
tgabccacag cctgabcgag gagagccaga 3540 
bggagctgga caagbgggcc agccbgbgga 3600 
gaggabccag abcbgcbgbg ccbbctagbb 3660 
bgccbtccbb gaccctggaa ggbgccacbc 3720 
btgcabcgca bbgbcbgagt aggbgtcabb 3780 
gcaaggggga ggabtgggaa gacaabagca 3840 
gbacccaggb gctgaagaab bgacccggtb 3900 
ccctbcbcbg 'bgacacaccc bgbccacgcc 3960 
gacacbcaba gcbcaggagg gctccgccbb 4020 
bcbcbcccbc ccbcabcagc ccaccaaacc 4080 
aaagcaagat aggcbatbaa gtgcagaggg 4140 
aabgagagaa abcabagaab bbbaaggcca 4200 
cgcbtccbcg cbcacbgacb cgctgcgcbc 4260 
bcacbcaaag gcggbaabac ggbbabccac 4320 
gbgagcaaaa ggccagcaaa aggccaggaa 43 80 
ccabaggcbc cgcccccctg acgagcatca 444 0 
aaacccgaca ggactabaaa gabaccaggc 4 500 
tccbgbbccg acccbgccgc tbaccggaba 4 560 
ggcgcbbbct cabagcbcac gcbgtaggba 4620 
gcbgggcbgb gbgcacgaac cccccgbbca 4680 
bcgbcttgag bccaacccgg baagacacga 474 0 
caggabbagc agagcgaggt atgbaggcgg 4800 
cbacggcbac acbagaagaa cagbabtbgg 4860 
cggaaaaaga gbbggbagcb cbbgabccgg 4920 
bbtbgbtbgc aagcagcaga bbacgcgcag 4980 
ctbbbcbacg gggbcbgacg cbcagbggaa 5040 
gagabbatca aaaaggabcb bcaccbagat 5100 
aabcbaaagb ababatgagb aaacbbggtc 5160 
accbatcbca gcgabcbgbc babbbcgbfcc 5220 
gcgctgaggb ctgccbcgbg aagaaggbgb 5280 
atcabccagc cagaaagbga gggagccacg 5340 
gbtggbgabb tbgaacbbbt gcbtbgccac 5400 
gabcbgabcc bbcaacbcag caaaagtbcg 5460 
gbcagcgbaa tgcbctgcca gbgbbacaac 5520 
bcgagcatca aabgaaacbg caabbbabbc 5580 
aaaagccgbb bcbgbaabga aggagaaaac 5640 
bccbggbatc ggtcbgcgab bccgacbcgt 57 00 
bcgbcaaaaa baaggbbabc aagbgagaaa 5760 
aabggcaaaa gctbabgcab btctbbccag 5820 
bcabcaaaab cacbcgcabc aaccaaaccg 5880 
cgaaatacgc gabcgcbgbt aaaaggacaa 5940 
aggaacacbg ccagcgcatc aacaababtt 6000 
bggaatgcbg bbbbcccggg gabcgcagbg 6060 
abaaaatgcb bgabggtcgg aagaggcaba 6120 
bcatctgbaa cabcabtggc aacgcbaccb 6180 
bcgggcbtcc cabacaabcg abagabbgbc 6240 
cabbtabacc cabataaabc agcabccabg 6300 
gbbbcccgbb gaababggcb cabaacaccc 6360 
tbbabbgbbc atgabgabab abbbbbatcb 6420 
acaacgbggc btbccccccc cccccabtab 6480 
agcggabaca babbtgaatg tabbbagaaa 6540 
ccccgaaaag bgccaccbga cgbctaagaa 6600 
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accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtc 6658 



<210> 27 
<211> 6616 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVK1012x/s containing HIV genes 
<400> 27 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaabaccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
fcccaacatta ccgccatgtt gacattgatt atfcgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480. 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta tfcgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatchc 1260 
tattggctat ahgccaatac tctgtccttc agagactgac acggactcfcg tatttfctaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa bctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 16B0 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tccttbccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga atgcgcgtga 1920 
aggagaagta ccagcacctg tggcgctggg gctggcgctg gggcaccatg ctgctgggca 1980 
tgctgatgat ctgcagcgcc accgagaagc tgtgggtgac cgtgfcactac ggcgtgcccg 204 0 
tgtggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc tacgacaccg 2100 
aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
tggtgctggt gaacgtgacc gagaacttcg acatgtggaa gaacgacatg gtggagcaga 2220 
tgcacgagga catcabcagc ctgtgggacc agagcctgaa gccctgcgtg aagctgaccc 2280 
ccctgtgcgt gagcctgaag tgcaccgacc tgaagaacga caccaacacc aacagcagca 23 4 0 
gcggccgcat gatcatggag aagggcgaga tcaagaactg cagcttcgct agcaccagct 2400 
gcaacaccag cgtgatcacc caggcctgcc ccaaggtgag cttcgagccc atccccaacc 24 60 
actactgcgc ccccgccggc ttcgccatcc tgaagtgcaa. ggacaagaag ttcaacggca 252 0 
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agggcccctg caccaacgtg agcaccgtgc agtgcaccca cggcatccgc cccgtggtga 2580 
gcacccagct gctgctgaac ggbagccbgg ccgaggagga ggtggtgatc cgcagcgcba 2640 
acbtcgccga caacgccaag gtgatcatcg bgcagcbgaa cgagagcgtg gagatcaact 2700 
gcacccgcgc tagcgcccac bgcaaccbga gccgcgccaa gtggaacgac accctgaaca 2760 
agatcgtgat caagctgcgc gagcagttcg gcaacaagac catcgtgttc aagcacagca 2820 
gcggcggcga ccccgagatc gtgacccaca gcbbcaactg cggcggcgag ttcttctact 2880 
gcaacagcac ccagctgttc aacagcacct ggttcaacag cacctggagc accgagggca 2940 
gcaacaacac cgagggcagc gacaccatca ccctgccctg ccgcatcaag cagatcatca 3000 
acatgtggca gaaggbgggc aaggccatgt acgccccccc catcagcggc cagatccgct 3060 
gcagcagcaa cabcaccggc ctgctgctga cccgcgacgg cggcaacagc aacaacgaga 312 0 
gcgagatctt ccgtccgggc ggcggcgaca tgcgcgacaa ctggcgcagc gagctgtaca 3180 
agtacaaggt ggbgaagabc gagcccctgg gcgtggcccc caccaaggcc aagcttaccg 3240 
tccaggcccg ccagctgchg agcggcatcg tgcagcagca gaacaacctg ctgcgcgcca 3300 
tcgaggccca gcagcacctg ctgcagchga ccgtgtgggg catcaagcag ctgcaggccc 3360 
gcaccctggc cgbggagcgc tacctgaagg accagcagca gctcgagcag atctggaacc 3420 
acaccacctg gabggagbgg gaccgcgaga tcaacaacta caccagcctg atccacagcc 3480 
tgatcgagga gagccagaac cagcaggaga agaacgagca ggagctgctg gagctggaca 3540 
agtgggccag cctgtggaac tggttcaaca tcaccaactg gctgtggtga ggatccagat 3600 
ctgctgtgcc thctagttgc cagccatctg bbgbbtgccc ctcccccgtg ccttccttga 3660 
ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt gcabcgcabt 3720 
gtctgagtag gtgbcabbct attctggggg gtggggtggg gcaggacagc aagggggagg 3780 
attgggaaga caabagcagg catgctgggg abgcggbggg ctcbatgggb acccaggbgc 3840 
tgaagaattg acccggttcc tcctgggcca gaaagaagca ggcacatccc cttctctgtg 3900 
acacaccctg tccacgcccc tggbbctbag ttccagcccc actcatagga cactcabagc 3960 
tcaggagggc bccgccbbca abcccacccg cbaaagbacb bggagcggbc tctccctccc 4020 
tcatcagccc accaaaccaa acctagcctc caagagtggg aagaaattaa agcaagabag 4080 
gctattaagt gcagagggag agaaaatgcc tccaacatgt gaggaagtaa bgagagaaab 4140 
catagaattt taaggccatg abtbaaggcc atcatggcct taabcbbccg ctbccbcgct 4200 
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc acbcaaaggc 4260 
ggtaatacgg ttatccacag aatcagggga baacgcagga aagaacatgt gagcaaaagg 4320 
ccagcaaaag gccaggaacc gbaaaaaggc cgcgtbgctg gcgbbbbtcc ataggcbccg 4380 
ccccccbgac gagcabcaca aaaatcgacg cbcaagbcag aggbggcgaa acccgacagg 4440 
actataaaga taccaggcgb bbcccccbgg aagctccctc gbgcgctcbc cbgbtccgac 4500 
ccbgccgctb accggabacc tgbccgccbb tcbcccbbcg ggaagcgbgg cgcttbctca 4560 
bagctcacgc bgbaggbatc bcagtbcggt gbaggtcgbb cgcbccaagc tgggcbgbgb 4620 
gcacgaaccc cccgbbcagc ccgaccgctg cgccbbabcc ggtaacbabc gbcbbgagbc 4680 
caacccggba agacacgacb tabcgccact ggcagcagcc acbggbaaca ggatbagcag 4 740 
agcgaggtab gbaggcggbg cbacagagbb cbbgaagbgg bggccbaacb acggcbacac 4 800 
tagaagaaca gtabbbggba bcbgcgctcb gcbgaagcca gtbacctbcg gaaaaagagt 4 860 
tggbagcbct tgatccggca aacaaaccac cgcbggbagc ggtggtbbbt btgbttgcaa 4 920 
gcagcagatt acgcgcagaa aaaaaggatc bcaagaagab ccbbbgatct bbtctacggg 4980 
gbcbgacgcb cagtggaacg aaaacbcacg tbaagggabb bbggbcatga gattabcaaa 5040 
aaggabcbbc accbagatcc tbbbaaatta aaaabgaagb bbtaaabcaa bcbaaagbab 5100 
ababgagbaa actbggbcbg acagbbacca abgcbbaabc agbgaggcac ctabcbcagc 5160 
gabcbgtcba tbbcgtbcab ccabagbbgc cbgacbcggg gggggggggc gctgaggbct 5220 
gcctcgbgaa gaaggtgbbg ctgacbcata ccaggcctga atcgccccab catccagcca 5280 
gaaagbgagg gagccacggb tgabgagagc btbgbbgtag gtggaccagb tggbgatbbb 5340 
gaacbttbgc bbtgccacgg aacggbctgc gbbgbcggga agabgcgbga tctgabccbb 5400 
caacbcagca aaagbbcgab btatbcaaca aagccgccgt cccgbcaagb cagcgbaabg 5460 
cbcbgccagb gbtacaacca abbaaccaab bcbgabbaga aaaacbcabc gagcatcaaa 5520 
tgaaacbgca abbbabbcab abcaggabta bcaabaccab abbbbtgaaa aagccgbbbc 5580 
bgbaabgaag gagaaaacbc accgaggcag bbccatagga bggcaagabc cbggtabcgg 5640 
tcbgcgabbc cgacbcgbcc aacabcaaba caaccbabba atbbcccctc gbcaaaaaba 5700 
aggbbabcaa gbgagaaabc accabgagbg acgacbgaat ccggtgagaa bggcaaaagc 5760 
btatgcabbb cbbbccagac bbgbbcaaca ggccagccab bacgcbcgtc abcaaaabca 5820 
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ctcgcatcaa ccaaaccgtt attcattcgt gattgcgcch gagcgagacg aaatacgcga 5880 
tcgctgtfcaa aaggacaatt acaaacagga atcgaatgca accggcgcag gaacactgcc 5940 
agcgcatcaa caatattttc acctgaatca ggatattctt ctaatacctg gaatgctgtt 6000 
ttcccgggga tcgcagtggt gagtaaccat gcatcatcag gagtacggat aaaatgcttg 6060 
atggtcggaa gaggcataaa ttccgtcagc cagtttagtc tgaccatctc atctgtaaca 6120 
tcattggcaa cgctaccttt gccatgttfcc agaaacaact ctggcgcatc gggcttccca 61B0 
tacaatcgat agattgtcgc acctgattgc ccgacattat cgcgagccca tttataccca 6240 
tataaatcag catccatgtt ggaatttaat cgcggcctcg agcaagacgt ttcccgttga 6300 
atatggctca taacacccct tgtattactg tttatgtaag cagacagttt tattgttcat 6360 
gatgatatat ttttatcttg tgcaatgtaa catcagagat tttgagacac aacgtggctt 6420 
tccccccccc cccattattg aagcatttat cagggttatt gtctcatgag cggatacata 6480 
tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg 6540 
ccacctgacg tctaagaaac cattattatc atgacattaa cctataaaaa taggcgtatc 6600 
acgaggccct ttcgtc 6616 



<210> 28 
<211> 6619 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 28 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc cCatttatta tttacaaatt cacatataca acaacgccgt .cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 144 0 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
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gbbgcggbgc bgbbaacggb ggagggcagt gbagbctgag cagtacbcgb bgcbgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggbcbbbtc 1860 
tgcagtcacc gbcgbcgaca cgtgtgatca gatatcgcgg ccgctctaga atgcgcgbga 1920 
aggagaagta ccagcacctg tggcgctggg gcbggcgcbg gggcaccabg cbgctgggca 1980 
tgctgatgat ctgcagcgcc accgagaagc tgtgggtgac cgtgtactac ggcgtgcccg 2040 
bgbggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc tacgacaccg 2100 
aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
bggbgcbggb gaacgtgacc gagaacttcg acatgtggaa gaacgacatg gtggagcaga 2220 
tgcacgagga catcatcagc ctgtgggacc agagcctgaa gccctgcgtg aagctgaccc 2280 
ccctgtgcgt gagcctgaag tgcaccgacc bgaagaacga caccaacacc aacagcagca 234 0 
gcggccgcat gatcatggag aagggcgaga tcaagaactg cagcttcgct agcaccagct 2400 
gcaacaccag cgbgatcacc caggcctgcc ccaaggbgag cttcgagccc atccccaacc 2460 
actactgcgc ccccgccggc ttcgccatcc tgaagtgcaa ggacaagaag ttcaacggca 2520 
agggcccctg caccaacgtg agcaccgtgc agbgcaccca cggcatccgc cccgtggtga 2580 
gcacccagct gctgctgaac ggtagcctgg ccgaggagga ggtggtgatc cgcagcgcta 2640 
acttcgccga caacgccaag gtgahcatcg tgcagctgaa cgagagcgtg gagahcaact 2700 
gcacccgccc caacaacaac acccgcaaga gcatccacat cggccccggc cgcgccttct 2760 
acaccaccgg cgagatcatc ggcgacatcc gccaggccca ctgcaacctg agccgcgcca 2820 
agtggaacga caccctgaac aagatcgtga tcaagctgcg cgagcagttc ggcaacaaga 2880 
ccatcgtgtt caagcacagc agcggcggcg accccgagat cgtgacccac agcbbcaacb 2940 
gcggcggcga gbbcbbcbac tgcaacagcg ctagcctgcc cbgccgcabc aagcagatca 3000 
tcaacatgtg gcagaaggtg ggcaaggcca tgtacgcccc ccccatcagc ggccagatcc 3060 
gctgcagcag caacatcacc ggcctgctgc tgacccgcga cggcggcaac agcaacaacg 3120 
agagcgagat cttccgtccg ggcggcggcg acatgcgcga caactggcgc agcgagctgt 3180 
acaagtacaa ggtggtgaag atcgagcccc tgggcgtggc ccccaccaag gccaagctta 3240 
ccgtccaggc ccgccagctg ctgagcggca tcgtgcagca gcagaacaac ctgctgcgcg 3300 
ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag cagcbgcagg 3360 
cccgcaccct ggccgtggag cgcbaccbga aggaccagca gcagctcgag cagatctgga 3420 
accacaccac ctggatggag tgggaccgcg agabcaacaa ctacaccagc ctgatccaca 34 BO 
gcctgatcga ggagagccag aaccagcagg agaagaacga gcaggagctg ctggagctgg 3 540 
acaagtgggc cagcctgtgg aactggttca acatcaccaa ctggctgtgg bgaggabcca 3600 
gatctgctgt gccttctagt tgccagccat cbgtbgbbbg cccctccccc gtgccttcct 3660 
tgaccctgga aggtgccact cccactgtcc bbbccbaata aaatgaggaa attgcatcgc 3720 
attgbctgag taggtgtcat hctattctgg ggggtggggt ggggcaggac agcaaggggg 3780 
aggattggga agacaatagc aggcatgctg gggafcgcggt gggctctatg ggtacccagg 3840 
tgctgaagaa ttgacccggt tcctcctggg ccagaaagaa gcaggcacat ccccttchct 3900 
gbgacacacc ctgtccacgc ccctggttct tagttccagc cccactcata ggacactcat 3960 
agctcaggag ggcbccgccb tcaatcccac ccgctaaagt acttggagcg gtctctccct 4020 
ccctcatcag cccaccaaac caaacctagc ctccaagagt gggaagaaat taaagcaaga 4 080 
taggctatta agtgcagagg gagagaaaat gccbccaaca tgtgaggaag taatgagaga 4140 
aatcatagaa ttttaaggcc atgatttaag gccatcatgg ccbbaabcbb ccgcbtccbc 4200 
gcbcacbgac bcgcbgcgct cggbcgbbcg gcbgcggcga gcggtabcag ctcacbcaaa 4260 
ggcggtaaba cggbbabcca cagaatcagg ggabaacgca ggaaagaaca bgbgagcaaa 4320 
aggccagcaa aaggccagga accgbaaaaa ggccgcgbbg cbggcgbbbb bccabaggcb 4380 
ccgccccccb gacgagcabc acaaaaabcg acgcbcaagt cagaggbggc gaaacccgac 4440 
aggactabaa agabaccagg cgbbbccccc bggaagcbcc cbcgtgcgct cbccbgbbcc 4500 
gacccbgccg cbbaccggab accbgbccgc ctbbctccct tcgggaagcg bggcgcbbtc 4560 
bcatagcbca cgcbgtaggb abcbcagbbc ggbgbaggtc gbbcgcbcca agcbgggcbg 4620 
bgbgcacgaa ccccccgbbc agcccgaccg cbgcgccbba bccggbaacb abcgbcbbga 4680 
gbccaacccg gbaagacacg acbbabcgcc acbggcagca gccacbggba acaggabbag 4740 
cagagcgagg batgbaggcg gbgcbacaga gbbcbbgaag bggbggccba acbacggcba 4800 
cacbagaaga acagbabttg gbabcbgcgc bcbgcbgaag ccagbbacct tcggaaaaag 4860 
agbbggbagc bcbbgatccg gcaaacaaac caccgcbggb agcggbggbb bbbbbgbbbg 4920 
caagcagcag abbacgcgca gaaaaaaagg abcbcaagaa gabccbtbga bcttbbcbac 4980 
ggggbcbgac gcbcagbgga acgaaaactc acgbbaaggg abbbbggbca bgagabbabc 5040 
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aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agtttbaaat caatctaaag 5100 
babababgag baaacbbggb cbgacagbba ccaatgctta abcagbgagg cacctatctc 5160 
agcgatctgt ctabtbcgbb .catccabagb tgcctgactc gggggggggg ggcgcbgagg 5220 
tctgcctcgt gaagaaggtg ttgctgactc ataccaggcc tgaatcgccc catcatccag 5280 
ccagaaagtg agggagccac ggttgatgag agctttgttg taggtggacc agttggtgat 5340 
tttgaacttt tgctttgcca cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc 5400 
cttcaactca gcaaaagttc gatttatbca acaaagccgc cgbcccgtca agbcagcgba 5460 
abgctctgcc agtgbtacaa ccaattaacc aabbctgabb agaaaaacbc abcgagcabc 5520 
aaatgaaact gcaatttatt catatcagga tbabcaabac cabatttbbg aaaaagccgb 5580 
btctgbaatg aaggagaaaa cbcaccgagg cagbbccata ggatggcaag abccbggtat 5640 
cggtcbgcga bbccgactcg tccaacabca atacaaccta ttaatttccc ctcgtcaaaa 5700 
abaaggttat caagtgagaa abcaccabga gbgacgacbg aabccggbga gaabggcaaa 5760 
agcttatgca tttcttbcca gacbbgttca acaggccagc catbacgcbc gbcatcaaaa 5B20 
tcactcgcab caaccaaacc gbtatbcabt cgtgattgcg ccbgagcgag acgaaatacg 5880 
cgatcgctgt taaaaggaca abtacaaaca ggaatcgaat gcaaccggcg caggaacact 5940 
gccagcgcat caacaabatb bbcaccbgaa tcaggababb ctbctaabac ctggaatgct 6000 
gbtttcccgg ggabcgcagt ggfcgagtaac catgcatcat caggagbacg gabaaaatgc 6060 
ttgatggbcg gaagaggcat aaatbccgtc agccagttba gtcbgaccab ctcatcbgta 6120 
acatcatbgg caacgcbacc tbbgccabgt ttcagaaaca acbctggcgc abcgggcttc 6180 
ccatacaatc gabagabtgt cgcacctgat tgcccgacat fcabcgcgagc ccabbtatac 6240 
ccatabaaab cagcatccat gtbggaabtb aatcgcggcc tcgagcaaga cgbfctcccgt 63 00 
tgaatatggc tcabaacacc ccbbgtabba ctgbttatgt aagcagacag ttttabtgtt 6360 
catgabgata tatbbtbatc tbgbgcaabg taacatcaga gatbtbgaga cacaacgbgg 6420 
cbbbcccccc ccccccabba tbgaagcabt babcagggbb attgbcbcab gagcggatac 6480 
ababbbgaab gbabbtagaa aaabaaacaa ataggggtbc cgcgcacatt bccccgaaaa 6540 
gbgccaccbg acgbctaaga aaccabbatt abcabgacab baaccbabaa aaabaggcgb 6600 
abcacgaggc ccbbbcgbc 6619 



<210> 29 
<211> 6634 
<212> DNA 

<213> Arbificial Sequence 
c220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 29 

bcgcgcgbbb cggtgabgac ggbgaaaacc tcbgacacab gcagcbcccg gagacggbca 60 

cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 12 0 

tbggcgggbg bcggggcbgg cbbaacbatg cggcabcaga gcagabbgba cbgagagbgc 180 

accatatgcg gbgbgaaaba ccgcacagab gcgtaaggag aaaataccgc abcagabbgg 24 0 

cbabtggcca bbgcabacgb tgbabccaba bcabaatabg bacatbbaba tbggcbcabg 300 

tccaacabba ccgccabgbb gacatbgabb atbgacbagb babbaabagb aabcaabbac 360 

ggggbcabba gbbcabagcc cabababgga gbbccgcgbb acabaacbba cggbaaabgg 420 

cccgcctggc bgaccgccca acgacccccg cccabbgacg bcaataabga cgtabgbbcc 480 

catagbaacg ccaabaggga cbttccatbg acgtcaabgg gbggagbabb bacggbaaac 540 

tgcccacbbg gcagbacabc aagbgbatca tatgccaagb acgcccccba bbgacgbcaa 600 

tgacggtaaa bggcccgccb ggcabbatgc ccagbacabg accbtabggg acbbbccbac 660 

tbggcagbac abcbacgbab bagbcatcgc babbaccabg gbgabgcggb bbbggcagba 72 0 

catcaabggg cgbggabagc ggbbbgactc acggggabbb ccaagbcbcc accccabbga 78 0 

cgtcaatggg agbbbgbbbb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 

cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggtcb ababaagcag 900 

agcbcgbbba gbgaaccgbc agabcgccbg gagacgccab ccacgctgbb tbgaccbcca 960 
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tagaagacac cgggaccgat ccagcctccg cggccgggaa cggbgcattg gaacgcggab 1020 

tccccgtgcc aagagbgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 

bcttabgcab gctabacbgb fctttggcttg gggcctatac acccccgctt ccttatgcta 1140 

taggtgatgg babagctbag cctataggtg tgggttattg accattattg accactcccc 12 00 

tattggtgac gabacbbbcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

babbggcbab atgccaatac bctgtcctbc agagactgac acggactctg tatttttaca 1320 

ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt attaaacata gcgtgggatc tccacgcgaa bctcgggbac gbgbbccgga 1440 

cabgggcbcb tcbccggbag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggbcgcbcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gbabgbgbcb 1620 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagbcaga ggfcaactccc 174 0 

gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt bgctgccgcg 18 00 

cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtctttbc 1860 

tgcagtcacc gtcgtcgaca cgtgtgatca gatabcgcgg ccgctctaga atgcgcgtga 1920 

aggagaagta ccagcacctg tggcgctggg gcbggcgcbg gggcaccatg ctgctgggca 1980 

tgcbgatgat ctgcagcgcc accgagaagc tgtgggtgac cgtgtactac ggcgtgcccg 2 040 

bgbggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc tacgacaccg 2100 

aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 

tggbgcbggb gaacgtgacc gagaacttcg acatgtggaa gaacgacatg gtggagcaga 2220 

tgcacgagga catcatcagc ctgtgggacc agagcctgaa gccctgcgtg aagctgaccc 22 80 

ccctgtgcgt gagcctgaag tgcaccgacc tgaagaacga caccaacacc aacagcagca 234 0 

gcggccgcat gatcatggag aagggcgaga tcaagaactg cagcttcaac atcagcacca 24 00 

gcabccgcgg caaggtgcag aaggagtacg cctbcbbcba caagctggac abcabcccca 2460 

tcgacaacga caccaccagc tacagcctga ccagctgcaa caccagcgtg atcacccagg 2520 

cctgccccaa ggtgagcttc gagcccabcc ccaaccacba cbgcgccccc gccggcbtcg 2580 

ccabccbgaa gbgcaaggac aagaagbbca acggcaaggg ccccbgcacc aacgbgagca 2640 

ccgbgcagbg cacccacggc abccgccccg bggbgagcac ccagcbgctg cbgaacggba 2700 

gccbggccga ggaggaggbg gbgabccgca gcgcbaacbt cgccgacaac gccaaggbga 27 60 

bcabcgbgca gcbgaacgag agcgbggaga tcaacbgcac ccgcgctagc gcccactgca 2 82 0 

acctgagccg cgccaagtgg aacgacaccc bgaacaagat cgbgabcaag cbgcgcgagc 2 880 

agbbcggcaa caagaccabc gtgtbcaagc acagcagcgg cggcgacccc gagatcgbga 2940 

cccacagcbt caacbgcggc ggcgagtbct tcbactgcaa cagcgctagc cbgcccbgcc 3000 

gcatcaagca gabcabcaac atgbggcaga aggbgggcaa ggccabgtac gcccccccca 3060 

bcagcggcca gabccgcbgc agcagcaaca bcaccggccb gcbgcbgacc cgcgacggcg 3120 

gcaacagcaa caacgagagc gagatctbcc gbccgggcgg cggcgacatg cgcgacaacb 3180 

ggcgcagcga gctgtacaag bacaaggbgg tgaagabcga gcccctgggc gbggccccca 3240 

ccaaggccaa gctbaccgbc caggcccgcc agcbgcbgag cggcabcgbg cagcagcaga 3300 

acaaccbgcb gcgcgccabc gaggcccagc agcaccbgcb gcagcbgacc gtgbggggca 3360 

bcaagcagcb gcaggcccgc acccbggccg tggagcgcta ccbgaaggac cagcagcagc 3420 

bcgagcagab cbggaaccac accacctgga tggagtggga ccgcgagatc aacaactaca 34 80 

ccagccbgab ccacagccbg abcgaggaga gccagaacca gcaggagaag aacgagcagg 354 0 

agcbgcbgga gctggacaag bgggccagcc bgbggaactg gtbcaacatc accaacbggc 3600 

bgtggbgagg abccagatcb gcbgbgccbb cbagbbgcca gccabcbgtb gttbgccccb 3660 

cccccgbgcc bbccbtgacc cbggaaggbg ccactcccac bgbcctbbcc baataaaabg 3720 

aggaaabbgc abcgcabbgb cbgagbaggt gbcatbcbab bcbggggggt ggggbggggc 3780 

aggacagcaa gggggaggab bgggaagaca abagcaggca bgctggggab gcggbgggct 384 0 
ctabgggtac ccaggtgcbg aagaatbgac ccggbbcctc cbgggccaga aagaagcagg 3900 

cacabccccb bcbctgbgac acacccbgbc cacgcccctg gbbcbbagbb ccagccccac 3960 

bcataggaca cbcabagctc aggagggctc cgccbbcaab cccacccgcb aaagbacbbg 4020 

gagcggbcbc bcccbcccbc abcagcccac caaaccaaac ctagccbcca agagbgggaa 4 080 

gaaabtaaag caagabaggc babbaagtgc agagggagag aaaatgccbc caacabgbga 4140 

ggaagtaatg agagaaabca bagaabcbba aggccabgat bbaaggccat cabggccbta 4200 

abcbbccgcb bccbcgcfcca ctgacbcgcb gcgcbcggtc gbbcggctgc ggcgagcggb 4260 
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atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa 4 320 
gaacatgbga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc 4380 
gtttttccat aggcbccgcc cccctgacga gcatcacaaa aabcgacgcb caagtcagag 4440 
gtggcgaaac ccgacaggac tafcaaagata ccaggcgttt ccccctggaa gcbcccbcgb 4500 
gcgctctcct gttccgaccc tgccgcttac cggabaccbg bccgccbbbc tcccttcggg 4560 
aagcgtggcg cbbbcbcaba gctcacgctg baggbabctc agbbcggbgb aggtcgttcg 4620 
ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccbbabccgg 4680 
taactatcgt cttgagtcca acccggtaag acacgactta tcgccacbgg cagcagccac 4740 
tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagtbcb bgaagbggbg 4800 
gcctaactac ggctacacta gaagaacagt atttggtatc tgcgcbcbgc tgaagccagt 4860 
baccbbcgga aaaagagbbg gtagctcttg abccggcaaa caaaccaccg ctggtagcgg 4920 
tggttttttt gbtbgcaagc agcagattac gcgcagaaaa aaaggabcbc aagaagabcc 4980 
bttgabcbbb bcbacggggb cbgacgcbca gbggaacgaa aactcacgtt aagggatttt 504 0 
ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt bbaaabbaaa aatgaagttt 5100 
baaabcaabc taaagtatat atgagtaaac ttggtctgac agbtaccaab gcttaatcag 5160 
tgaggcacct abctcagcga tctgtctatt tcgttcatcc abagbbgccb gactcggggg 5220 
gggggggcgc bgaggbcbgc ctcgtgaaga aggbgbbgcb gactcatacc aggcctgaat 5280 
cgccccatca tccagccaga aagtgaggga gccacggttg atgagagctt tgttgtaggt 5340 
ggaccagbbg gtgattttga acttttgctt tgccacggaa cggtctgcgt bgbcgggaag 54 00 
abgcgbgabc bgabccbbca acbcagcaaa agttcgattt attcaacaaa gccgccgtcc 5460 
cgtcaagtca gcgtaatgct ctgccagtgt tacaaccaat taaccaattc tgattagaaa 5520 
aactcatcga gcatcaaatg aaactgcaat bbatbcabab caggattabc aataccatat 5580 
tfcttgaaaaa gccgtttctg taatgaagga gaaaactcac cgaggcagtt ccataggatg 5640 
gcaagatcct ggtatcggtc tgcgattccg actcgtccaa catcaataca acctatbaat 5700 
ttcccctcgt caaaaataag gttatcaagt gagaaatcac catgagtgac gactgaatcc 5760 
ggtgagaatg gcaaaagctt atgcatttct ttccagactt gttcaacagg ccagccatta 5820 
cgctcgtcat caaaatcact cgcatcaacc aaaccghtat tcattcgtga ttgcgccbga 5880 
gcgagacgaa atacgcgatc gctgttaaaa ggacaattac aaacaggaat cgaatgcaac 594 0 
cggcgcagga acactgccag cgcatcaaca atattttcac ctgaatcagg atattcttct 6000 
aatacctgga atgctgtttt cccggggatc gcagtggtga gtaaccatgc atcatcagga 6060 
gfcacggataa aatgcttgat ggtcggaaga ggcataaatt ccgtcagcca gtttagtctg 6120 
accatctcat ctgtaacatc attggcaacg ctacctttgc catgtttcag aaacaactct 6180 
ggcgcatcgg gcttcccata caatcgatag attgtcgcac ctgattgccc gacattatcg 6240 
cgagcccabt tatacccata taaatcagca tccatgttgg aatttaatcg cggcctcgag 6300 
caagacgttt cccgttgaat atggctcata acaccccttg tattactgtt tatgtaagca 6360 
gacagbttta btgbbcabga bgabababbb btatcbtgbg caabgbaaca bcagagatbt 6420 
bgagacacaa cgtggcbbbc cccccccccc cabbabbgaa gcatbtabca gggbbatbgt 6480 
cbcabgagcg gabacabatt bgaabgtabb bagaaaaaba aacaaabagg ggbtccgcgc 6540 
acabbbcccc gaaaagbgcc accbgacgtc baagaaacca bbabbabcab gacatbaacc 6600 
babaaaaaba ggcgbabcac gaggccctbb cgbc 6634 



<210> 30 
<211> 6538 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVRi012x/s conbaining HIV genes 
<400> 30 

bcgcgcgbbb cggbgabgac ggtgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgta cbgagagbgc 180 
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accatatgcg gtgbgaaata ccgcacagab gcgtaaggag aaaataccgc afccagattgg 240 

ctattggcca btgcatacgt tgtatccafca tcataatatg tacatttata ttggchcatg 300 

bccaacabba ccgccabgbb gacabbgatb attgactagt babbaatagb aatcaattac 360 

ggggtcatta gttcatagcc catababgga gttccgcgtt acataactta cggtaaatgg 420 

cccgcctggc tgaccgccca acgacccccg cccattgacg bcaabaabga cgbabgbtcc 4 80 

catagtaacg ccaataggga ctttccattg acgbcaabgg gbggagbabfc tacggtaaac 540 

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccba ttgacgbcaa 600 

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg acbttcctac 660 

ttggcagtac abcbacgbat tagtcatcgc tattaccatg gtgabgcggb tttggcagta 720 

catcaatggg cgtggatagc ggbbbgacbc acggggattt ccaagtctcc accccattga 780 

cgtcaatggg agbbbgbbbb ggcaccaaaa tcaacgggac bbbccaaaab gtcgtaacaa 840 

ctccgcccca ttgacgcaaa tgggcggbag gcgtgtacgg tgggaggtct atabaagcag 900 

agctcgtbba gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 

tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 

tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 

ta 99tgatgg tatagcttag cctataggtg tgggttattg accattabbg accactcccc 1200 

tattggtgac gabacbbtcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 

ggatggggtc ccabbbatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt abbaaacaba gcgbgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 

catgggctct bcbccggbag cggcggagct bccacatccg agcccbggtc ccatgcctcc 1500 

agcggcbcab ggbcgcbcgg cagcbccbtg ctcctaacag tggaggccag acttaggcac 1560 

agcacaabgc ccaccaccac cagtgbgccg cacaaggccg bggcggbagg gtatgtgtcb 1620 

gaaaabgagc gtggagabbg ggctcgcacg gctgacgcag atggaagact baaggcagcg 1680 

gcagaagaag atgcaggcag ctgagtbgbb gbattctgat aagagbcaga ggbaacbccc 1740 

gtbgcggtgc bgbbaacggb ggagggcagb gtagtctgag cagtacbcgt bgcbgccgcg 1800 

cgcgccacca gacataabag ctgacagacb aacagacbgt tccttbccat gggtctbbbc 1860 

tgcagtcacc gbcgtcgaca cgbgtgatca gatatcgcgg ccgcfccbaga atgcgcgtga 1920 

aggagaagta ccagcaccbg bggcgctggg gctggcgctg gggcaccatg ctgcbgggca 1980 

bgctgatgab cbgcagcgcc accgagaagc tgtgggbgac cgbgbacbac ggcgbgcccg 2040 

tgtggaagga ggccaccacc acccbgcbcb gcgccagcga cgccaaggcc bacgacaccg 2100 

aggtgcacaa cgbgtgggcc acccacgccb gcgbgcccac cgaccccaac ccccaggagg 2160 

bggbgcbggb gaacgbgacc gagaactbcg acabgbggaa gaacgacatg gtggagcaga 2220 

tgcacgagga catcabcagc ctgbgggacc agagcctgaa gccctgcgtg aagchgaccc 22 80 

cccbgbgcgb gagccbgaag tgcaccgacg cbagcaccag ctgcaacacc agcgbgabca 2340 

cccaggccbg ccccaaggbg agctbcgagc ccatccccaa ccactacbgc gcccccgccg 2400 

gcbbcgccab ccfcgaagbgc aaggacaaga agbbcaacgg caagggcccc bgcaccaacg 2460 

bgagcaccgb gcagbgcacc cacggcabcc gccccgtggb gagcacccag cbgctgctga 2520 

acggbagccb ggccgaggag gaggbggbga tccgcagcgc taacttcgcc gacaacgcca 2580 

aggtgatcab cgbgcagcbg aacgagagcg tggagatcaa ctgcacccgc gcbagcgccc 2640 

acbgcaaccb gagccgcgcc aagbggaacg acacccbgaa caagabcgtg abcaagcbgc 2700 

gcgagcagbb cggcaacaag accatcgbgt tcaagcacag cagcggcggc gaccccgaga 2760 

bcgtgaccca cagcbbcaac bgcggcggcg agtbcbbcta cbgcaacagc acccagctgb 2 820 

bcaacagcac cbggbbcaac agcaccbgga gcaccgaggg cagcaacaac accgagggca 2 880 

gcgacaccab cacccbgccc tgccgcabca agcagabcab caacatgbgg cagaaggbgg 2940 

gcaaggccab gbacgccccc cccabcagcg gccagatccg ctgcagcagc aacatcaccg 3000 

gccbgcfcgcb gacccgcgac ggcggcaaca gcaacaacga gagcgagabc tbccgbccgg 3060 

gcggcggcga catgcgcgac aactggcgca gcgagctgba caagbacaag gbggbgaaga 3120 

bcgagccccb gggcgbggcc cccaccaagg ccaagcbtac cgbccaggcc cgccagcbgc 3180 

bgagcggcab cgbgcagcag cagaacaacc fcgctgcgcgc cabcgaggcc cagcagcacc 3240 

tgcbgcagcb gaccgbgbgg ggcabcaagc agctgcaggc ccgcacccbg gccgtggagc 3300 

gcbaccbgaa ggaccagcag cagctcgagc agabcbggaa ccacaccacc bggatggagt 3360 

gggaccgcga gatcaacaac tacaccagcc tgabccacag ccbgabcgag gagagccaga 3420 

accagcagga gaagaacgag caggagcbgc tggagcbgga caagtgggcc agcctgbgga 3480 
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actggttcaa catcaccaac bggcbgbggb 
gccagccatc tgttgtttgc ccctcccccg 
ccactgtcct btccbaataa aatgaggaaa 
ctattctggg gggtggggtg gggcaggaca 
ggcabgcbgg ggatgcggtg ggctctatgg 
cctcchgggc cagaaagaag caggcacatc 
ccbggbbcbb agttccagcc ccactcatag 
caatcccacc cgctaaagta cttggagcgg 
aaacctagcc tccaagagbg ggaagaaatt 
agagaaaatg cctccaacat gbgaggaagb 
tgatttaagg ccatcabggc cttaatcttc 
ggtcgbtcgg ctgcggcgag cggtatcagc 
agaatcaggg gataacgcag gaaagaacat 
ccgtaaaaag gccgcgttgc tggcgttttt 
caaaaatcga cgchcaagtc agaggtggcg 
gttbcccccb ggaagctccc tcgbgcgctc 
cctgtccgcc tttctccctt cgggaagcgt 
tctcagttcg gtgtaggtcg ttcgctccaa 
gcccgaccgc tgcgccttat ccggtaacta 
cttatcgcca ctggcagcag ccactggtaa 
tgctacagag ttcttgaagt ggtggcctaa 
tatctgcgct ctgctgaagc cagttacctt 
caaacaaacc accgcbggba gcggbggbbb 
aaaaaaagga tctcaagaag atcctttgat 
cgaaaactca cgbbaaggga ttttggtcat 
ccttbtaaab baaaaatgaa gthttaaatc 
tgacagbbac caatgcttaa tcagtgaggc 

atccatagtt gcctgactcg gggggggggg 

tgctgacbca taccaggccb gaatcgcccc 
gttgatgaga gctttgttgt aggfcggacca 
ggaacggtct gcgttgtcgg gaagatgcgt 
abtbabbcaa caaagccgcc gtcccgtcaa 
caattaacca attctgatta gaaaaactca 
ababcaggab tatcaatacc abatbbbbga 
tcaccgaggc agbtccabag gatggcaaga 
ccaacatcaa bacaacctab taatttcccc 
tcaccatgag tgacgactga atccggtgag 
actbghtcaa caggccagcc attacgctcg 
ttattcattc gtgattgcgc ctgagcgaga 
ttacaaacag gaatcgaatg caaccggcgc 
tcacctgaat caggatattc bbcbaabacc 
gtgagtaacc atgcatcatc aggagtacgg 
aattccgtca gccagbtfcag tctgaccatc 
ttgccatgtt tcagaaacaa ctctggcgca 
gcacctgatt gcccgacatt atcgcgagcc 
bbggaabbba abcgcggccb cgagcaagac 
cttgtattac tgtttatgta agcagacagt 
bgtgcaatgt aacabcagag abfcbbgagac 
tgaagcattt atcagggtta bbgtcbcabg 
aataaacaaa baggggtbcc gcgcacattt 
accattatta tcatgacatt aacctataaa 



gaggatccag atctgctgtg ccttctagtt 3540 
tgccttcctt gaccctggaa ggtgccactc 3600 
ttgcatcgca ttgtctgagt aggtgtcatt 3660 
gcaaggggga ggattgggaa gacaatagca 3720 
gbacccaggb gctgaagaat bgacccggtt 3780 
cccbbctcbg tgacacaccc tgtccacgcc 3840 
gacactcata gctcaggagg gctccgcctt 3900 
tctctccctc cctcatcagc ccaccaaacc 3960 
aaagcaagat aggctattaa gtgcagaggg 4020 
aatgagagaa atcatagaat tttaaggcca 4080 
cgcttcctcg ctcactgact cgctgcgctc 4140 
tcactcaaag gcggtaatac ggttatccac 4200 
gtgagcaaaa ggccagcaaa aggccaggaa 4260 
ccataggctc cgcccccctg acgagcatca 4320 
aaacccgaca ggactataaa gataccaggc 4380 
tcctgttccg accctgccgc ttaccggata 4440 
ggcgctttct catagctcac gctgtaggta 4500 
gctgggctgb. gtgcacgaac cccccgttca 4 560 
fccgtcttgag tccaacccgg taagacacga 4620 
caggafctagc agagcgaggt atgtaggcgg 4 68 0 
ctacggctac actagaagaa cagtatttgg 474 0 
cggaaaaaga gttggtagct cttgatccgg 4800 
ttttgtttgc aagcagcaga ttacgcgcag 4860 
cttttctacg gggtctgacg ctcagtggaa 4 92 0 
gagattatca aaaaggatct tcacctagat 498 0 
aafcctaaagt atatatgagt aaacttggtc 504 0 
acctabctca gcgatctgtc tatttcgttc 5100 
gcgctgaggt ctgcctcgtg aagaaggtgfc 5160 
atcatccagc cagaaagtga gggagccacg 5220 
gttggtgatt ttcjaactttt gctttgccac 5280 
gatctgatcc ttcaactcag caaaagttcg 5340 
gtcagcgtaa tgctctgcca gtgttacaac 5400 
tcgagcatca aatgaaactg caatttattc 5460 
aaaagccgtt tctgtaatga aggagaaaac 5520 
tcctggtatc ggtctgcgat tccgactcgt 5580 
tcgtcaaaaa taaggttatc aagtgagaaa 5640 
aatggcaaaa gcttatgcat ttctttccag 5700 
tcatcaaaat cactcgcatc aaccaaaccg 5760 
cgaaatacgc gatcgctgtt aaaaggacaa 5820 
aggaacactg ccagcgcatc aacaatattt 5880 
tggaatgctg ttttcccggg gatcgcagtg 594 0 
ataaaatgcb bgabggbcgg aagaggcata 6000 
bcabctgtaa catcatbggc aacgctacct 6060 
bcgggcttcc cabacaabcg abagatbgbc 6120 
cabtbatacc cababaaabc agcabccabg 6180 
gbbtcccgbb gaabatggct cabaacaccc 6240 
tbbatbgtbc abgatgabat atttttabct 6300 
acaacgtggc bbtccccccc cccccabtab 6360 
agcggabaca batbbgaabg babttagaaa 6420 
ccccgaaaag bgccaccbga cgtcbaagaa 6480 
aabaggcgta bcacgaggcc cbbbcgbc 6538 
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<211> 6541 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVRl012x/s containing HIV genes 



<400> 31 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 60 0 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 90 0 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagfccacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga atgcgcgtga 1920 
aggagaagta ccagcacctg tggcgctggg gctggcgctg gggcaccatg ctgctgggca 1980 
tgctgatgat ctgcagcgcc accgagaagc tgtgggtgac cgtgtactac ggcgtgcccg 2040 
tgtggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc tacgacaccg 2100 
aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
tggtgctggt gaacgtgacc gagaacttcg acatgtggaa gaacgacatg gtggagcaga 2220 
tgcacgagga catcatcagc ctgtgggacc agagcctgaa gccctgcgtg aagctgaccc 2280 
ccctgtgcgt gagcctgaag tgcaccgacg ctagcaccag ctgcaacacc agcgtgatca 2340 
cccaggcctg ccccaaggtg agcttcgagc ccatccccaa ccactactgc gcccccgccg 2400 
gcttcgccat cctgaagtgc aaggacaaga agttcaacgg caagggcccc tgcaccaacg 2460 
tgagcaccgt gcagtgcacc cacggcatcc gccccgtggt gagcacccag ctgctgctga 2520 
acggtagcct ggccgaggag gaggtggtga tccgcagcgc taacttcgcc gacaacgcca 2580 
aggtgatcat cgtgcagctg aacgagagcg tggagatcaa ctgcacccgc cccaacaaca 2640 
acacccgcaa gagcatccac atcggccccg gccgcgcctt ctacaccacc ggcgagatca 2700 
tcggcgacat ccgccaggcc cactgcaacc tgagccgcgc caagtggaac gacaccctga 2760 
acaagatcgt gatcaagctg cgcgagcagt tcggcaacaa gaccatcgtg ttcaagcaca 2820 
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gcagcggcgg cgaccccgag atcgtgaccc 
acbgcaacag cgctagcctg ccctgccgca 
tgggcaaggc catgtacgcc ccccccatca 
ccggcctgct gctgacccgc gacggcggca 
cgggcggcgg cgacatgcgc gacaactggc 
agatcgagcc cctgggcgbg gcccccacca 
tgctgagcgg catcgtgcag cagcagaaca 
accbgcbgca gctgaccgtg tggggcatca 
agcgctacct gaaggaccag cagcagctcg 
agtgggaccg cgagabcaac aactacacca 
agaaccagca ggagaagaac gagcaggagc 
ggaactggtt caacatcacc aacbggcbgt 
gttgccagcc atctgttgtt tgcccctccc 
ctcccactgt cctttcctaa taaaatgagg 
attctattct ggggggtggg gtggggcagg 
gcaggcatgc tggggatgcg gbgggcbcta 
gttcctcctg ggccagaaag aagcaggcac 
gcccctggtb cbbagbbcca gccccactca 
cbtcaabccc acccgctaaa gtacttggag 
accaaaccta gcctccaaga gbgggaagaa 
gggagagaaa atgcctccaa catgtgagga 
ccatgattta aggccatcat ggccttaatc 
ctcggbcgbb cggctgcggc gagcggtatc 
cacagaatca ggggataacg caggaaagaa 
gaaccgtaaa aaggccgcgt tgctggcgbb 
tcacaaaaat cgacgctcaa gtcagaggtg 
ggcgtttccc cctggaagct ccctcgtgcg 
abaccbgtcc gcctttctcc cttcgggaag 
gbatctcagt tcggtgtagg tcgttcgctc 
tcagcccgac cgctgcgcct tatccggtaa 
cgacttatcg ccacbggcag cagccactgg 
cggtgctaca gagttcttga agtggtggcc 
tggtatctgc gctctgctga agccagttac 
cggcaaacaa accaccgctg gtagcggtgg 
cagaaaaaaa ggatctcaag aagafcccttt 
gaacgaaaac tcacgttaag ggattttggt 
gatcctttta aabtaaaaat gaagbbttaa 
gbcbgacagb taccaatgct taatcagtga 
ttcatccata gttgcctgac tcgggggggg 
tgbbgcbgac tcataccagg cctgaatcgc 
acggttgatg agagctttgt tgtaggtgga 
cacggaacgg tctgcgttgt cgggaagatg 
bcgatbbabt caacaaagcc gccgtcccgt 
aaccaattaa ccaattctga ttagaaaaac 
ttcatatcag gattatcaat accatatbtt 
aactcaccga ggcagbbcca taggatggca 
cgtccaacat caabacaacc tattaatttc 
aaatcaccat gagtgacgac tgaatccggt 
cagacbtgtt caacaggcca gccattacgc 
ccgbtabbca ttcgtgattg cgcctgagcg 
caattacaaa caggaatcga atgcaaccgg 
ttbbcacctg aatcaggata ttcttctaat 
gtggtgagta accatgcatc atcaggagta 
ataaattccg bcagccagbt tagtctgacc 
ccbbbgccab gbbbcagaaa caactctggc 



acagcttcaa ctgcggcggc gagttcttct 2 880 
tcaagcagat catcaacatg tggcagaagg 2 940 
gcggccagab ccgctgcagc agcaacatca 3000 
acagcaacaa cgagagcgag atcttccgtc 3060 
gcagcgagct gtacaagtac aaggtggtga 3120 
aggccaagct taccgtccag gcccgccagc 3180 
accbgctgcg cgccatcgag gcccagcagc 3240 
agcagctgca ggcccgcacc ctggccgtgg 3300 
agcagatctg gaaccacacc accbggabgg 3360 
gcctgatcca cagcctgatc gaggagagcc 3420 
tgctggagct ggacaagtgg gccagcctgt 34 80 
ggtgaggatc cagatctgct gtgccttcta 3540 
ccgtgccttc cttgaccctg gaaggtgcca 3600 
aaattgcatc gcattgtctg agtaggtgtc 3660 
acagcaaggg ggaggattgg gaagacaata 3720 
tgggtaccca ggtgctgaag aattgacccg 37B0 
atccccttct ctgtgacaca ccctgtccac 3840 
taggacactc atagctcagg agggctccgc 3900 
cggtctctcc ctccctcatc agcccaccaa 3960 
attaaagcaa gataggctat taagtgcaga 4020 
agbaatgaga gaaatcatag aabbtbaagg 4080 
ttccgctbcc bcgcbcacbg actcgctgcg 4140 
agcbcacbca aaggcggbaa bacggttatc 4200 
cabgtgagca aaaggccagc aaaaggccag 4260 
bfcbccabagg ctccgccccc cbgacgagca 4320 
gcgaaacccg acaggacfcat aaagabacca 4380 
ctcbcctgtb ccgaccctgc cgcttaccgg 4440 
cgbggcgctb bctcabagct cacgcbgtag 45 00 
caagcbgggc tgtgtgcacg aaccccccgt 4560 
ctabcgbctb gagbccaacc cggbaagaca 4620 
taacaggatb agcagagcga ggtabgbagg 46 80 
taactacggc bacacbagaa gaacagbatt 4740 
cbbcggaaaa agagtbggta gcbcttgatc 4800 
tbbtbbtgbb bgcaagcagc agatbacgcg 4860 
gabcbtbtct acggggtctg acgcbcagtg 4920 
catgagatfca bcaaaaagga tcttcaccta 4980 
abcaabctaa agbatatatg agbaaacbtg 5040 
ggcaccbabc tcagcgabct gbctatttcg 5100 
ggggcgcbga ggbctgccbc gbgaagaagg 5160 
cccatcabcc agccagaaag tgagggagcc 5220 
ccagbtggbg attbbgaacb ttbgcbttgc 5280 
cgbgabctga bcctbcaact cagcaaaagb 5340 
caagbcagcg baabgctcbg ccagbgttac 5400 
tcatcgagca tcaaabgaaa ctgcaattta 5460 
tgaaaaagcc gbttctgtaa tgaaggagaa 5520 
agatccbggt abcggtcbgc gabbccgacb 5580 
ccctcgbcaa aaabaaggtb abcaagtgag 5640 
gagaatggca aaagcbtatg catbtctbbc 5700 
tcgbcabcaa aatcacbcgc abcaaccaaa 5760 
agacgaaata cgcgatcgct gttaaaagga 5820 
cgcaggaaca cbgccagcgc atcaacaaba 5880 
accbggaatg cbgtbttccc ggggatcgca 5940 
cggataaaat gcttgabggt cggaagaggc 6000 
abctcatctg baacatcatb ggcaacgcba 6060 
gcabcgggcb tcccatacaa bcgabagatt 6120 
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gtcgcacctg abbgcccgac abbabcgcga gcccattbat acccatataa atcagcatcc 6180 
abgttggaat ttaatcgcgg ccbcgagcaa gacgbtbccc gttgaatatg gctcataaca 6240 
ccccbtgtat tactgtttat gtaagcagac agbtbtattg btcabgabga batabbbbba 6300 
tcttgtgcaa tgtaacatca gagabbbbga gacacaacgt ggcbbbcccc ccccccccat 6360 
tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag 6420 
aaaaataaac aaabaggggt tccgcgcaca tttccccgaa aagtgccacc tgacgtctaa 6480 
gaaaccabba ttatcatgac attaacctat aaaaabaggc gbatcacgag gcccbbbcgb 6540 
c 6541 



<210> 32 
<211>. 6577 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 32 

bcgcgcgbbt cggbgatgac ggbgaaaacc bctgacacat gcagctcccg gagacggbca 60 
cagctbgbct gbaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
tbggcgggtg bcggggctgg cbbaacbabg cggcabcaga gcagattgta cbgagagbgc 180 
accabatgcg gbgbgaaata ccgcacagab gcgbaaggag aaaataccgc atcagabbgg 240 
cbabbggcca tbgcabacgb tgbabccaba bcabaabatg bacabttaba tbggcbcabg 300 
tccaacatta ccgccabgtb gacabtgabt abtgacbagt tattaatagb aabcaabbac 360 
ggggbcabba gbbcatagcc cabatatgga gbtccgcgbb acataacbba cggbaaabgg 420 
cccgccbggc tgaccgccca acgacccccg cccabtgacg bcaabaabga cgbabgbbcc 480 
cabagbaacg ccaabaggga ctbbccabbg acgtcaatgg gbggagbabb tacggbaaac 540 
tgcccacbbg gcagtacatc aagbgbabca babgccaagb acgcccccba tbgacgbcaa 600 
tgacggbaaa bggcccgccb ggcabbabgc ccagbacatg accttabggg acttbccbac 660 
tbggcagtac atctacgbab tagbcatcgc battaccatg gbgatgcggb tbbggcagba 720 
cabcaabggg cgtggabagc ggtbbgactc acggggabbb ccaagtctcc accccabtga 78 0 
cgtcaabggg agbttgtbtb ggcaccaaaa tcaacgggac btbccaaaab gtcgbaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggtcb atabaagcag 900 
agcbcgbbba gbgaaccgtc agabcgccbg gagacgccab ccacgctgtb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcabtg gaacgcggat 1020 
bccccgbgcc aagagbgacg baagbaccgc cbabagacbc tabaggcaca ccccbbbggc 1080 
tctbabgcab gcbabactgt bbtbggcbtg gggccbatac acccccgcbt ccbtabgcba 1140 
taggbgabgg babagctbag cctabaggtg tgggtbabbg accabbabbg accacbcccc 1200 
babbggbgac gabacttbcc abbacbaabc cabaacabgg ctcbbtgcca caactabctc 1260 
babbggcbat abgccaabac bctgbccbbc agagacbgac acggactcbg babbbtbaca 1320 
ggabggggbc ccatbbatba tbbacaaabt cacatabaca acaacgccgb cccccgbgcc 1380 
cgcagbtbbb atbaaacata gcgtgggabc tccacgcgaa tcbcgggbac gbgbbccgga 1440 
cabgggcbcb tctccggbag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
agcggctcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbatgbgbct 1620 
gaaaabgagc gbggagattg ggcbcgcacg gcbgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgtt gbabbcbgab aagagtcaga ggbaactccc 1740 
gbtgcggbgc bgbbaacggb ggagggcagb gbagbcbgag cagtacbcgb bgcbgccgcg 1800 
cgcgccacca gacataabag cbgacagacb aacagacbgb bccbbtccab gggtcbtbbc 1860 
tgcagbcacc gbcgbcgaca cgbgbgabca gababcgcgg ccgcbcbaga abgcgcgbga 1920 
aggagaagta ccagcaccbg bggcgcbggg gcbggcgcbg gggcaccabg ctgcbgggca 1980 
tgcbgabgat cbgcagcgcc accgagaagc bgbgggtgac cgbgbacbac ggcgbgcccg 2040 
bgbggaagga ggccaccacc accctgctcb gcgccagcga cgccaaggcc bacgacaccg 2100 
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aggtgcacaa cgtgtgggcc acccacgcct 
tggbgcbggb gaacgtgacc gagaacttcg 
tgcacgagga catcatcagc cbgbgggacc 
ccctgtgcgt gagccbgaag tgcaccgacg 
ccagcatccg cggcaaggtg cagaaggagt 
ccatcgacaa cgacaccacc agctacagcc 
aggcctgccc caaggtgagc ttcgagccca 
bcgccabccb gaagtgcaag gacaagaagt 
gcaccgtgca gbgcacccac ggcatccgcc 
gtagcctggc cgaggaggag gtggtgatcc 
tgatcatcgt gcagctgaac gagagcgbgg 
gcaacctgag ccgcgccaag tggaacgaca 
agcagbbcgg caacaagacc atcgfcgttca 
tgacccacag cttcaactgc ggcggcgagt 
gccgcatcaa gcagatcatc aacatgbggc 
ccabcagcgg ccagabccgc tgcagcagca 
gcggcaacag caacaacgag agcgagabcb 
actggcgcag cgagctgtac aagtacaagg 
ccaccaaggc caagcttacc gtccaggccc 
agaacaacct gctgcgcgcc atcgaggccc 
gcatcaagca gctgcaggcc cgcacccbgg 
agctcgagca gatctggaac cacaccacct 
acaccagccfc gatccacagc ctgatcgagg 
aggagctgct ggagctggac aagbgggcca 
ggctgtggtg aggatccaga tctgctgtgc 
cctcccccgb gccttccttg accctggaag 
atgaggaaat tgcatcgcat tgtctgagta 
ggcaggacag caagggggag gattgggaag 
gctctabggg tacccaggtg ctgaagaatt 
aggcacatcc cctbcbctgb gacacaccct 
cactcabagg acacbcatag ctcaggaggg 
ttggagcggt cbcbcccbcc cbcabcagcc 
gaagaaatta aagcaagafca ggctabtaag 
fcgaggaagta atgagagaaa tcatagaatt 
ttaabcbbcc gcttcctcgc tcactgactc 
ggtatcagct cactcaaagg cggtaatacg 
aaagaacatg tgagcaaaag gccagcaaaa 
ggcgtttttc cataggctcc gcccccctga 
gaggtggcga aacccgacag gacbabaaag 
cgtgcgchct cctgbtccga cccbgccgcb 
gggaagcgtg gcgctttctc abagcbcacg 
tcgctccaag ctgggctgtg tgcacgaacc 
cggtaactat cgtcbbgagb ccaacccggt 
cactggtaac aggabtagca gagcgaggta 
gtggcctaac tacggctaca ctagaagaac 
agttaccttc ggaaaaagag ttggtagctc 
cggtggbttb bbtgbbbgca agcagcagat 
bccbbbgabc bbbbcbacgg ggtctgacgc 
tttggtcatg agattatcaa aaaggatctt 
ttttaaatca atctaaagta tatafcgagta 
cagtgaggca ccfcatctcag cgabcbgbcb 
gggggggggg cgctgaggtc tgcctcgtga 
aatcgcccca tcatccagcc agaaagtgag 
ggtggaccag bbggbgatbb tgaacttttg 
aagatgcgtg atctgatcct tcaactcagc 



gcgtgcccac cgaccccaac ccccaggagg 2160 
acatgtggaa gaacgacatg gtggagcaga 2220 
agagcctgaa gccctgcgtg aagcbgaccc 2280 
ctagcaagaa ctgcagcttc aacabcagca 2340 
acgccbbctt cbacaagcbg gacatcatcc 2400 
tgaccagctg caacaccagc gtgatcaccc 2460 
tccccaacca ctactgcgcc cccgccggct 2520 
tcaacggcaa gggcccctgc accaacgtga 2580 
ccgtggtgag cacccagctg ctgctgaacg 2640 
gcagcgctaa cttcgccgac aacgccaagg 2700 
agatcaactg cacccgcgct agcgcccact 2760 
ccctgaacaa gatcgtgatc aagctgcgcg 282Q 
agcacagcag cggcggcgac cccgagatcg 2880 
tcttctactg caacagcgct agcctgccct 2940 
agaaggtggg caaggccatg tacgcccccc 3 000 
acatcaccgg cctgctgctg acccgcgacg 3060 
tccgtccggg cggcggcgac atgcgcgaca 3120 
tggtgaagat cgagcccctg ggcgtggccc 3180 
gccagctgct gagcggcatc gtgcagcagc 3240 
agcagcacct gctgcagctg accgtgtggg 3300 
ccgtggagcg ctacctgaag gaccagcagc 3360 
ggatggagtg ggaccgcgag atcaacaact 3420 
agagccagaa ccagcaggag aagaacgagc 34 80 
gcctgtggaa ctggttcaac atcaccaact 3540 
cttctagttg ccagccatct gttgtttgcc 3600 
ghgccactcc cactgtcctt tcctaataaa 3660 
ggtgtcattc tattctgggg ggtggggtgg 3720 
acaatagcag gcatgccggg gatgcggtgg 3780 
gacccggttc ctcctgggcc agaaagaagc 3840 
gtccacgccc ccggttctta gfctccagccc 3900 
ctccgccttc aatcccaccc gctaaagtac 3960 
caccaaacca aacctagcct ccaagagtgg 4020 
tgcagaggga gagaaaatgc ctccaacatg 4080 
ttaaggccat gatttaaggc catcatggcc 4140 
gctgcgctcg gtcgtfccggc tgcggcgagc 42 00 
gttatccaca gaabcagggg ataacgcagg 4260 
ggccaggaac cgbaaaaagg ccgcgbbgcb 4320 
cgagcabcac aaaaatcgac gcbcaagbca 43 BO 
ataccaggcg bbbcccccbg gaagctcccb 4440 
taccggabac cbgbccgccb bbctcccbbc 4500 
cbgtaggbab cbcagbtcgg bgbaggtcgb 4560 
ccccgbbcag cccgaccgcb gcgcctbabc 4620 
aagacacgac bbabcgccac bggcagcagc 4680 
tgbaggcggb gctacagagb bcbbgaagbg 4740 
agbabbbggb abcbgcgcbc bgctgaagcc 4800 
ttgabccggc aaacaaacca ccgcbggbag 4860 
bacgcgcaga aaaaaaggab cbcaagaaga 4920 
bcagbggaac gaaaacbcac gbbaagggab 4980 
caccbagabc cbbbbaaatb aaaaabgaag 5040 
aacbbggtcb gacagbtacc aabgcbbaab 5100 
abbbcgbbca bccabagbbg ccbgacbcgg 5160 
agaaggbgbt gcbgacbcab accaggccbg 5220 
ggagccacgg bbgatgagag cbbbgbtgta 5280 
cbtbgccacg gaacggbctg cgbbgbcggg 5340 
aaaagbtcga btbabbcaac aaagccgccg 5400 
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tcccgtcaag tcagcgtaat gcbcbgccag tgbbacaacc aabbaaccaa ttctgattag 5460 
aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt abcaabacca 5520 
babbbbbgaa aaagccgtbb ctgtaatgaa ggagaaaact caccgaggca gttccatagg 5580 
atggcaagat cctggtatcg gbcbgcgabb ccgactcgtc caacatcaat acaacctatt 5640 
aatttcccct cgtcaaaaat aaggbbabca agtgagaaat caccatgagt gacgactgaa 5700 
bccggbgaga abggcaaaag cbbabgcabb tcttfcccaga ctbgbbcaac aggccagcca 5760 
bbacgcbcgb cabcaaaabc actcgcafcca accaaaccgt tabbcabtcg bgabtgcgcc 5820 
tgagcgagac gaaatacgcg abcgcbgbba aaaggacaat tacaaacagg aatcgaatgc 5880 
aaccggcgca ggaacacbgc cagcgcatca acaababtbb cacctgaatc aggatabtcb 5940 
tctaatacct ggaatgctgt tttcccgggg atcgcagtgg tgagtaacca bgcatcatca 6000 
ggagtacgga taaaatgctt gatggtcgga agaggcataa abbccgbcag ccagtttagt 6060 
ctgaccatct cabcbgbaac atcabtggca acgctacctt bgccabgbtt cagaaacaac 6120 
tctggcgcat cgggctbccc atacaatcga bagabbgbcg cacctgattg cccgacatba 6180 
tcgcgagccc atttataccc atataaatca gcatccatgt bggaabtbaa tcgcggccbc 6240 
gagcaagacg bbbcccgbbg aatatggctc ataacacccc bbgbabbacb gtttatgtaa 6300 
gcagacagtt ttattgttca tgatgatata bbbbbabcbt gtgcaatgta acatcagaga 6360 
ttttgagaca caacgtggct ttcccccccc ccccattatt gaagcattba tcagggbtab 6420 
tgbcbcabga gcggatacab attbgaatgb abbtagaaaa ataaacaaab aggggbbccg 6480 
cgcacabtbc cccgaaaagb gccacctgac gbctaagaaa ccabbabtab cabgacabta 6540 
accbataaaa ataggcgbab cacgaggccc bbbcgbc 6577 



<210> 33 
<211> 6538 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
c400> 33 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagctbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggtg 12 0 
tbggcgggtg bcggggcbgg cbtaacbabg cggcabcaga gcagabbgba ctgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagat gcgbaaggag aaaabaccgc atcagabbgg 24 0 
cbabbggcca bbgcabacgb bgbabccaba bcataabatg bacatbbaba btggcbcabg 300 
bccaacabba ccgccatgbb gacabtgabb abbgacbagb tabbaabagt aabcaabbac 360 
ggggbcabba gbbcabagcc catababgga gbbccgcgtb acabaacbta cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg tcaataabga cgbabgbbcc 480 
cabagbaacg ccaabaggga cbtbccatbg acgbcaatgg gbggagbabb bacggbaaac 540 
bgcccacbtg gcagbacabc aagbgbatca babgccaagb acgcccccba btgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbatgc ccagbacabg accbbabggg acbbbccbac 660 
bbggcagbac abcbacgbab bagbcabcgc babbaccabg gbgabgcggt btbggcagba 720 
cabcaabggg cgbggabagc ggtbtgacbc acggggatbb ccaagbcbcc accccabbga 780 
cgbcaabggg agbtbgtbtb ggcaccaaaa bcaacgggac ttbccaaaab gtcgbaacaa 84 0 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg tgggaggtct atabaagcag 900 
agcbcgbbba gbgaaccgbc agabcgccbg gagacgccab ccacgctgbb btgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcatbg gaacgcggab 1020 
bccccgbgcc aagagbgacg baagbaccgc cbabagacbc bataggcaca ccccbbbggc 1080 
bcbbatgcab gcbabacbgb bbbbggcbbg gggcctabac acccccgcbt ccbbabgcba 1140 
baggbgabgg batagctbag ccbabaggbg bgggtbatbg accabtatbg accactcccc 1200 
babbggbgac gatacbbbcc abbacbaabc cabaacabgg cbcbbtgcca caacbabcbc 1260 
babbggcbab abgccaatac tcbgbcctbc agagacbgac acggacbcbg babbbbbaca 1320 
ggabggggbc ccabbbabba tbbacaaabb cacababaca acaacgccgb cccccgbgcc 1380 
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cgcagttttt attaaacata gcgtgggatc 
catgggctct tctccggtag cggcggagct 
agcggctcat ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gtcgtcgaca cgtgtgatca 
aggagaagta ccagcacctg tggcgctggg 
tgctgatgat ctgcagcgcc accgagaagc 
tgtggaagga ggccaccacc accctgctct 
aggtgcacaa cgtgtgggcc acccacgcct 
tggtgctggt gaacgtgacc gagaacttcg 
tgcacgagga catcatcagc ctgtgggacc 
ccctgtgcgt gagcctgaag tgcaccgacc 
gcggccgcat gatcatggag aagggcgaga 
gcaacaccag cgtgatcacc caggcctgcc 
actacfcgcgc ccccgccggc ttcgccatcc 
agggcccctg caccaacgtg agcaccgtgc 
gcacccagct gctgctgaac ggtagcctgg 
acttcgccga caacgccaag gtgatcatcg 
gcacccgcgc tagcgcccac tgcaacctga 
agatcgtgat caagctgcgc gagcagttcg 
gcggcggcga ccccgagatc gtgacccaca 
gcaacagcgc tagcctgccc tgccgcatca 
gcaaggccat gtacgccccc cccatcagcg 
gcctgctgct gacccgcgac ggcggcaaca 
gcggcggcga catgcgcgac aactggcgca 
tcgagcccct gggcgtggcc cccaccaagg 
tgagcggcat cgtgcagcag cagaacaacc 
tgctgcagct gaccgtgtgg ggcatcaagc 
gctacctgaa ggaccagcag cagctcgagc 
gggaccgcga gatcaacaac tacaccagcc 
accagcagga gaagaacgag caggagctgc 
actggttcaa catcaccaac tggctgtggt 
gccagccatc tgttghttgc ccctcccccg 
ccactgtcct ttcctaataa aatgaggaaa 
ctattctggg gggtggggtg gggcaggaca 
ggcatgctgg ggatgcggtg ggctctatgg 
cctcctgggc cagaaagaag caggcacatc 
cctggttctt agttccagcc ccactcatag 
caatcccacc cgctaaagta cttggagcgg 
aaacctagcc tccaagagtg ggaagaaatt 
agagaaaafcg cctccaacat gtgaggaagt 
tgatttaagg ccatcatggc cttaatctfcc 
ggtcgttcgg ctgcggcgag cggtatcagc 
agaatcaggg gataacgcag gaaagaacat 
ccgtaaaaag gccgcgttgc tggcgttttt 
caaaaatcga cgctcaagtc agaggtggcg 
gtttccccct ggaagctccc tcgtgcgctc 
cctgtccgcc tttccccctt cgggaagcgt 
tctcagttcg gtgtaggtcg ttcgctccaa 
gcccgaccgc tgcgccttat ccggtaacta 
cttatcgcca ctggcagcag ccactggtaa 



tccacgcgaa tctcgggtac gtgttccgga 1440 
tccacatccg agccctggtc ccatgcctcc 1500 
ctcctaacag tggaggccag acttaggcac 1560 
cacaaggccg tggcggtagg gtafcgtgtct 1620 
gctgacgcag afcggaagacb taaggcagcg 1680 
gtattctgat aagagtcaga ggtaactccc 1740 
gtagtctgag cagtactcgt tgctgccgcg 1800 
aacagactgt tcctttccat gggtcttttc 1860 
gatatcgcgg ccgctctaga atgcgcgtga 1920 
gctggcgcfcg gggcaccatg ctgctgggca 1980 
tgtgggtgac cghgtactac ggcgtgcccg 2 04 0 
gcgccagcga cgccaaggcc tacgacaccg 2100 
gcgtgcccac cgaccccaac ccccaggagg 2160 
acatgtggaa gaacgacatg gtggagcaga 2220 
agagcctgaa gccctgcgtg aagctgaccc 2280 
tgaagaacga caccaacacc aacagcagca 2340 
tcaagaactg cagcttcgct agcaccagct 2400 
ccaaggtgag cttcgagccc atccccaacc 2460 
tgaagtgcaa ggacaagaag ttcaacggca 252 0 
agtgcaccca cggcatccgc cccgtggtga 25 80 
ccgaggagga ggtggtgatc cgcagcgcta 264 0 
tgcagctgaa cgagagcgtg gagatcaact 2700 
gccgcgccaa gtggaacgac accctgaaca 2760 
gcaacaagac catcgtgttc aagcacagca 2820 
gcttcaactg cggcggcgag ttcttctact 2880 
agcagatcat caacatgtgg cagaaggtgg 2 940 
gccagafcccg ctgcagcagc aacatcaccg 3000 
gcaacaacga gagcgagatc ttccgtccgg 3060 
gcgagctgta caagtacaag gtggtgaaga 3120 
ccaagcttac cgtccaggcc cgccagctgc 3180 
tgctgcgcgc catcgaggcc cagcagcacc 3240 
agctgcaggc ccgcacccfcg gccgtggagc 3300 
agatctggaa ccacaccacc tggatggagt 33 60 
tgatccacag cctgatcgag gagagccaga 3420 
tggagctgga caagtgggcc agcctgtgga 34 80 
gaggatccag atctgctgtg ccttctagtt 3540 
tgccttcctt gaccctggaa ggtgccactc 3600 
ttgcatcgca ttgtctgagt aggtgtcatfc 3660 
gcaaggggga ggattgggaa gacaatagca 3720 
gtacccaggt gctgaagaat tgacccggtt 3780 
cccttctctg tgacacaccc tgtccacgcc 3840 
gacactcata gctcaggagg gctccgcctt 3900 
tctctccctc cctcatcagc ccaccaaacc 3960 
aaagcaagat aggctattaa gtgcagaggg 4 020 
aatgagagaa atcatagaafc tttaaggcca 4080 
cgcttcctcg ctcactgact cgctgcgctc 4140 
tcactcaaag gcggtaatac ggttatccac 4200 
gbgagcaaaa ggccagcaaa aggccaggaa 4260 
ccataggctc cgcccccctg acgagcatca 4320 
aaacccgaca ggactataaa gataccaggc 4380 
tcctgttccg accctgccgc ttaccggata 4440 
ggcgctttct catagctcac gctgtaggta 4500 
gctgggctgt gtgcacgaac cccccgttca 4560 
tcgtcttgag bccaacccgg taagacacga 4620 
caggattagc agagcgaggt atgtaggcgg 4680 
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tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 4740 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 4800 
caaacaaacc accgctggta gcggtggtbt ttbbgttbgc aagcagcaga ttacgcgcag 4 860 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 4920 
cgaaaactca cgttaaggga ttbtggtcat gagattatca aaaaggatct tcacctagat 4980 
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 5040 
tgacagttac caafcgcttaa bcagtgaggc acctatctca gcgatchgtc tatttcgttc 5100 
atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg aagaaggtgt 5160 
tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga gggagccacg 5220 
gttgatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt gctttgccac 5280 
ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag caaaagtbcg 5340 
atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca gtgttacaac 5400 
caattaacca attctgatta gaaaaacfcca tcgagcatca aatgaaactg caatttattc 5460 
atatcaggat tatcaatacc atattbttga aaaagccgtt tctgtaatga aggagaaaac 5520 
tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat tccgactcgt 5580 
ccaacatcaa tacaacctat fcaatttcccc bcgtcaaaaa taaggttatc aagtgagaaa 5640 
tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat ttctttccag 57 00 
acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc aaccaaaccg 5760 
ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt aaaaggacaa 5820 
ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc aacaatattt 5880 
tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg gatcgcagtg 5940 
gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggbcgg aagaggcaba 6000 
aatbccgtca gccagtttag tctgaccatc tcatctgtaa cabcattggc aacgctacct 6060 
ttgccatgtt tcagaaacaa cbctggcgca bcgggcttcc catacaatcg abagatfcgbc 6120 
gcacctgatt gcccgacabt atcgcgagcc catttatacc cababaaatc agcahccatg 6180 
tbggaattta atcgcggcct cgagcaagac gtttcccgbt gaatabggct cataacaccc 6240 
cttgtabtac tgttbabgba agcagacagt tbtattgbtc abgatgabat atttttabct 6300 
tgtgcaatgb aacatcagag atbttgagac acaacgtggc bbtccccccc cccccatbat 63 60 
tgaagcattt atcagggtba bbgbctcatg agcggabaca babtbgaatg tatttagaaa 6420 
aabaaacaaa taggggbtcc gcgcacabbb ccccgaaaag bgccaccbga cgbctaagaa 6480 
accattatta tcabgacatb aacctabaaa aataggcgta tcacgaggcc cbbtcgtc 6538 



<210> 34 
<211> 6460 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HXV genes 
<400> 34 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 

ctattggcca ttgcatacgt tgtatccata tcataatabg tacatttata ttggctcatg 300 

tccaacatta ccgccatgtt gacattgatt attgactagt tatbaatagt aatcaattac 360 

ggggtcatba gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 

cccgcctggc bgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt ttbggcagta 720 
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catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtcbcc accccattga 780 
cgtcaatggg agbbbgbbtb ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa bgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agabcgccbg gagacgccat ccacgctgtt bbgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc XOBO 
bcbtatgcab gctabacbgb tbtbggcbbg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggfctabbg accabtabtg accactcccc 1200 
tattggtgac gatactttcc abbactaabc cataacatgg ctctttgcca caactatctc 1260 
babbggcbab atgccaatac tcbgbccbbc agagactgac acggacbcbg tatttttaca 1320 
ggatggggtc ccabbbabta tttacaaatt cacatabaca acaacgccgt cccccgtgcc 13 80 
cgcagtbttb attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgbtccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gbabgtgtcb 1620 
gaaaatgagc gbggagabbg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbtgbt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gbagbcbgag cagtactcgt bgcbgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagacbgb bccbbtccab gggbctbbbc 1860 
tgcagbcacc gbcgbcgaca cgtgbgabca gababcgcgg ccgctctaga abgcgcgbga 192 0 
aggagaagba ccagcacctg bggcgcbggg gctggcgcbg gggcaccatg cbgcbgggca 1980 
tgcbgabgab cbgcagcgcc accgagaagc bgbgggtgac cgbgbacbac ggcgbgcccg 2 04 0 
tgtggaagga ggccaccacc accctgcbct gcgccagcga cgccaaggcc bacgacaccg 2100 
aggbgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
tggbgcbggb gaacgtgacc gagaacbbcg acabgbggaa gaacgacatg gbggagcaga 2220 
tgcacgagga cabcabcagc ctgbgggacc agagcctgaa gcccbgcgtg aagctgaccc 2280 
cccbgtgcgb gagcctgaag bgcaccgacg ctagcaccag cbgcaacacc agcgbgatca 234 0 
cccaggcctg ccccaaggbg agcbtcgagc ccatccccaa ccacbacbgc gcccccgccg 2400 
gcbbcgccab ccbgaagbgc aaggacaaga agbbcaacgg caagggcccc tgcaccaacg 2460 
tgagcaccgb gcagtgcacc cacggcatcc gccccgtggb gagcacccag cbgcbgcbga 2520 
acggbagccb ggccgaggag gaggtggtga bccgcagcgc baacbtcgcc gacaacgcca 25 80 
aggbgabcab cgbgcagctg aacgagagcg bggagatcaa cbgcacccgc gcbagcgccc 2640 
actgcaacct gagccgcgcc aagbggaacg acaccctgaa caagatcgtg abcaagcbgc 2700 
gcgagcagbb cggcaacaag accatcgtgb bcaagcacag cagcggcggc gaccccgaga 2760 
tcgbgaccca cagcttcaac tgcggcggcg agbbcbtcta cbgcaacagc gctagccbgc 2 82 0 
cctgccgcab caagcagatc abcaacatgb ggcagaaggb gggcaaggcc abgtacgccc 2 880 
cccccabcag cggccagatc cgctgcagca gcaacatcac cggcctgctg ctgacccgcg 2940 
acggcggcaa cagcaacaac gagagcgaga tcbbccgtcc gggcggcggc gacabgcgcg 3000 
acaactggcg cagcgagctg bacaagbaca aggbggbgaa gabcgagccc cbgggcgbgg 3060 
cccccaccaa ggccaagcbt accgbccagg cccgccagcb gcbgagcggc atcgbgcagc 3120 
agcagaacaa ccbgcbgcgc gccatcgagg cccagcagca cctgcbgcag cbgaccgbgb 3180 
ggggcabcaa gcagcbgcag gcccgcaccc bggccgbgga gcgcbaccbg aaggaccagc 3240 
agcagcbcga gcagabcbgg aaccacacca ccbggabgga gbgggaccgc gagabcaaca 3300 
acbacaccag ccbgabccac agcctgabcg aggagagcca gaaccagcag gagaagaacg 3360 
agcaggagcb gcbggagcbg gacaagbggg ccagccbgtg gaacbggbbc aacabcacca 34 20 
acbggcbgbg gbgaggabcc agabctgcbg bgccbbcbag bbgccagcca bcbgbbgbbb 34 80 
gccccbcccc cgbgcctbcc bbgaccctgg aaggbgccac bcccacbgbc ctbbccbaab 354 0 
aaaabgagga aabbgcabcg cabtgbctga gbaggbgbca btcbabtcbg gggggbgggg 3600 
tggggcagga cagcaagggg gaggabbggg aagacaabag caggcatgcb ggggabgcgg 3 660 
fcgggctctab gggbacccag gbgctgaaga abbgacccgg bbccfcccbgg gccagaaaga 3720 
agcaggcaca bccccbtctc bgbgacacac ccbgbccacg cccctggbbc tbagbbccag 3780 
ccccacbcab aggacactca bagcbcagga gggcbccgcc bbcaabccca cccgcbaaag 3840 
tacbbggagc ggbcbcbccc bcccbcabca gcccaccaaa ccaaaccbag cctccaagag 3900 
tgggaagaaa btaaagcaag abaggctabb aagtgcagag ggagagaaaa bgcctccaac 3960 
abgbgaggaa gtaabgagag aaabcataga abbbbaaggc cabgabtbaa ggccabcabg 4020 
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gccbbaabct bccgcttcct cgcbcacbga 
agcggtatca gcbcacbcaa aggcggtaat 
aggaaagaac abgbgagcaa aaggccagca 
gctggcgbbfc ttccataggc tccgcccccc 
bcagaggbgg cgaaacccga caggactata 
cctcgtgcgc tctcctgttc cgaccctgcc 
bbcgggaagc gtggcgcttt ctcatagctc 
cgttcgctcc aagctgggct gtgtgcacga 
atccggtaac batcgbctbg agtccaaccc 
agccactggt aacaggatta gcagagcgag 
gtggtggcct aactacggct acactagaag 
gccagttacc bbcggaaaaa. gagbbggbag 
bagcggbggb ttttttgttt gcaagcagca 
agatcctttg atcttttcta cggggbcbga 
gabbbtggbc abgagabbat caaaaaggat 
aagttttaaa tcaatctaaa gtatatatga 
aatcagtgag gcacctatct cagcgatctg 

cggggggggg gggcgctgag gtctgcctcg 

ctgaatcgcc ccatcatcca gccagaaagt 
gbaggbggac cagbbggbga bbbtgaactb 
gggaagatgc gtgatctgat ccttcaactc 
ccgtcccgtc aagtcagcgt aabgcbcbgc 
tagaaaaact catcgagcat caaatgaaac 
ccatattttt gaaaaagccg tttctgtaat 
aggatggcaa gatcctggta tcggtctgcg 
atbaabbbcc cctcgtcaaa aataaggtta 
gaatccggtg agaatggcaa aagcttatgc 
ccabbacgcb cgbcabcaaa atcactcgca 
gcctgagcga gacgaaatac gcgabcgcbg 
tgcaaccggc gcaggaacac tgccagcgca 
tcttctaata cctggaatgc tgttttcccg 
tcaggagtac ggataaaatg ctbgabggbc 
agtctgacca bcbcabcbgb aacabcabbg 
aactctggcg catcgggctt cccatacaat 
btabcgcgag cccatttata cccatataaa 
ctcgagcaag acgbbbcccg ttgaatatgg 
taagcagaca gbbbbabbgb tcabgabgat 
agabttbgag acacaacgtg gctttccccc 
babtgbcbca tgagcggata catatttgaa 
ccgcgcacat ttccccgaaa agbgccaccb 
ttaacctata aaaataggcg babcacgagg 



ctcgctgcgc tcggccgfcfcc ggctgcggcg 4080 
acggtbatcc acagaatcag gggataacgc 4140 
aaaggccagg aaccgbaaaa aggccgcgbb 4200 
tgacgagcat cacaaaaatc gacgctcaag 4260 
aagataccag gcgbttcccc ctggaagcbc 4320 
gcbbaccgga taccbgtccg ccbbbctccc 4380 
acgcbgbagg babcbcagtt cggtgbaggb 4440 
accccccgtt cagcccgacc gctgcgcctt 4500 
ggbaagacac gactbabcgc cactggcagc 4560 
gtabgtaggc ggbgcbacag agbtcbbgaa 4620 
aacagbatbb ggtabctgcg ctctgctgaa 4680 
ctcttgabcc ggcaaacaaa ccaccgcbgg 4740 
gabbacgcgc agaaaaaaag gabctcaaga 4 800 
cgcbcagtgg aacgaaaacb cacgbbaagg 4860 
cttcaccbag abccbttbaa abtaaaaatg 4920 
gbaaactbgg tctgacagtt accaabgctb 4980 
tcbabttcgt tcabccatag tbgcctgacb 5040 
bgaagaaggt gbbgcbgacb cabaccaggc 5100 
gagggagcca cggtbgabga gagctttgbb 5160 
ttgcttbgcc acggaacggb ctgcgbbgbc 5220 
agcaaaagtb cgabttattc aacaaagccg 5280 
cagbgbbaca accaabbaac caatbcbgab 5340 
bgcaatbtab bcababcagg atbabcaaba 5400 
gaaggagaaa acbcaccgag gcagttccab 54 60 
abbccgacbc gbccaacatc aabacaaccb 5520 
bcaagbgaga aabcaccabg agbgacgacb 5580 
abbtctbbcc agacbbgtbc aacaggccag 5640 
bcaaccaaac cgbbabbcab bcgbgabbgc 5700 
bbaaaaggac aatbacaaac aggaabcgaa 5760 
bcaacaabab bbbcacctga atcaggatab 5820 
gggatcgcag bggbgagbaa ccabgcatca 5880 
ggaagaggca baaabbccgt cagccagtbb 5940 
gcaacgcbac cbbbgccabg tbbcagaaac 6000 
cgabagabbg bcgcaccbga bbgcccgaca 6060 
bcagcatcca bgtbggaatt baabcgcggc 6120 
cbcabaacac ccctbgtabt acbgbbbabg 6180 
ababttbtab cbbgbgcaab gtaacatcag 6240 
cccccccabb abbgaagcab ttabcagggb 63 00 
bgbabbbaga aaaabaaaca aataggggbb 6360 
gacgbcbaag aaaccatbab tabcabgaca 6420 
cccbbtcgbc 6460 



<210> 35 
<211> 7987 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pAdApb conbaining HIV genes 
<400> 35 

bbaabbaacc gcaatbcbca bgbbbgacag cbbatcabca bcaabaabab accbbabbtb 60 
ggabbgaagc caababgaba abgagggggb ggagbbbgbg acgtggcgcg gggcgbggga 12 0 
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acggggcggg tgacgtagta gtgtggcgga agtgtgatgb tgcaagtgtg gcggaacaca 180 

tgtaagcgac ggatgtggca aaagtgacgt ttttggtgtg cgccggbgta cacaggaagt 240 

gacaattttc gcgcggtbbt aggcggatgt tgtagtaaat ttgggcgbaa ccgagtaaga 300 

tttggccatt ttcgcgggaa aactgaabaa gaggaagtga aatcbgaaba attttgtgtt 360 

actcatagcg cgbaatabbb gtcbagggcc gcggggactt tgaccgttta cgtggagact 420 

cgcccaggtg tttttctcag gtgttttccg cgttccgggt caaagttggc gttttattat 480 

batagtcagt acgtaccagt gcactggcct agagcggccc cattgcatac gttgtatcca 54 0 

tatcataata tgtacattta tahtggctca tgtccaacat tacggccabg ttgacatfcga 600 

bbattgacba gbbattaata gtaatcaatt acggggtcat tagttcatag cccatababg 660 

gagttccgcg tbacataacb bacggbaaat ggcccgccbg gcbgaccgcc caacgacccc 72 0 

cgcccabbga cgbcaataab gacgbatgtt cccabagbaa cgccaabagg gacttbccat 780 

bgacgbcaat gggbggagta bttacggtaa acbgcccacb tggcagbaca tcaagbgbat 84 0 

catatgccaa gtacgccccc battgacgtc aatgacggba aabggcccgc ctggcatbab 900 

gcccagtaca tgaccbtatg ggacbtbcct acbtggcagb acabctacgt attagtcatc 960 

gcbattacca tggtgatgcg gttttggcag tacabcaatg ggcgtggata gcggbbbgac 1020 

tcacggggat ttccaagtct ccaccccatt gacgbcaatg ggagttbgtb ttggcaccaa 1080 

aabcaacggg actttccaaa atgtcgbaac aactccgccc cattgacgca aatgggcggt 114 0 

aggcgtgtac ggbgggaggt ctatabaagc agagctcgbt tagtgaaccg tcagabcgcc 1200 

bggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 12 60 

cgbcaccgtc gbcgacacgt gtgabcagab abcgcggccg ctcbagaatg cgcgtgaagg 132 0 

agaagtacca gcaccbgtgg cgctggggct ggcgctgggg caccatgctg ctgggcatgc 13 80 

tgatgatcbg cagcgccacc gagaagcbgb gggbgaccgb gtactacggc gtgcccgtgt 1440 

ggaaggaggc caccaccacc cbgcbctgcg ccagcgacgc caaggcctac gacaccgagg 1500 

tgcacaacgt gbgggccacc cacgcctgcg tgcccaccga ccccaacccc caggaggtgg 1560 

tgcbggtgaa cgtgaccgag aacttcgaca tgtggaagaa cgacafcggtg gagcagatgc 1620 

acgaggacab catcagcctg bgggaccaga gcctgaagcc ctgcgbgaag ctgacccccc 1680 

bgbgcgtgag cctgaagtgc accgacgcba gcaagaacbg cagcbbcaac atcagcacca 1740 

gcatccgcgg caaggtgcag aaggagtacg ccttcttcta caagcbggac atcatcccca 1800 

bcgacaacga caccaccagc bacagcctga ccagctgcaa caccagcgtg atcacccagg 1860 

cctgccccaa ggtgagcttc gagcccatcc ccaaccacta ctgcgccccc gccggcttcg 1920 

ccatccbgaa gtgcaaggac aagaagtbca acggcaaggg ccccbgcacc aacgtgagca 1980 

ccgtgcagtg cacccacggc abccgccccg tggbgagcac ccagcbgctg ctgaacggba 2040 

gcctggccga ggaggaggbg gbgatccgca gcgcbaactb cgccgacaac gccaaggtga 2100 

bcatcgtgca gctgaacgag agcgtggaga tcaactgcac ccgccccaac aacaacaccc 2160 

gcaagagcab ccacatcggc cccggccgcg ccbbctacac caccggcgag atcabcggcg 2220 

acatccgcca ggcccacbgc aaccbgagcc gcgccaagbg gaacgacacc ctgaacaaga 2280 

tcgtgatcaa gctgcgcgag cagtbcggca acaagaccat cgtgttcaag cacagcagcg 2340 

gcggcgaccc cgagatcgtg acccacagct bcaactgcgg cggcgagbtc btctacbgca 2400 

acagcaccca gctgtbcaac agcacctggb tcaacagcac cbggagcacc gagggcagca 24 60 

acaacaccga gggcagcgac accabcaccc tgccctgccg catcaagcag abcabcaaca 2520 

bgbggcagaa ggtgggcaag gccabgbacg ccccccccab cagcggccag atccgcbgca 2580 

gcagcaacat caccggcctg cbgcbgaccc gcgacggcgg caacagcaac aacgagagcg 2640 

agatcbtccg tccgggcggc ggcgacabgc gcgacaactg gcgcagcgag ctgbacaagb 27 00 

acaaggtggt gaagatcgag cccctgggcg bggcccccac caaggccaag ctbaccgtcc 2760 

aggcccgcca gctgcbgagc ggcatcgbgc agcagcagaa caaccbgctg cgcgccabcg 2820 

aggcccagca gcaccbgcbg cagctgaccg fcgbggggcat caagcagctg caggcccgca 2880 

ccctggccgt ggagcgcbac cbgaaggacc agcagcagct cgagcagatc bggaaccaca 294 0 

ccacctggat ggagtgggac cgcgagabca acaactacac cagcctgatc cacagccbga 3000 

tcgaggagag ccagaaccag caggagaaga acgagcagga gctgctggag cbggacaagt 3060 

gggccagcct gbggaacbgg ttcaacabca ccaactggcb gtggbgagga bccagabctg 3120 

ctgtgccbtc bagttgccag ccatcbgbtg tbtgcccctc ccccgbgcct tccttgaccc 3180 

tggaaggtgc cactcccacb gbccttbcct aataaaatga ggaaabbgca tcgcabtgtc 3240 

bgagbaggtg bcattctabt ctggggggtg gggtggggca gcacagcaag ggggaggatb 33 00 

gggaagacaa tagcaggcab gctggggatg cggbgggcbc tabgggtacc cagggccgca 3360 

baactbcgta taatgbabgc tatacgaagb tataagabct gbacbgaaab gbgbgggcgt 3420 
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ggcttaaggg tgggaaagaa tatataaggt gggggtctta tgtagttttg tatctgtttt 3480 

gcagcagccg ccgccgccat gagcaccaac tcgtttgatg gaagcattgt gagctcatat 3540 

bbgacaacgc gcatgccccc atgggccggg gtgcgtcaga atgtgatggg ctccagcatt 3 600 

gatggtcgcc ccgtcctgcc cgcaaactct actaccttga cctacgagac cgtgtctgga 3660 

acgccgttgg agactgcagc ctccgccgcc gcttcagccg ctgcagccac cgcccgcggg 3720 

attgtgactg actttgcttt cctgagcccg cbbgcaagca gtgcagcttc ccgttcatcc 3780 

gcccgcgatg acaagttgac ggcbcbbbbg gcacaattgg attcfcttgac ccgggaacbb 3840 

aatgtcgttt ctcagcagct gbbggabcbg cgccagcagg tttctgccct gaaggcttcc 3 900 

bcccctccca atgcggttta aaacataaat aaaaaaccag actctgtttg gatttggatc 3960 

aagcaagtgt ctbgcbgbcb ttatttaggg gttttgcgcg cgcggtaggc ccgggaccag 4020 

cggtctcggt cgbbgagggb ccbgbgbabt bbbtccagga cgtggtaaag gbgacbcbgg 4 080 

atgttcagat acatgggcat aagcccgtct ctggggtgga ggtagcacca cbgcagagct 4140 

tcatgctgcg gggtggtgtt gtagatgatc cagtcgtagc aggagcgctg ggcgtggtgc 4 200 

ctaaaaatgt ctttcagtag caagctgatt gccaggggca ggcccbbggb gtaagtgttt 4260 

acaaagcggt taagctggga bgggtgcaba cgbggggaba tgagatgcat cttggactgt 4320 

atttttaggt bggcbatgbb cccagccata tccctccggg gabbcabgbb gtgcagaacc 4380 

accagcacag bgbabccggb gcacttggga aatttgtcat gtagcttaga aggaaatgcg 4440 

tggaagaact tggagacgcc cttgtgacct ccaagatttt ccabgcabbc gtccabaatg 4 500 

atggcaatgg gcccacgggc ggcggcctgg gcgaagatat ttctgggatc actaacgtca 4560 

bagbbgbgbb ccaggatgag atcgtcatag gccabtbbta caaagcgcgg gcggagggtg 4 620 

ccagactgcg gtataatggt tccatccggc ccaggggcgt agttaccctc acagatttgc 4680 

atttcccacg ctttgagttc agatgggggg atcatgtcta cctgcggggc gatgaagaaa 4740 

acggtbtccg gggtagggga gabcagcbgg gaagaaagca ggttcctgag cagctgcgac 4 800 

ttaccgcagc cggtgggccc gtaaatcaca cctattaccg gctgcaactg gtagttaaga 4860 

gagcbgcagc tgccgtcatc cctgagcagg ggggccacbb cgtbaagcab gbcccbgacb 4 920 

cgcabgtbbb cccbgaccaa abccgccaga aggcgcbcgc cgcccagcga tagcagbbct 4 980 

bgcaaggaag caaagbbbtb caacggtbbg agaccgbccg ccgbaggcab gctbbbgagc 5040 

gtbbgaccaa gcagbbccag gcggbcccac agcbcggbca ccbgcbctac ggcatcbcga 5100 

bccagcabab cbccbcgbtb cgcgggtbgg ggcggcbbbc gcbgtacggc agtagbcggt 5160 

gcbcgbccag acgggccagg gbcabgtctt bccacgggcg cagggbccbc gtcagcgbag 5220 

bcbgggtcac ggbgaagggg tgcgctccgg gctgcgcgcb ggccagggbg cgcbtgaggc 5280 

bggbccbgcb ggbgcbgaag cgcbgccggb cbtcgcccbg cgcgtcggcc aggbagcabb 5340 

bgaccabggb gbcabagbcc agccccbccg cggcgtggcc cbbggcgcgc agcbbgcccb 5400 

bggaggaggc gccgcacgag gggcagbgca gacbbtbgag ggcgtagagc tbgggcgcga 5460 

gaaabaccga bbccggggag baggcabccg cgccgcaggc cccgcagacg gbcbcgcabb 5520 

ccacgagcca ggbgagcbcb ggccgbbcgg ggtcaaaaac caggbtbccc ccabgcbbbb 5580 

bgabgcgbbb cbbaccbcbg gbbbccabga gccggbgbcc acgctcggbg acgaaaaggc 5640 

bgbccgbgbc cccgbabaca gacbbgagag gccbgbcctc gagcggbgbb ccgcggbccb 5700 

ccbcgbabag aaacbcggac cactcbgaga caaaggcbcg cgbccaggcc agcacgaagg 5760 

aggcbaagbg ggaggggbag cggbcgbbgb ccacbagggg gtccactcgc bccagggbgb 5820 

gaagacacab gbcgcccbcb bcggcabcaa ggaaggbgab bggbbbgbag gtgbaggcca 5880 

cgbgaccggg bgbbccbgaa ggggggcbab aaaagggggb gggggcgcgb bcgbcctcac 5940 

bcbcbtccgc abcgcbgbcb gcgagggcca gcbgtbgggg bgagtcgacg cgaggctgga 6000 

tggccbbccc cabbabgabb cbtcbcgcbb ccggcggcab cgggabgccc gcgbbgcagg 6060 

ccabgcbgbc caggcaggba gabgacgacc abcagggaca gctbcaaggc cagcaaaagg 6120 

ccaggaaccg baaaaaggcc gcgbbgcbgg cgbbtbbcca baggcbccgc cccccbgacg 6180 

agcabcacaa aaabcgacgc bcaagbcaga ggbggcgaaa cccgacagga ctabaaagab 6240 

accaggcgbb bcccccbgga agcbcccbcg bgcgcbcbcc bgbbccgacc cbgccgcbba 6300 

ccggabaccb gbccgccbbb cbcccbbcgg gaagcgbggc gcbtbcbcab agcbcacgcb 6360 

gbaggbabcb cagbtcggbg baggtcgttc gcbccaagct gggcbgbgtg cacgaacccc 6420 

ccgbbcagcc cgaccgctgc gccbbatccg gbaacbabcg bcbtgagbcc aacccggbaa 6480 

gacacgacbb abcgccacbg gcagcagcca cbggbaacag gabbagcaga gcgaggbabg 6540 

taggcggbgc bacagagtbc bbgaagbggb ggccbaacba cggcbacacb agaaggacag 66 00 

tabbbggbab cbgcgcbcbg cbgaagccag bbaccbtcgg aaaaagagbb ggbagcbctb 6660 

gabccggcaa acaaaccacc gcbggbagcg gtggbbtbtb bgbbbgcaag cagcagabta 6720 
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cgcgcagaaa aaaaggabcb caagaagabc cbbbgabcbb ttctacgggg tctgacgctc 6780 
agtggaacga aaactcacgt taagggattt tggtcatgag attahcaaaa aggatcttca 6840 
ccbagabccb tttaaabtaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 6900 
cttggtctga cagttaccaa fcgcttaatca gtgaggcacc tatctcagcg abctgtcbab 6960 
tbcgbbcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7020 
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7080 
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7140 
ccgcctccat ccagbcbatt aattgttgcc gggaagctag agbaagbagt tcgccagtba 7200 
atagtttgcg caacgbbgbb gccattgctg caggcatcgt ggtgtcacgc tcgtcgtttg 7260 
gtatggcttc abtcagctcc ggbbcccaac gatcaaggcg agbbacabga tcccccatgt 7320 
bgbgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagbtggccg 7380 
cagbgbtabc acbcatggtb abggcagcac tgcataatbc bcbtactgbc abgccabccg 7440 
baagatgcbb btcbgtgacb ggtgagbact caaccaagbc abbcbgagaa bagtgtatgc 7500 
ggcgaccgag tbgctcbbgc ccggcgtcaa cacgggataa baccgcgcca cabagcagaa 7560 
cbtbaaaagt gctcatcabb ggaaaacgtt cbtcggggcg aaaactctca aggabcttac 7620 
cgcbgbbgag abccagbbcg atgbaaccca cbcgbgcacc caacbgabcb tcagcatcbt 7680 
btactbbcac cagcgbbbcb gggbgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 7740 
gaabaagggc gacacggaaa tgbbgaabac tcatactcbt ccttbbtcaa tatbabbgaa 7800 
gcabttatca gggbtabtgb ctcatgagcg gabacababb bgaabgtabt tagaaaaaba 7860 
aacaaatagg ggbbccgcgc acabbtcccc gaaaagbgcc accbgacgbc taagaaacca 7 920 
bbabtatcab gacabbaacc babaaaaaba ggcgbabcac gaggcccbbt cgtctbcaag 7980 
aabbgbt 7987 



<210> 36 
<211> 7948 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pAdApb conbaining HIV genes 
<400> 36 

ttaabtaacc gcaabbctca bgbbbgacag cbtatcabca tcaabaabab acctbabbbb 60 
ggabbgaagc caababgaba abgagggggb ggagbbbgbg acgtggcgcg gggcgbggga 120 
ac 9999 c 999 bgacgbagba gtgbggcgga agbgbgabgt bgcaagbgbg gcggaacaca 180 
bgbaagcgac ggatgbggca aaagtgacgb bbbbggbgbg cgccggbgta cacaggaagt 240 
gacaabbbbc gcgcggbtbb aggcggabgt bgbagbaaat bbgggcgbaa ccgagtaaga 300 
tbbggccatb bbcgcgggaa aacbgaabaa gaggaagbga aabctgaaba abtttgbgbb 360 
acbcabagcg cgbaabattb gtcbagggcc gcggggacbb bgaccgbbba cgtggagact 420 
cgcccaggtg bbbbbcbcag gbgbbbbccg cgbbccgggb caaagbbggc gbbbbabbat 480 
babagbcagb acgbaccagb gcacbggccb agagcggccc cabbgcabac gttgbabcca 540 
babcabaaba bgtacabbba babbggcbca tgbccaacab baccgccabg tbgacabbga 600 
ttabtgacba gbbatbaaba gbaabcaabt acggggtcab bagbbcabag cccabababg 660 
gagbbccgcg bbacabaacb bacggbaaab ggcccgccbg gcbgaccgcc caacgacccc 720 
cgcccabbga cgbcaataab gacgbabgbb cccabagbaa cgccaabagg gacbbbccab 780 
bgacgtcaab gggbggagba bbbacggbaa acbgcccacb bggcagbaca bcaaghgbab 840 
cababgccaa gbacgccccc babbgacgbc aabgacggba aabggcccgc cbggcatbat 900 
gcccagbaca bgaccbbabg ggactbbccb acbbggcagb acabctacgb abbagbcabc 960 
gcbabbacca bggbgabgcg gbbbbggcag bacabcaabg ggcgbggaba gcggtbbgac 102 0 
tcacggggab bbccaagbct ccaccccatt gacgbcaabg ggagbtbgbb bbggcaccaa 1080 
aabcaacggg acbbbccaaa abgtcgbaac aacbccgccc cabbgacgca aabgggcggb 1140 
aggcgtgbac ggbgggaggb cbabataagc agagcbcgbb bagbgaaccg bcagabcgcc 120O 
bggagacgcc abccacgcbg bbbtgaccbc cabagaagac accgggaccg abccagccbc 1260 
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cgtcaccgtc gtcgacacgt gtgatcagat atcgcggccg ctctagaatg cgcgtgaagg 1320 
agaagtacca gcacctgtgg cgctggggct ggcgctgggg caccatgctg ctgggcatgc 1380 
tgatgatctg cagcgccacc gagaagctgt gggtgaccgt gtactacggc gtgcccgfcgt 1440 
ggaaggaggc caccaccacc ctgctctgcg ccagcgacgc caaggcctac gacaccgagg 1500 
tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc caggaggtgg 1560 
tgctggtgaa cgtgaccgag aacttcgaca tgtggaagaa cgacatggtg gagcagatgc 1620 
acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag ctgacccccc 1680 
tgtgcgtgag cctgaagtgc accgacctga agaacgacac caacaccaac agcagcagcg 1740 
gccgcatgat catggagaag ggcgagatca agaactgcag cttcgctagc accagctgca 1800 
acaccagcgt gatcacccag gcctgcccca aggtgagctt cgagcccatc cccaaccact 1860 
actgcgcccc cgccggcttc gccatcctga agtgcaagga caagaagttc aacggcaagg 1920 
gcccctgcac caacgtgagc accgtgcagt gcacccacgg catccgcccc gtggtgagca 1980 
cccagctgct gctgaacggt agcctggccg aggaggaggt ggtgatccgc agcgctaact 204 0 
tcgccgacaa cgccaaggtg atcatcgtgc agctgaacga gagcgtggag atcaactgca 2100 
cccgccccaa caacaacacc cgcaagagca tccacatcgg ccccggccgc gccttctaca 2160 
ccaccggcga gatcatcggc gacatccgcc aggcccactg caacctgagc cgcgccaagt 222 0 
ggaacgacac cchgaacaag atcgtgatca agctgcgcga gcagttcggc aacaagacca 22 80 
tcgtgttcaa gcacagcagc ggcggcgacc ccgagatcgt gacccacagc ttcaactgcg 2340 
gcggcgagtt cttctactgc aacagcaccc agctgttcaa cagcacctgg ttcaacagca 2400 
cctggagcac cgagggcagc aacaacaccg agggcagcga caccatcacc ctgccctgcc 2460 
gcatcaagca, gatcatcaac atgtggcaga aggtgggcaa ggccatgtac gcccccccca 2520 
tcagcggcca gatccgctgc agcagcaaca tcaccggcct gctgctgacc cgcgacggcg 2580 
gcaacagcaa caacgagagc gagatcttcc ghccgggcgg cggcgacatg cgcgacaact 2640 
ggcgcagcga gctgtacaag tacaaggtgg tgaagatcga gcccctgggc gtggccccca 2700 
ccaaggccaa gcttaccgtc caggcccgcc agctgctgag cggcatcgtg cagcagcaga 2760 
acaacctgct gcgcgccatc gaggcccagc agcacctgct gcagctgacc gtgtggggca 2 820 
tcaagcagct gcaggcccgc accctggccg tggagcgcta cctgaaggac cagcagcagc 2 880 
tcgagcagat ctggaaccac accacctgga tggagtggga ccgcgagatc aacaactaca 294 0 
ccagcctgat ccacagcctg atcgaggaga gccagaacca gcaggagaag aacgagcagg 3000 
agctgctgga gctggacaag tgggccagcc tgtggaactg gttcaacatc accaactggc 3060 
tgtggtgagg atccagatct gctgtgcctt ctagttgcca gccatctgtt gtttgcccct 3120 
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 3180 
aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 3240 
agcacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat gcggtgggct 3300 
ctatgggtac ccagggccgc ataacttcgt ataatgtatg ctatacgaag ttataagatc 3360 
tgtactgaaa tgtgtgggcg tggcttaagg gtgggaaaga atatataagg tgggggtctt 3420 
atgtagtttt gtatctgttt tgcagcagcc gccgccgcca tgagcaccaa ctcgtttgat 3480 
ggaagcattg tgagctcata tttgacaacg cgcatgcccc catgggccgg ggtgcgtcag 3540 
aatgtgatgg gctccagcat tgatggtcgc cccgtcctgc ccgcaaactc tactaccttg 3600 
acctacgaga ccgtgtctgg aacgccgttg gagactgcag cctccgccgc cgcttcagcc 3660 
gctgcagcca ccgcccgcgg gattgtgact gactttgctt tcctgagccc gcttgcaagc 3720 
agtgcagctt cccgttcatc cgcccgcgat gacaagttga cggctctttt ggcacaattg 3780 
gattctttga cccgggaact fcaatgtcgtt tctcagcagc tgttggafcct gcgccagcag 3840 
gttfcctgccc tgaaggcttc ctcccctccc aatgcggttt aaaacataaa taaaaaacca 3900 
gactcfcgttt ggatttggat caagcaagtg tcttgctgtc tttatttagg ggttttgcgc 3960 
gcgcggtagg cccgggacca gcggtctcgg tcgttgaggg tcctgtgtat tttttccagg 4020 
acgtggtaaa ggtgactctg gatgttcaga tacatgggca taagcccgtc tctggggtgg 4 080 
aggtagcacc actgcagagc ttcatgctgc ggggtggtgt tgtagatgat ccagtcgtag 414 0 
caggagcgct gggcgtggtg cctaaaaatg tctfctcagta gcaagctgat tgccaggggc 42 00 
aggcccttgg tgtaagtgtt tacaaagcgg ttaagctggg atgggtgcat acgtggggat 4260 
atgagatgca tcttggactg tatttttagg ttggctatgt tcccagccat atccctccgg 4320 
ggattcatgt tgtgcagaac caccagcaca gtgtatccgg tgcacttggg aaatttghca 43 80 
tgbagcttag aaggaaatgc gtggaagaac ttggagacgc ccttgtgacc tccaagattt 4440 
hccatgcatt cgtccataat gatggcaatg ggcccacggg cggcggcctg ggcgaagata 4500 
bttctgggat cactaacgtc atagttgtgt tccaggatga gatcgfccata ggccatttbt 4560 
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acaaagcgcg ggcggagggb gccagactgc 
tagbbacccb cacagabbbg catttcccac 
acctgcgggg cgatgaagaa aacggbbbcc 
aggttcctga gcagctgcga cttaccgcag 
ggctgcaact ggtagttaag agagctgcag 
bcgbbaagca tgtccctgac tcgcatgttb 
ccgcccagcg atagcagttc ttgcaaggaa 
gccgtaggca tgcttttgag cgtttgacca 
acctgctcta cggcatctcg atccagcata 
cgctgtacgg cagbagbcgg tgctcgtcca 
gcagggtcct cgtcagcgta gtctgggbca 
tggccagggt gcgcttgagg cbggbccbgc 
gcgcgtcggc caggtagcat bbgaccabgg 
ccttggcgcg cagcttgccc bbggaggagg 
gggcgtagag cttgggcgcg agaaataccg 
ccccgcagac ggtctcgcat tccacgagcc 
ccaggtttcc cccatgcttt bbgabgcgbb. 
cacgctcggt gacgaaaagg ctgtccgtgt 
cgagcggtgt tccgcggtcc tcctcgtata 
gcgtccaggc cagcacgaag gaggctaagt 
ggtccactcg ctccagggtg tgaagacaca 
tbggbbbgta ggtgtaggcc acgbgaccgg 
bgggggcgcg bbcgtccbca ctctcttccg 
gtgagbcgac gcgaggctgg atggccttcc 
tcgggatgcc cgcgttgcag gccabgcbgb 
agcttcaagg ccagcaaaag gccaggaacc 
ataggctccg cccccctgac gagcatcaca 
acccgacagg actabaaaga taccaggcgt 
ctgttccgac cctgccgctt accggabacc 
cgctbbcbca bagcbcacgc bgbaggbabc 
tgggcbgbgb gcacgaaccc cccgbbcagc 
gbcbbgagbc caacccggba agacacgacb 
ggabbagcag agcgaggbab gtaggcggtg 
acggctacac bagaaggaca gtabbbggba 
gaaaaagagb bggbagctcb bgabccggca 
btgbtbgcaa gcagcagabb acgcgcagaa 
bbbcbacggg gbcbgacgcb cagbggaacg 
gabbabcaaa aaggabcbbc acctagabcc 
bcbaaagbab ababgagbaa acbtggtctg 
cbabctcagc gabctgbcba bbbcgttcat 
baacbacgab acgggagggc bbaccabcbg 
cacgcbcacc ggcbccagab bbabcagcaa 
gaagbggbcc bgcaacbbba bccgcctcca 
gagbaagbag btcgccagbb aabagbbbgc 
bggbgbcacg ctcgbcgtbt ggbabggcbb 
gagbbacatg atcccccabg ttgbgcaaaa 
tbgtcagaag baagbbggcc gcagbgbbab 
cbcbbacbgb cabgccabcc gbaagabgct 
cabbcbgaga abagbgbabg cggcgaccga 
abaccgcgcc acabagcaga acbbbaaaag 
gaaaacbcbc aaggabcbba ccgcbgbtga 
ccaacbgabc bbcagcabcb bbtacbbbca 
ggcaaaafcgc cgcaaaaaag ggaabaaggg 
bccbbbbbca abatbabbga agcabbbabc 
bbgaabgbab bbagaaaaab aaacaaabag 



ggbabaabgg bbccabccgg cccaggggcg 4620 
gcbbbgagbb cagabggggg gabcabgbcb 4680 
ggggbagggg agabcagcbg ggaagaaagc 4740 
ccggbgggcc cgtaaabcac accbabbacc 4800 
cbgccgbcab cccbgagcag gggggccacb 4 8 60 
bcccbgacca aabccgccag aaggcgcbcg 4920 
gcaaagbbbb bcaacggbbt gagaccgbcc 4980 
agcagbbcca ggcggtccca cagctcggbc 5040 
bcbcctcgbb bcgcgggbbg gggcggcbbb 5100 
gacgggccag ggbcabgbcb bbccacgggc 5160 
cggbgaaggg gbgcgcbccg ggctgcgcgc 5220 
bggbgcbgaa gcgcbgccgg tcbbcgcccb 5280 
bgbcabagbc cagccccbcc gcggcgtggc 5340 
cgccgcacga ggggcagbgc agacbbbbga 54 00 
abbccgggga gtaggcatcc gcgccgcagg 5460 
aggbgagctc bggccgbbcg gggbcaaaaa 5520 
bcbtaccbcb ggbbbccabg agccggbgbc 5580 
ccccgbabac agacbbgaga ggccbgbccb 5640 
gaaacbcgga ccacbcbgag acaaaggcbc 5700 
gggaggggba gcggbcgbbg bccacbaggg 5760 
bgtcgcccbc btcggcabca aggaaggbga 5820 
gbgbbccbga aggggggcba baaaaggggg 5880 
cabcgcbgbc bgcgagggcc agcbgbbggg 5940 
ccabbabgab bcbbcbcgcb bccggcggca 6000 
ccaggcaggb agabgacgac cabcagggac 6060 
gbaaaaaggc cgcgbbgcbg gcgbtbttcc 6120 
aaaabcgacg cbcaagtcag aggbggcgaa 6180 
bbccccctgg aagcbcccbc gbgcgctcbc 6240 
bgbccgccbb bcbccctbcg ggaagcgbgg 6300 
bcagbbcggb gbaggbcgbb cgcbccaagc 6360 
ccgaccgcbg cgccbbabcc ggbaacbabc 6420 
babcgccacb ggcagcagcc acbggbaaca 6480 
cbacagagtb cbbgaagbgg bggccbaacb 6540 
bcbgcgctcb gcbgaagcca gbbaccbbcg 6600 
aacaaaccac cgcbggbagc ggbggbbbtb 6660 
aaaaaggabc bcaagaagat ccbbbgatcb 6720 
aaaacbcacg bbaagggabb bbggbcabga 6780 
tbbbaaabba aaaabgaagb bbbaaabcaa 6840 
acagbbacca abgcbbaabc agbgaggcac 6900 
ccatagtbgc cbgacbcccc gtcgbgbaga 6960 
gccccagbgc tgcaabgaba ccgcgagacc 7020 
baaaccagcc agccggaagg gccgagcgca 70 80 
bccagbcbab taabbgtbgc cgggaagcba 7140 
gcaacgtbgb tgccabbgcb gcaggcabcg 7200 
cabbcagcbc cggbbcccaa cgabcaaggc 7260 
aagcggbbag cbccbtcggb- ccbccgabcg 7320 
cacbcabggb babggcagca cbgcabaabb 7380 
bbbcbgbgac bggbgagbac bcaaccaagb 7440 
gbbgcbcbbg cccggcgbca acacgggaba 7500 
bgcbcabcab bggaaaacgb bcbbcggggc 7560 
gabccagbbc gabgbaaccc acbcgbgcac 7620 
ccagcgbbbc bgggbgagca aaaacaggaa 7680 
cgacacggaa abgbbgaaba cbcabacbcb 7740 
agggbbabbg bctcabgagc ggabacatab 7800 
gggtbccgcg cacabbbccc cgaaaagtgc 7860 



- 84 - 



WO 02/32943 PCT/US01/25721 

cacctgacgt ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca 7920 
cgaggccctb tcgtcttcaa gaattgtt 794 8 



<210> 37 
<211> 7963 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pAdApt containing HIV genes 

• 

<400> 37 

ttaattaacc gcaattctca tgtttgacag cttatcatca tcaataatat accttatttt 60 
ggattgaagc caatatgata atgagggggt ggagtttgtg acgtggcgcg gggcgtggga 120 
a cggggcggg tgacgtagta gtgtggcgga agtgtgatgt bgcaagtgbg gcggaacaca 180 
tgtaagcgac ggatgtggca aaagtgacgt ttttggtgtg cgccggtgba cacaggaagt 24 0 
gacaattttc gcgcggtttt aggcggatgt tgtagtaaat ttgggcgtaa ccgagtaaga 300 
tttggccatt ttcgcgggaa aactgaataa gaggaagbga aatctgaata attttgtgtt 360 
actcatagcg cgbaatabtb gtcbagggcc gcggggactt tgaccgttta cgtggagact 420 
cgcccaggtg tttttctcag gtgttttccg cgttccgggt caaagttggc gttttattat 480 
tatagtcagt acgtaccagt gcactggcct agagcggccc cattgcabac gttgtabcca 54 0 
tatcataata tgtacabbba battggctca bgbccaacat taccgccatg ttgacattga 600 
ttattgacta gttabtaaba gtaatcaatt acggggtcat tagttcatag cccababatg 660 
gagbtccgcg btacataacb tacggtaaat ggcccgcctg gctgaccgcc caacgacccc 720 
cgcccabtga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg gactttccat 780 
tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca tcaagtgtat 840 
catatgccaa gtacgccccc tattgacgbc aabgacggba aatggcccgc ctggcattat 900 
gcccagbaca tgacctbabg ggactttcct acttggcagt acatctacgt attagtcatc 960 
gctattacca bggtgabgcg gttttggcag tacatcaatg ggcgtggata gcggbtbgac 1020 
tcacggggab ttccaagtct ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa 1080 
aatcaacggg actttccaaa atgtcgtaac aactccgccc cattgacgca aatgggcggh 1140 
aggcgtgtac ggtgggaggt ctabataagc agagctcgtb tagtgaaccg tcagatcgcc 1200 
tggagacgcc abccacgcbg btbbgacctc catagaagac accgggaccg atccagcctc 1260 
cgbcaccgbc gtcgacacgb gtgatcagat atcgcggccg ctctagaatg cgcgtgaagg 1320 
agaagtacca gcacctgbgg cgcbggggct ggcgctgggg caccabgctg ctgggcabgc 1380 
tgatgatcbg cagcgccacc gagaagctgt gggbgaccgt gtactacggc gbgcccgtgb 1440 
ggaaggaggc caccaccacc ctgctcbgcg ccagcgacgc caaggcctac gacaccgagg 1500 
tgcacaacgt gtgggccacc cacgccbgcg tgcccaccga ccccaacccc caggaggtgg 1560 
cgctggtgaa cgbgaccgag aacttcgaca bgbggaagaa cgacabggtg gagcagatgc 1620 
acgaggacab catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag cbgacccccc 1680 
tgbgcgbgag cctgaagtgc accgacctga agaacgacac caacaccaac agcagcagcg 1740 
gccgcabgat cabggagaag ggcgagabca agaactgcag ctbcaacabc agcaccagca 1800 
tccgcggcaa ggbgcagaag gagbacgcct tcbtctacaa gctggacatc atccccatcg 1860 
acaacgacac caccagctac agcctgacca gctgcaacac cagcgbgabc acccaggccb 1920 
gccccaaggt gagcttcgag cccatcccca accactactg cgcccccgcc ggcbbcgcca 1980 
tcctgaagbg caaggacaag aagttcaacg gcaagggccc ctgcaccaac gtgagcaccg 2040 
tgcagbgcac ccacggcabc cgccccgbgg tgagcaccca gcbgctgctg aacggtagcc 2100 
tggccgagga ggaggtggtg atccgcagcg cbaacttcgc cgacaacgcc aaggbgatca 2160 
tcgtgcagct gaacgagagc gtggagabca actgcacccg cgctagcgcc cacfcgcaacc 2220 
tgagccgcgc caagbggaac gacacccbga acaagatcgb gabcaagctg cgcgagcagb 2280 
tcggcaacaa gaccafccgbg ttcaagcaca gcagcggcgg cgaccccgag atcgtgaccc 2340 
acagcbtcaa ctgcggcggc gagbtcttct actgcaacag cacccagctg ttcaacagca 2400 
cctggttcaa cagcaccbgg agcaccgagg gcagcaacaa caccgagggc agcgacacca 24 60 
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tcaccctgcc ctgccgcatc aagcagatca tcaacabgbg gcagaaggtg ggcaaggcca 2520 
tgtacgcccc ccccatcagc ggccagatcc gctgcagcag caacatcacc ggcctgctgc 2580 
tgacccgcga cggcggcaac agcaacaacg agagcgagat cttccgtccg ggcggcggcg 2640 
acatgcgcga caactggcgc agcgagctgt acaagtacaa ggtggtgaag ahcgagcccc 2700 
tgggcgtggc ccccaccaag gccaagchta ccgtccaggc ccgccagctg cbgagcggca 2760 
tcgtgcagca gcagaacaac ctgctgcgcg ccatcgaggc ccagcagcac ctgctgcagc 2820 
tgaccgtgbg gggcatcaag cagctgcagg cccgcaccct ggccgtggag cgctacctga 2880 
aggaccagca gcagctcgag cagatctgga accacaccac cbggabggag tgggaccgcg 2940 
agatcaacaa ctacaccagc cbgabccaca gcctgatcga ggagagccag aaccagcagg 3000 
agaagaacga gcaggagctg ctggagctgg acaagtgggc cagcctgtgg aactggttca 3060 
acatcaccaa ctggcbgbgg tgaggatcca gatctgctgt gcctbcbagt tgccagccat 3120 
cbgbbgbtbg cccctccccc gtgccttcct bgacccbgga aggtgccact cccactgtcc 3180 
tbtcctaata aaatgaggaa abtgcatcgc attgtctgag taggtgtcat tctattctgg 3240 
9999t9999t ggggcagcac agcaaggggg aggattggga agacaatagc aggcatgctg 3300 
gggatgcggt gggctctatg ggtacccagg gccgcataac ttcgtataat gbatgcbata 3360 
cgaagttata agatcbgbac tgaaatgtgt gggcgtggct taagggtggg aaagaatata 3420 
baaggbgggg gbctbabgba gbtbtgbabc bgttbtgcag cagccgccgc cgccatgagc 3480 
accaactcgt bbgabggaag cattgtgagc tcatatttga caacgcgcat gcccccatgg 3540 
gccggggtgc gbcagaabgt gatgggctcc agcattgatg gbcgccccgt cctgcccgca 3600 
aactctacta ccttgaccta cgagaccgtg tctggaacgc cgttggagac tgcagcctcc 3660 
gccgccgctt cagccgctgc agccaccgcc cgcgggattg tgactgactt tgctttcctg 3720 
agcccgctfcg caagcagtgc agcttcccgt' tcatccgccc gcgatgacaa gttgacggct 3780 
cttttggcac aattggattc bttgacccgg gaacttaatg tcgtttctca gcagctgttg 3840 
gatctgcgcc agcaggtttc tgccctgaag gcttcctccc ctcccaatgc ggtttaaaac 3900 
abaaataaaa aaccagactc tgtttggatt tggatcaagc aagbgtcttg cbgtctttab 3960 
bbaggggbbb tgcgcgcgcg gtaggcccgg gaccagcggt ctcggtcgtt gagggtcctg 4020 
tgtatttttt ccaggacgtg gtaaaggtga ctctggatgt tcagatacat gggcataagc 4080 
ccgbcbctgg ggtggaggta gcaccactgc agagcttcat gctgcggggt ggbgttgbag 414 0 
atgatccagt cgtagcagga gcgctgggcg tggtgcctaa aaatgtcttt cagtagcaag 4200 
ctgattgcca ggggcaggcc cttggtgtaa gtgtttacaa agcggttaag ctgggatggg 4260 
tgcatacgtg gggatatgag atgcatcttg gactgtattt ttaggttggc tatgttccca 4320 
gccatatccc tccggggatt catgtbgbgc agaaccacca gcacagtgta tccggtgcac 4380 
ttgggaaatt fcgtcatgtag cttagaagga aatgcgtgga agaactbgga gacgcccttg 4440 
tgacctccaa gattttccat gcattcgtcc ataatgatgg caahgggccc acgggcggcg 4500 
gcctgggcga agatatttct gggatcacta acgtcatagt tgtgttccag gatgagatcg 4560 
tcataggcca tttttacaaa gcgcgggcgg agggtgccag actgcggtat aatggttcca 4620 
tccggcccag gggcgtagtt accctcacag atttgcattt cccacgcttt gagttcagat 4680 
999999 a tca tgtctacctg cggggcgatg aagaaaacgg tttccggggt aggggagatc 4 740 
agctgggaag aaagcaggtt cctgagcagc tgcgacttac cgcagccggt gggcccgtaa 4800 
atcacaccta ttaccggctg caactggtag ttaagagagc tgcagctgcc gtcatccctg 4860 
agcagggggg ccacttcgtt aagcatgtcc ctgactcgca tgtbttccct gaccaaatcc 4920 
gccagaaggc gctcgccgcc cagcgatagc agttcttgca aggaagcaaa gtttttcaac 4980 
ggtttgagac cgtccgccgt aggcatgctt ttgagcgttt gaccaagcag ttccaggcgg 5040 
tcccacagct cggtcacctg ctctacggca tctcgatcca gcatatctcc tcgtttcgcg 5100 
ggbtggggcg gctttcgctg tacggcagta gbcggtgctc gbccagacgg gccagggbca 5160 
bgbcttbcca cgggcgcagg gtcctcgtca gcgtagtctg ggtcacggtg aaggggbgcg 5220 
ctccgggcbg cgcgcbggcc agggbgcgcb bgaggctggb cctgctggbg ctgaagcgct 5280 
gccggbcbbc gcccbgcgcg bcggccaggb agcabbbgac catggtgtca tagbccagcc 5340 
ccbccgcggc gbggcccbtg gcgcgcagcb bgcccbbgga ggaggcgccg cacgaggggc 5400 
agbgcagact btbgagggcg tagagcbbgg gcgcgagaaa taccgabtcc ggggagtagg 5460 
cabccgcgcc gcaggccccg cagacggbct cgcabtccac gagccaggtg agcbctggcc 5520 
gbbcggggbc aaaaaccagg btbcccccat gctttttgat gcgbbfcctta ccbctggttt 5580 
ccatgagccg gtgtccacgc bcggtgacga aaaggctgtc cgbgbccccg tabacagact 5640 
bgagaggcct gtccbcgagc ggbgtbccgc ggbccbcctc gtabagaaac tcggaccact 5700 
cbgagacaaa ggcfccgcgtc caggccagca cgaaggaggc baagbgggag gggbagcggt 5760 
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cgbtgtccac tagggggtcc acbcgctcca gggtgbgaag acacatgtcg ccctcttcgg 5820 

cabcaaggaa ggtgabbggt ttgtaggtgt aggccacgtg accgggtgtfc cctgaagggg 5880 

ggctataaaa gggggtgggg gcgcgbtcgb cctcactctc ttccgcatcg ctgtctgcga 5940 

gggccagctg bbggggbgag tcgacgcgag gctggatggc cttccccatt atgattcttc 6000 

tcgcttccgg cggcabcggg atgcccgcgt tgcaggccat gctgtccagg caggtagatg 6060 

acgaccatca gggacagcbb caaggccagc aaaaggccag gaaccgtaaa aaggccgcgt 6120 

bgcbggcgtb bbbccabagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa 6180 

gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgttfcccc ccbggaagcb 6240 

ccctcgtgcg cbcbccbgbt ccgacccbgc cgcttaccgg atacctgtcc gcctttctcc 6300 

cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt bcggbgbagg 6360 

bcgbbcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct 6420 

babccggbaa cbabcgbcbb gagtccaacc cggtaagaca cgacttatcg ccactggcag 6480 

cagccactgg baacaggabt agcagagcga ggtafcgbagg cggbgcbaca gagbbcttga 654 0 

agbggfcggcc baactacggc bacacbagaa ggacagtabt tggtatcbgc gcbctgcbga 6600 

agccagbbac cbtcggaaaa agagbbggba gcbctbgabc cggcaaacaa accaccgctg 6660 

gbagcggbgg tbbbtbtgbt bgcaagcagc agabtacgcg cagaaaaaaa ggabctcaag 6720 

aagatcctbb gabctttbct acggggbctg acgcbcagtg gaacgaaaac tcacgbtaag 6780 

ggatbtbggt cabgagabta bcaaaaagga hcbbcaccba gatcctbbba aabbaaaaat 6840 

gaagtttbaa abcaabchaa agtabatatg agtaaacbbg gtctgacagt fcaccaatgcb 6900 

baatcagtga ggcaccbabc tcagcgatcb gbctabbtcg bbcabccaba gtbgcctgac 6960 

tccccgtcgb gbagataact acgabacggg agggctbacc atcbggcccc agbgctgcaa 7020 

bgataccgcg agacccacgc bcaccggctc cagabbtatc agcaataaac cagccagccg 7080 

gaagggccga gcgcagaagt ggbccbgcaa ctttatccgc cbccatccag tcbatbaabb 7140 

gtbgccggga agcbagagba agbagbbcgc cagbtaabag bbbgcgcaac gbbgbbgcca 7200 

ttgcbgcagg cabcgtggbg bcacgcbcgt cgfcbbggtab ggcbtcattc agctccggtb 7260 

cccaacgabc aaggcgagbt acabgatccc ccabgtbgbg caaaaaagcg gbbagctccb 7320 

tcggtccbcc gabcgbtgbc agaagbaagb bggccgcagb gbbatcacbc atggbbabgg 7380 

cagcacbgca baabbcbcbb acbgbcabgc cabccgtaag abgcbtbbct gtgacbggbg 7440 

agbactcaac caagbcabbc tgagaabagb gbabgcggcg accgagbtgc bcbbgcccgg 7500 

cgbcaacacg ggabaabacc gcgccacaba gcagaacbtb aaaagbgcbc atcatbggaa 7560 

aacgbtcbbc ggggcgaaaa cbcbcaagga bcbbaccgct gtbgagabcc agbtcgabgb 7620 

aacccacbcg bgcacccaac tgabcbbcag catcbbbbac ttbcaccagc gbtbctgggb 7680 

gagcaaaaac aggaaggcaa aabgccgcaa aaaagggaab aagggcgaca cggaaatgbb 7740 

gaatacbcab acbctbccbt bbbcaababb atbgaagcab bbabcagggb babtgtcbca 7800 

tgagcggata cabattbgaa bgbabbbaga aaaataaaca aataggggtb ccgcgcacat 7860 

ttccccgaaa agbgccacct gacgtcbaag aaaccatbat babcabgaca tbaaccbaba 7920 

aaaabaggcg babcacgagg cccbtbcgtc bbcaagaabt gbb 7963 



<210> 38 

<211> 7966 
<212> DNA 

. <213> Arbificial Sequence 
<220> 

<223> plasmid pAdApb conbaining HIV genes 
<400> 38 

bbaabbaacc gcaabbcbca tgbbbgacag ctbabcabca tcaabaabab accbtafcbtt 60 
ggatbgaagc caababgaba abgagggggt ggagbbtgbg acgbggcgcg gggcgbggga 12 0 
acggggcggg bgacgbagba gbgbggcgga agbgbgabgb tgcaagbgbg gcggaacaca 180 
bgbaagcgac ggatgbggca aaagtgacgb bbbbggbgbg cgccggbgba cacaggaagb 240 
gacaacbbbc gcgcggtbbb aggcggabgb bgbagbaaat ttgggcgbaa ccgagbaaga 300 
bbbggccabb btcgcgggaa aacbgaabaa gaggaagbga aabcbgaaba abtbbgbgbb 360 



- 87 - 



WO 02/32943 PCT/US01/25721 

actcatagcg cgtaatattt gbcbagggcc gcggggactt bgaccgbtta cgtggagact 420 
cgcccaggtg tttttctcag gtgbbttccg cgttccgggt caaagttggc gttbtatbab 480 
tatagtcagt acgtaccagt gcactggcct agagcggccc cattgcatac gttgtatcca 540 
tatcataata tgtacattta. tattggcbca bgtccaacat taccgccatg bbgacatbga 600 
ttattgacta gttattaata gtaatcaatt acggggtcat bagttcabag cccabatabg 660 
gagbtccgcg ttacafcaact tacggtaaab ggcccgcctg gctgaccgcc caacgacccc 720 
cgcccabbga cgtcaataat gacgtabgbb cccatagtaa cgccaatagg gactttccat 780 
tgacgtcaat gggtggagta bbtacggtaa actgcccact tggcagtaca tcaagtgtat 840 
catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc ctggcattat 900 
gcccagtaca bgacctbabg ggacbbbccb acttggcagt acabctacgb attagtcatc 960 
gctatbacca tggtgabgcg gttbbggcag bacatcaabg ggcgtggaba gcggbbbgac 1020 
tcacggggat ttccaagtct ccaccccabb gacgtcaabg ggagbtbgbb btggcaccaa 1080 
aabcaacggg acbttccaaa abgbcgbaac aacbccgccc catbgacgca aatgggcggt 1140 
aggcgtgtac ggtgggaggb ctababaagc agagctcgbt tagbgaaccg bcagatcgcc 1200 
bggagacgcc atccacgcbg bttbgaccbc catagaagac accgggaccg atccagccbc 1260 
cgtcaccgbc gtcgacacgt gbgatcagat abcgcggccg cbcbagaabg cgcgtgaagg 1320 
agaagtacca gcacctgtgg cgcbggggcb ggcgcbgggg caccabgcbg ctgggcatgc 13 80 
tgatgatctg cagcgccacc gagaagctgb gggbgaccgt gtacbacggc gtgcccgtgb 1440 
ggaaggaggc caccaccacc ctgctcbgcg ccagcgacgc caaggcctac gacaccgagg 1500 
tgcacaacgt gbgggccacc cacgccbgcg tgcccaccga ccccaacccc caggaggbgg 1560 
tgcbggbgaa cgbgaccgag aacttcgaca tgtggaagaa cgacatggtg gagcagabgc 1620 
acgaggacab cabcagccbg tgggaccaga gccbgaagcc ctgcgbgaag cbgacccccc 1680 
bgbgcgbgag ccbgaagtgc accgaccbga agaacgacac caacaccaac agcagcagcg 1740 
gccgcabgab cabggagaag ggcgagabca agaactgcag ctbcaacabc agcaccagca 1800 
bccgcggcaa ggbgcagaag gagbacgccb bcbbctacaa gcbggacatc abccccabcg 1860 
acaacgacac caccagcbac agcctgacca gctgcaacac cagcgbgatc acccaggcct 1920 
gccccaaggt gagcbbcgag cccatcccca accactacbg cgcccccgcc ggctbcgcca 1980 
bccbgaagbg caaggacaag aagbtcaacg gcaagggccc cbgcaccaac gbgagcaccg 2040 
bgcagtgcac ccacggcabc cgccccgtgg bgagcaccca gctgcbgcbg aacggbagcc 2100 
tggccgagga ggaggtggtg atccgcagcg ctaacttcgc cgacaacgcc aaggbgabca 2160 
bcgbgcagcb gaacgagagc gtggagabca actgcacccg ccccaacaac aacacccgca 2220 
agagcabcca catcggcccc ggccgcgcct bctacaccac cggcgagatc atcggcgaca 2280 
bccgccaggc ccacbgcaac ctgagccgcg ccaagbggaa cgacaccctg aacaagabcg 2340 
bgabcaagcb gcgcgagcag ttcggcaaca agaccatcgt gttcaagcac agcagcggcg 2400 
gcgaccccga gatcgtgacc cacagcbbca acbgcggcgg cgagbtcbbc bactgcaaca 2460 
gcgctagcct gccctgccgc atcaagcaga tcabcaacab gbggcagaag gbgggcaagg 2520 
ccatgtacgc cccccccabc agcggccaga tccgctgcag cagcaacatc accggccbgc 2580 
tgcbgacccg cgacggcggc aacagcaaca acgagagcga gatcbtccgb ccgggcggcg 2 640 
gcgacabgcg cgacaacbgg cgcagcgagc tgtacaagta caaggbggtg aagatcgagc 2700 
ccctgggcgt ggcccccacc aaggccaagc bbaccgtcca ggcccgccag ctgctgagcg 2760 
gcatcgtgca gcagcagaac aacctgcbgc gcgccabcga ggcccagcag cacctgcbgc 2 820 
agctgaccgb gbggggcatc aagcagcbgc aggcccgcac ccbggccgtg gagcgcbacc 2880 
tgaaggacca gcagcagcbc gagcagatct ggaaccacac caccbggatg gagbgggacc 2940 
gcgagatcaa caactacacc agccbgatcc acagccbgat cgaggagagc cagaaccagc 3000 
aggagaagaa cgagcaggag ctgcbggagc bggacaagbg ggccagccbg bggaacbggt 3060 
tcaacatcac caactggcbg tggtgaggat ccagatctgc tgtgcctbct agbbgccagc 3120 
catctgbtgt tbgccccbcc cccgtgccbt ccttgaccct ggaaggbgcc acbcccactg 3180 
tcctttccta abaaaatgag gaaatbgcat cgcabbgbct gagbaggbgt catbcbabbc 3240 
bggggggtgg 99tggggcag cacagcaagg gggaggabtg ggaagacaab agcaggcatg 3300 
ctggggabgc ggbgggctct atgggbaccc agggccgcab aacttcgtab aatgbabgcb 33 60 
abacgaagtt abaagatcbg bactgaaatg tgtgggcgtg gcbtaagggt gggaaagaab 3420 
ababaaggtg ggggtcttat gtagbbtbgt abctgttbtg cagcagccgc cgccgccabg 3480 
agcaccaact cgbttgabgg aagcabtgbg agcbcabatt tgacaacgcg cabgccccca 3540 
bgggccgggg tgcgtcagaa bgbgatgggc bccagcabbg abggtcgccc cgtccbgccc 3600 
gcaaacbcba ctaccbbgac ctacgagacc gbgbcbggaa cgccgbtgga gacbgcagcc 3660 
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tccgccgccg cttcagccgc tgcagccacc 
ctgagcccgc ttgcaagcag tgcagcttcc 
gctcttttgg cacaattgga ttctttgacc 
btggabctgc gccagcaggt ttcbgccctg 
aacataaata aaaaaccaga ctctgbttgg 
tatttagggg bbbtgcgcgc gcggtaggcc 
ctgtgtattt bbbccaggac gtggtaaagg 
agcccgtctc tggggtggag gtagcaccac 
tagatgatcc agtcgtagca ggagcgctgg 
aagcbgabbg ccaggggcag gcccbbggtg 
gggtgcatac gtggggatat gagatgcatc 
ccagccatat ccctccgggg attcabgtbg 
cacttgggaa atttgtcatg tagcttagaa 
ttgtgacctc caagatttfcc catgcattcg 
gcggcctggg cgaagatatt tctgggatca 
tcgtcatagg ccatttttac aaagcgcggg 
ccatccggcc caggggcgta gttaccctca 
gatgggggga bcabgtcbac ctgcggggcg 
atcagctggg aagaaagcag gttcctgagc 
taaatcacac ctattaccgg ctgcaactgg 
ctgagcaggg gggccacttc gbbaagcatg 
tccgccagaa ggcgctcgcc gcccagcgafc 
aacggtttga gaccgtccgc cgtaggcatg 
cggtcccaca gctcggtcac ctgctctacg 
gcgggttggg gcggctbbcg ctgtacggca 
bcatgtcbtt ccacgggcgc agggtcctcg 
gcgctccggg ctgcgcgctg gccagggtgc 
gctgccggtc btcgcccbgc gcgtcggcca 
gcccctccgc ggcgtggccc tbggcgcgca 
ggcagtgcag acttttgagg gcgtagagct 
aggcatccgc gccgcaggcc ccgcagacgg 
gccgttcggg gtcaaaaacc aggtttcccc 
bbbccatgag ccggtgtcca cgctcggtga 
acfctgagagg cctgtcctcg agcggtgttc 
actctgagac aaaggctcgc gtccaggcca 
ggbcgttgbc cactaggggg tccactcgct 
cggcatcaag gaaggtgatt ggbbbgtagg 
gggggctata aaagggggtg ggggcgcgtt 
cgagggccag ctgttggggt gagtcgacgc 
ttctcgcttc cggcggcatc gggatgcccg 
atgacgacca tcagggacag cttcaaggcc 
cgttgctggc gbbttbccab aggctccgcc 
caagbcagag gbggcgaaac ccgacaggac 
gcbcccbcgb gcgcbctcct gfcbccgaccc 
tccctbcggg aagcgbggcg cbttctcafca 
aggtcgbbcg cbccaagctg ggctgtgtgc 
ccbbatccgg taactabcgb cttgagtcca 
cagcagccac tggbaacagg abbagcagag 
tgaagtggtg gccbaactac ggctacacta 
tgaagccagb tacctbcgga aaaagagttg 
ctggtagcgg tggbbbtttt gtbtgcaagc 
aagaagatcc tbtgatcttb bctacggggb 
aagggabttt ggbcabgaga ttabcaaaaa 
aabgaagttb taaatcaatc baaagbabat 
gcbtaabcag tgaggcacct atctcagcga 



gcccgcggga bbgbgacbga cbttgcttbc 3720 
cgbbcabccg cccgcgafcga caagbtgacg 3780 
cgggaacbba abgtcgbbbc tcagcagctg 3840 
aaggcttcct ccccbcccaa bgcggtbbaa 3900 
atbbggatca agcaagtgtc ttgcbgbctb 3960 
cgggaccagc ggbcbcggbc gtbgagggtc 4 020 
tgactctgga bgbbcagaba cabgggcata 4080 
tgcagagcbt catgcbgcgg ggtggtgbtg 4140 
gcgtggtgcc baaaaatgtc bttcagbagc 4200 
baagtgttta caaagcggbb aagctgggab 4260 
tbggactgba ttbttaggbb ggctatgbtc 4320 
tgcagaacca ccagcacagb gbatccggbg 43 80 
ggaaatgcgb ggaagaactt ggagacgccc 4440 
tccataatga tggcaatggg cccacgggcg 4500 
cbaacgtcat agttgtgtbc caggatgaga 4560 
cggagggtgc cagacbgcgg tataatggtt 4620 
cagatttgca tttcccacgc ttbgagtbca 4680 
atgaagaaaa cggbttccgg ggtaggggag 474 0 
agctgcgact baccgcagcc ggtgggcccg 4800 
tagtbaagag agctgcagch gccgtcatcc 4 860 
tcccfcgactc gcatgttttc ccbgaccaaa 4920 
agcagbtcbt gcaaggaagc aaagttttbc 4980 
cttttgagcg ttbgaccaag cagttccagg 5040 
gcatcbcgat ccagcatatc tcctcgtbbc 5100 
gbagbcggtg ctcgbccaga cgggccaggg 5160 
tcagcgtagb ctgggtcacg gtgaaggggt 5220 
gcttgaggct ggtccbgctg gtgctgaagc 52 80 
ggtagcattt gaccatggtg tcatagbcca 5340 
gcttgcccbt ggaggaggcg ccgcacgagg 5400 
tgggcgcgag aaataccgab tccggggagb 5460 
tcbcgcabtc cacgagccag gbgagctctg 5520 
cabgctbbtt gatgcgtbbc tbacctcbgg 5580 
cgaaaaggct gbccgtgbcc ccgtatacag 5640 
cgcggbcctc cbcgbabaga aactcggacc 5700 
gcacgaagga ggctaagbgg gaggggbagc 5760 
ccagggbgtg aagacacatg tcgccctcbt 5820 
tgtaggccac gtgaccgggt gttccbgaag 5880 
cgbcctcacb cbctbccgca tcgctgbcbg 5940 
gaggctggab ggccttcccc attatgattc 6000 
cgtbgcaggc cabgcbgbcc aggcaggbag 6060 
agcaaaaggc caggaaccgb aaaaaggccg 6120 
ccccbgacga gcabcacaaa aatcga.cgct 6180 
tataaagaba ccaggcgbbt cccccbggaa 6240 
tgccgcbbac cggataccbg tccgccbbbc 6300 
gcbcacgctg taggtatctc agttcggbgt 6360 
acgaaccccc cgbbcagccc gaccgctgcg 6420 
acccggtaag acacgacbta tcgccacbgg 64 80 
cgaggbabgb aggcggbgcb acagagtbct 6540 
gaaggacagb abttggbatc tgcgctcbgc 6600 
gtagctcbbg abccggcaaa caaaccaccg 6660 
agcagattac gcgcagaaaa aaaggabctc 6720 
cbgacgcbca gbggaacgaa aactcacgbb 6780 
ggabcbtcac ctagabccbt tbaaabbaaa 6840 
abgagbaaac btggbcbgac agfcbaccaat 6900 
bcbgbctatt tcgbtcatcc atagbfcgccb 6960 
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gactccccgt cgbgtagaba actacgatac gggagggctt accatctggc cccagtgctg 7020 
caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 7080 
ccggaagggc cgagcgcaga agtggtcctg caacbtbabc cgcctccatc cagbcbabba 7140 
attgttgccg ggaagctaga gtaagtagtt cgccagttaa bagbbbgcgc aacgttgttg 7200 
ccattgctgc aggcatcgtg gbgbcacgcb cgtcgtttgg tatggcttca ttcagctccg 7260 
gbbcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggbbagcb 7320 
ccttcggtcc bccgabcgbb gbcagaagta agttggccgc agbgbbabca ctcatggtta 73 80 
bggcagcacb gcataattct cttactgtca bgccabccgb aagatgcttt tctgtgactg 7440 
gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt bgcbcbtgcc 7500 
cggcgtcaac acgggabaat accgcgccac atagcagaac tttaaaagtg cbcabcabbg 7560 
gaaaacgttc ttcggggcga aaacbctcaa ggabctbacc gctgttgaga tccagttcga 7620 
tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 7680 
ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 7740 
gttgaatact catactcttc ctbbbbcaab abbabbgaag cabtbabcag ggbbabbgbc 7800 
bcabgagcgg abacababbb gaabgbabtt agaaaaabaa acaaataggg gtbccgcgca 7860 
cabbbccccg aaaagbgcca ccbgacgbct aagaaaccab tatbabcabg acabbaaccb 7920 
abaaaaabag gcgtabcacg aggcccbbbc gbctbcaaga abtgtb 7966 



<210> 39 
<211> 7870 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pAdApb conbaining HIV genes 
<400> 39 

tbaabbaacc gcaatbctca bgtbtgacag cbbabcabca bcaataabab accbtabbbb 60 
ggatbgaagc caababgaba atgagggggb ggagbbbgtg acgbggcgcg gggcgbggga 120 
acggggcggg bgacgbagba gbgbggcgga agtgbgabgb bgcaagbgbg gcggaacaca 180 
bgbaagcgac ggabgbggca aaagbgacgb btbbggbgtg cgccggbgba cacaggaagb 240 
gacaabbbbc gcgcggbbbb aggcggabgb bgbagbaaab bbgggcgbaa ccgagbaaga 300 
bbtggccabb btcgcgggaa aacbgaabaa gaggaagbga aabcbgaaba abbbbgbgbb 360 
acbcabagcg cgbaabatbb gbcbagggcc gcggggacbb bgaccgbtba cgbggagacb 420 
cgcccaggbg tttbbcbcag gbgbbbbccg cgbbccgggb caaagbbggc gbbbbabbab 480 
babagbcagb acgbaccagb gcacbggccb agagcggccc cabbgcatac gbbgbatcca 540 
babcabaaba bgbacabbba tabbggcbca tgbccaacat baccgccabg bbgacabbga 600 
tbatbgacta gtbabbaaba gbaabcaabb acggggbcab tagbbcabag cccabababg 660 
gagbbccgcg bbacabaacb bacggbaaab ggcccgccbg gcbgaccgcc caacgacccc 720 
cgcccabtga cgtcaataab gacgbabghb cccatagtaa cgccaabagg gacbbbccab 780 
bgacgbcaab gggbggagba bbbacggbaa acbgcccacb bggcagbaca bcaagtgbab 84 0 
cababgccaa gbacgccccc babbgacgbc aabgacggba aabggcccgc cbggcatbab 900 
gcccagtaca bgaccbbabg ggacbbbccb acbbggcagb acabcbacgb abbagbcabc 960 
gcbabbacca bggbgabgcg gbbbbggcag bacabcaabg ggcgbggaba gcggbbtgac 1020 
tcacggggat bbccaagbct ccaccccabb gacgbcaabg ggagbbbgbb bbggcaccaa 1080 
aabcaacggg acbbbccaaa abgtcgtaac aacbccgccc cabbgacgca aabgggcggt 1140 
aggcgbgbac ggbgggaggt cbababaagc agagcbcgbb bagbgaaccg bcagabcgcc 12 00 
bggagacgcc abccacgcbg bbbbgaccbc cabagaagac accgggaccg atccagccbc 1260 
cgbcaccgbc gbcgacacgb gtgabcagab abcgcggccg cbctagaabg cgcgtgaagg 1320 
agaagbacca gcaccbgbgg cgcbggggcb ggcgcbgggg caccabgcbg cbgggcabgc 1380 
bgabgabcbg cagcgccacc gagaagcbgb gggbgaccgb gbacbacggc gbgcccgbgb 1440 
ggaaggaggc caccaccacc ctgcbcbgcg ccagcgacgc caaggccbac gacaccgagg 1500 
bgcacaacgb gbgggccacc cacgccfcgcg bgcccaccga ccccaacccc caggaggbgg 1560 
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tgctggtgaa cgtgaccgag aacttcgaca 
acgaggacat catcagcctg tgggaccaga 
tgtgcgtgag cctgaagtgc accgacgcta 
aggcctgccc caaggtgagc ttcgagccca 
tcgccatcct gaagtgcaag gacaagaagb 
gcaccgbgca gtgcacccac ggcatccgcc 
gtagcctggc cgaggaggag gbggbgabcc 
tgatcatcgb gcagctgaac gagagcgtgg 
cccgcaagag catccacatc ggccccggcc 
gcgacatccg ccaggcccac tgcaaccbga 
agatcgtgat caagctgcgc gagcagttcg 
gcggcggcga ccccgagatc gbgacccaca 
gcaacagcac ccagcbgtbc aacagcacct 
gcaacaacac cgagggcagc gacaccatca 
acatgtggca gaaggtgggc aaggccabgb 
gcagcagcaa catcaccggc ctgctgctga 
gcgagabcbb ccgtccgggc ggcggcgaca 
agtacaaggt ggtgaagatc gagcccctgg 
tccaggcccg ccagctgctg agcggcatcg 
tcgaggccca gcagcacctg ctgcagctga 
gcaccctggc cgtggagcgc tacctgaagg 
acaccaccbg gabggagtgg gaccgcgaga 
bgabcgagga gagccagaac cagcaggaga 
agtgggccag cctgtggaac bggbtcaaca 
cbgcbgtgcc ttcbagbbgc cagccabcbg 
cccbggaagg tgccacbccc acbgbcctbb 
gbcbgagbag gbgtcabbcb abtcbggggg 
abbgggaaga caabagcagg cabgcbgggg 
gcabaacbbc gbabaabgba bgcbabacga 
cgbggcbbaa gggbgggaaa gaabatabaa 
btbgcagcag ccgccgccgc cabgagcacc 
babbbgacaa cgcgcabgcc cccabgggcc 
atbgabggbc gccccgbccb gcccgcaaac 
ggaacgccgb bggagacbgc agccbccgcc 
gggabbgbga cbgacbbtgc bbbccbgagc 
bccgcccgcg abgacaagbb gacggcbcbb 
ctbaabgbcg bbbcbcagca gctgbbggab 
bccbccccbc ccaabgcggb bbaaaacaba 
abcaagcaag bgbcbbgcbg bcbtbabbba 
cagcggbcbc ggbcgbbgag ggbccbgbgb 
tggabgbbca gabacabggg cabaagcccg 
gcbtcatgcb gcggggbggb gbbgbagatg 
bgccbaaaaa bgbcbbbcag bagcaagcbg 
btbacaaagc ggbtaagcbg ggatgggbgc 
bgbabbbbba ggbbggcbab gbbcccagcc 
accaccagca cagtgbabcc ggbgcacbbg 
gcgbggaaga acbbggagac gcccbbgtga 
abgabggcaa bgggcccacg ggcggcggcc 
bcabagbbgt gbbccaggab gagabcgbca 
gbgccagacb gcggbabaab ggbbccabcc 
bgcabbbccc acgcbbbgag bbcagabggg 
aaaacggbbb ccggggtagg ggagabcagc 
gacbtaccgc agccggbggg cccgbaaabc 
agagagctgc agctgccgbc abcccbgagc 
acbcgcabgb tbbccctgac caaatccgcc 



bgbggaagaa cgacabggbg gagcagabgc 1620. 
gccbgaagcc cbgcgtgaag cbgacccccc 1680 
gcaccagcbg caacaccagc gbgatcaccc 1740 
bccccaacca ctacbgcgcc cccgccggcb 1800 
bcaacggcaa gggccccbgc accaacgbga 1860 
ccgbggtgag cacccagcbg cbgcbgaacg 1920 
gcagcgcbaa cbbcgccgac aacgccaagg 1980 
agabcaacbg cacccgcccc aacaacaaca 2040 
gcgccbbcba caccaccggc gagabcatcg 2100 
gccgcgccaa gbggaacgac acccbgaaca 2160 
gcaacaagac cabcgbgbbc aagcacagca 2220 
gcbbcaacbg cggcggcgag btcbbcbacb 2280 
ggbbcaacag caccbggagc accgagggca 2340 
cccbgcccbg ccgcabcaag cagatcabca 2400 
acgccccccc cabcagcggc cagabccgcb 2460 
cccgcgacgg cggcaacagc aacaacgaga 2520 
bgcgcgacaa cbggcgcagc gagcbgbaca 2580 
gcgbggcccc caccaaggcc aagcbtaccg 264 0 
tgcagcagca gaacaacctg cbgcgcgcca 2700 
ccgbgbgggg cabcaagcag cbgcaggccc 2760 
accagcagca gctcgagcag abcbggaacc 2820 
bcaacaacba caccagccbg abccacagcc 2880 
agaacgagca ggagcbgcbg gagcbggaca 2940 
bcaccaacbg gcbgbggbga ggabccagab 3000 
bbgbbbgccc ctcccccgtg ccbbcctbga 3060 
ccbaabaaaa bgaggaaabb gcabcgcabt 3120 
gtggggbggg gcagcacagc aagggggagg 3180 
abgcggbggg cbcbabgggb acccagggcc 3240 
agbbabaaga bcbgbacbga aabgbgbggg 3300 
ggtgggggbc bbabgbagtb bbgbabcbgb 3360 
aacbcgbbbg abggaagcab bgbgagcbca 3420 
9999tgcgbc agaabgbgab gggcbccagc 3480 
bctacbaccb bgaccbacga gaccgbgtcb 3540 
gccgcbbcag ccgcbgcagc caccgcccgc 3600 
ccgcbtgcaa gcagbgcagc bbcccgtbca 3660 
bbggcacaab tggabbcbbb gacccgggaa 3720 
cbgcgccagc aggbbbcbgc cctgaaggcb 3780 
aataaaaaac cagacbctgb bbggabbbgg 3840 
ggggbbbbgc gcgcgcggba ggcccgggac 3900 
abbbbbbcca ggacgbggba aaggbgacbc 3960 
bctcbggggb ggaggbagca ccactgcaga 4020 
abccagbcgb agcaggagcg cbgggcgbgg 4080 
abtgccaggg gcaggcccbb ggbgtaagbg 4140 
abacgbgggg ababgagabg catctbggac 4200 
ababcccbcc ggggabtcab gbbgtgcaga 4260 
ggaaatbbgb cabgtagcbb agaaggaaab 4320 
cctccaagab bbbccatgca bbcgtccata 4380 
tgggcgaaga babbtctggg abcactaacg 4440 
baggccabbb bbacaaagcg cgggcggagg 4500 
ggcccagggg cgbagbbacc cbcacagabb 4560 
gggabcabgb cbaccbgcgg ggcgabgaag 4620 
bgggaagaaa gcaggbbccb gagcagcbgc 4680 
acacctabba ccggcbgcaa cbggbagbba 4740 
a 999999 c ca cbbcgbbaag catgbcccbg 4800 
agaaggcgcb cgccgcccag cgatagcagb 4 860 
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tcttgcaagg aagcaaagbb tttcaacggt bbgagaccgb ccgccgtagg cabgcbbbbg 4920 
agcgbbbgac caagcagttc caggcggbcc cacagctcgg bcacctgcbc tacggcabcb 4980 
cgatccagca tatctcctcg bbtcgcgggb bggggcggct ttcgctgtac ggcagtagtc 5040 
ggtgctcgtc cagacgggcc agggbcabgb ctttccacgg gcgcagggtc ctcgtcagcg 5100 
bagbcbgggb cacggtgaag gggtgcgctc cgggctgcgc gctggccagg gtgcgcttga 5160 
ggctggtcct gctggtgcbg aagcgctgcc ggtcttcgcc ctgcgcgtcg gccaggbagc 5220 
atttgaccat ggtgbcabag bccagccccb ccgcggcgtg gcccttggcg cgcagcttgc 5280 
ccttggagga ggcgccgcac gaggggcagb gcagactttt gagggcgtag agcbbgggcg 5340 
cgagaaatac cgattccggg gagtaggcat ccgcgccgca ggccccgcag acggtctcgc 5400 
abbccacgag ccaggtgagc bcbggccgtb cggggbcaaa aaccaggttt cccccatgcb 5460 
ttttgatgcg tttcttacct ctggtttcca tgagccggtg tccacgctcg gfcgacgaaaa 5520 
ggctgtccgt gtccccgtat acagacttga gaggcctgtc ctcgagcggt gttccgcggt 5580 
ccbccbcgba bagaaacbcg gaccactctg agacaaaggc tcgcgtccag gccagcacga 5640 
aggaggctaa gtgggagggg tagcggtcgt tgtccactag ggggtccact cgctccaggg 5700 
tgtgaagaca catgtcgccc bcbbcggcab caaggaaggt gatbggbbbg baggbgbagg 5760 
ccacgtgacc gggtgttcct gaaggggggc babaaaaggg ggtgggggcg cgttcgtcct 5820 
cactctctbc cgcatcgctg hctgcgaggg ccagctgttg gggtgagtcg acgcgaggct 5880 
ggabggcctt ccccattatg abbcbbcbcg cbbccggcgg catcgggatg cccgcgttgc 5940 
aggccatgct gtccaggcag gbagabgacg accatcaggg acagcttcaa ggccagcaaa 6000 
aggccaggaa ccgtaaaaag gccgcgbbgc bggcgbbbbt ccataggctc cgcccccctg 6060 
acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 6120 
gabaccaggc gtttccccct ggaagcbccc tcgtgcgctc tcctgttccg acccbgccgc 6180 
ttaccggata cctgtccgcc bbbcbcccbb cgggaagcgt ggcgcbbtcb catagctcac 6240 
gctgtaggta tetcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac 6300 
cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg 6360 
taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt 6420 
atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga 6480 
cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct 6540 
cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga 6600 
ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg 6660 
ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct 6720 
tcacctagafc ccttttaaat taaaaatgaa gttttaaatc aatctaaagb atabatgagt 6780 
aaacttggtc tgacagttac caatgcttaa tcagbgaggc acctatctca gcgatctgtc 6840 
tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg 6900 
• gcttaccatc tggccccagt gctgcaatga baccgcgaga cccacgctca ccggctccag 6960 
atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt 7020 
tatccgcctc catccagtct atfcaattgtt gccgggaagc tagagtaagt agttcgccag 7080 
ttaatagttt gcgcaacght gttgccattg ctgcaggcat cgtggtgtca cgctcgtcgt 7140 
ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca 7200 
tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg 7260 
ccgcagtgtt atcactcatg gttatggcag cactgcataa fctctcttact gtcatgccat 7320 
ccgtaagatg cttttctgtg actggtgagb acbcaaccaa gbcabtcbga gaabagbgba 7380 
tgcggcgacc gagbtgcbcb bgcccggcgb caacacggga taataccgcg ccacabagca 744 0 
gaacbtbaaa agbgcbcabc abbggaaaac gbbcbtcggg gcgaaaactc bcaaggabcb 7500 
baccgctgbb gagatccagb bcgabgbaac ccacbcgtgc acccaacbga tcbbcagcab 7560 
cbbbbacbbb caccagcgtb bcbgggbgag caaaaacagg aaggcaaaab gccgcaaaaa 7620 
agggaabaag ggcgacacgg aaabgtbgaa bacbcabacb cbbcctbtbb caababbatt 7680 
gaagcabtba bcagggbbat bgbcbcabga gcggabacab abbbgaabgb atbbagaaaa 7740 
abaaacaaab aggggbbccg cgcacabtbc cccgaaaagb gccaccbgac gtcbaagaaa 7 800 
ccabbabtab cabgacabba acctataaaa abaggcgbab cacgaggccc btbcgbcbbc 7860 
aagaabtgbb 7870 
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<211> 7906 
<212> DNA 

* 

<213> Artificial Sequence 
<220> 

<223> plasmid pAdApt containing HIV genes 
<400> 40 

ttaattaacc gcaattctca tgtttgacag cttatcatca tcaataatat accttatttt 60 
ggattgaagc caatatgata atgagggggt ggagtttgtg acgtggcgcg gggcgtggga 120 
a cggggcggg tgacgtagta gtgtggcgga agtgtgatgt tgcaagtgtg gcggaacaca 18 0 
tgtaagcgac ggatgtggca aaagtgacgt ttttggtgtg cgccggtgta cacaggaagt 24 0 
gacaattttc gcgcggtttt aggcggatgt tgtagtaaat ttgggcgtaa ccgagtaaga 300 
tttggccatt ttcgcgggaa aactgaataa gaggaagtga aatctgaata attttgtgtt 360 
actcatagcg cgtaatattt gtctagggcc gcggggactt tgaccgttta cgtggagact 42 0 
cgcccaggtg tttttctcag gtgttttccg cgttccgggt caaagttggc gttttattat 480 
tatagtcagt acgtaccagt gcactggcct agagcggccc cattgcatac gttgtatcca 54 0 
tatcataata tgtacattta tattggctca tgtccaacat taccgccatg ttgacattga 600 
ttattgacta gttattaata gtaatcaatt acggggtcat tagttcatag cccatatatg 660 
gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc caacgacccc 720 
cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg gactttccat 780 
tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca tcaagtgtat 84 0 
catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc ctggcattat 900 
gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt attagtcatc 960 
gctattacca tggtgatgcg gttttggcag tacatcaatg ggcgtggata gcggtttgac 1020 
tcacggggat ttccaagtct ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa 1080 
aatcaacggg actttccaaa atgtcgtaac aactccgccc cattgacgca aatgggcggt 114 0 
aggcgtgtac ggtgggaggt ctatataagc agagctcgtt tagtgaaccg tcagatcgcc 1200 
tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 1260 
cgtcaccgtc gtcgacacgt gtgatcagat atcgcggccg ctctagaatg cgcgtgaagg 1320 
agaagtacca gcacctgtgg cgctggggct ggcgctgggg caccatgctg ctgggcatgc 1380 
tgatgatctg cagcgccacc gagaagctgt gggtgaccgt gtactacggc gtgcccgtgt 1440 
ggaaggaggc caccaccacc ctgctctgcg ccagcgacgc caaggcctac gacaccgagg 1500 
tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc caggaggtgg 1560 
tgctggtgaa cgtgaccgag aacttcgaca tgtggaagaa cgacatggtg gagcagatgc 1620 
acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag ctgacccccc 1680 
tgtgcgtgag cctgaagtgc accgacgcta gcaagaactg cagcttcaac atcagcacca 1740 
gcatccgcgg caaggtgcag aaggagtacg ccttcttcta caagctggac atcatcccca 1800 
tcgacaacga caccaccagc tacagcctga ccagctgcaa caccagcgtg atcacccagg 1860 
cctgccccaa ggtgagcttc gagcccatcc ccaaccacta ctgcgccccc gccggcttcg 1920 
ccatcctgaa gtgcaaggac aagaagttca acggcaaggg cccctgcacc aacgtgagca 1980 
ccgtgcagtg cacccacggc atccgccccg tggtgagcac ccagctgctg ctgaacggta 204 0 
gcctggccga ggaggaggtg gtgatccgca gcgctaactt cgccgacaac gccaaggtga 2100 
tcatcgtgca gctgaacgag agcgtggaga tcaactgcac ccgcgctagc gcccactgca 2160 
acctgagccg cgccaagtgg aacgacaccc tgaacaagat cgtgatcaag ctgcgcgagc 2220 
agttcggcaa caagaccatc gtgttcaagc acagcagcgg cggcgacccc gagatcgtga 2280 
cccacagctt caactgcggc ggcgagttct tctactgcaa cagcacccag ctgttcaaca 2340 
gcacctggtt caacagcacc tggagcaccg agggcagcaa caacaccgag ggcagcgaca 2400 
ccatcaccct gccctgccgc atcaagcaga tcatcaacat gtggcagaag gtgggcaagg 2460 
ccatgtacgc cccccccatc agcggccaga tccgctgcag cagcaacatc accggcctgc 2520 
tgctgacccg cgacggcggc aacagcaaca acgagagcga gatcttccgt ccgggcggcg 2580 
gcgacatgcg cgacaactgg cgcagcgagc tgtacaagta caaggtggtg aagatcgagc 264 0 
ccctgggcgt ggcccccacc aaggccaagc ttaccgtcca ggcccgccag ctgctgagcg 2700 
gcatcgtgca gcagcagaac aacctgctgc gcgccatcga ggcccagcag cacctgctgc 2760 
agctgaccgt gtggggcatc aagcagctgc aggcccgcac cctggccgtg gagcgctacc 2820 
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tgaaggacca gcagcagctc gagcagatct ggaaccacac cacctggatg gagtgggacc 2880 
gcgagatcaa caactacacc agcctgatcc acagcctgat cgaggagagc cagaaccagc 2 940 
aggagaagaa cgagcaggag ctgctggagc tggacaaghg ggccagcctg tggaactggt 3000 
tcaacatcac caacbggcbg tggtgaggat ccagatcbgc tgtgccttct agbbg'ccagc 3060 
catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc acbcccacbg 3120 
tcctttccta ataaaatgag gaaatbgcab cgcattgtct gagtaggtgt cabbcbabbc 3180 
tggggggtgg ggtggggcag cacagcaagg gggaggattg ggaagacaat agcaggcatg 3240 
cbggggabgc ggtgggctct atgggtaccc agggccgcat aacttcgtat aatgtatgct 3300 
atacgaagtt abaagabcbg tactgaaatg tgtgggcgtg gcttaagggt gggaaagaat 3360 
abataaggtg ggggtcttat gtagttttgt atctgttttg cagcagccgc cgccgccatg 3420 
agcaccaact cgtttgatgg aagcatbgtg agcbcababb bgacaacgcg cafcgccccca 34 80 
tgggccgggg tgcgbcagaa tgtgatgggc tccagcattg atggtcgccc cgtcctgccc 3540 
gcaaactcta cbacctbgac ctacgagacc gtgtctggaa cgccgtbgga gacbgcagcc 3600 
tccgccgccg cttcagccgc tgcagccacc gcccgcggga bbgtgacbga cbbbgcbbbc 3660 
ctgagcccgc bbgcaagcag tgcagcttcc cgbtcabccg cccgcgatga caagttgacg 3720 
gcbcbbbbgg cacaattgga bbctbtgacc cgggaactta abgbcgbbbc tcagcagctg 3780 
ttggatctgc gccagcaggb ttctgccctg aaggcbtccb cccctcccaa tgcggtttaa 3840 
aacataaata aaaaaccaga cbcbgbbbgg atttggatca agcaagbgbc ttgctgtctt 3900 
tatttagggg ttttgcgcgc gcggtaggcc cgggaccagc ggtctcggtc gbtgagggbc 3 960 
cbgtgtabbb btbccaggac gbggbaaagg bgacbcbgga tgbbcagaba cabgggcaba 4020 
agcccgtcbc bggggtggag gtagcaccac bgcagagcbb cabgcbgcgg ggtggbgtbg 4080 
tagabgabcc agbcgbagca ggagcgcbgg gcgtggbgcc baaaaatgbc tbtcagbagc 4140 
aagcbgattg ccaggggcag gcccbtggbg baagbgbbta caaagcggbb aagcbgggab 42 00 
gggbgcatac gbggggatab gagabgcatc bbggacbgta btbbbaggtb ggcbabgbbc 4260 
ccagccabab cccbccgggg abbcatgbbg tgcagaacca ccagcacagb gbabccggbg 4320 
cacbbgggaa atbbgbcatg bagcbbagaa ggaaatgcgb ggaagaacbb ggagacgccc 43 80 
bbgbgaccbc caagabbbbc catgcattcg tccabaabga bggcaatggg cccacgggcg 4440 
gcggccbggg cgaagabatb bcbgggatca cbaacgtcab agbbgbgbbc caggabgaga 4500 
bcgbcabagg ccabbbbbac aaagcgcggg cggagggbgc cagacbgcgg babaabggbt 4560 
ccatccggcc caggggcgba gbbacccbca cagabbbgca bbbcccacgc ttbgagtbca 4620 
gabgggggga bcatgbcbac cbgcggggcg abgaagaaaa cggtbbccgg ggbaggggag 4680 
abcagcbggg aagaaagcag gbbccbgagc agcbgcgacb baccgcagcc ggbgggcccg 4740 
baaatcacac cbabbaccgg cbgcaacbgg bagbtaagag agcbgcagcb gccgbcabcc 4800 
cbgagcaggg gggccacbbc gbtaagcabg bcccbgacbc gcabgbbbbc ccbgaccaaa 4860 
bccgccagaa ggcgctcgcc gcccagcgab agcagbbcbb gcaaggaagc aaagbbbbbc 4920 
aacggbttga gaccgtccgc cgbaggcabg cbbbbgagcg bbbgaccaag cagbbccagg 4980 
cggbcccaca gcbcggbcac cbgcbcbacg gcabcbcgab ccagcababc bccbcgbbtc 5040 
gcgggbbggg gcggcbbtcg cbgtacggca gtagtcggbg cbcgtccaga cgggccaggg 5100 
bcabgbcbbb ccacgggcgc agggbccbcg bcagcgbagb cbgggtcacg gbgaaggggb 5160 
gcgcbccggg cbgcgcgcbg gccagggbgc gcbbgaggcb ggbccbgcbg gbgctgaagc 5220 
gcbgccggbc bbcgcccbgc gcgbcggcca ggtagcabbb gaccabggbg bcabagbcca 5280 
gccccbccgc ggcgbggccc tbggcgcgca gcbbgcccbb ggaggaggcg ccgcacgagg 534 0 
ggcagbgcag acbbbbgagg gcgbagagcb bgggcgcgag aaabaccgab bccggggagb 54 00 
aggcabccgc gccgcaggcc ccgcagacgg bcbcgcabbc cacgagccag gbgagcbcbg 5460 
gccgbbcggg gbcaaaaacc aggbttcccc cabgcbbbtb gabgcgbbbc btaccbctgg 5520 
bbbccabgag ccggbgbcca cgcbcggbga cgaaaaggcb gbccgbgbcc ccgbabacag 5580 
acbbgagagg ccbgbccbcg agcggbgbbc cgcggbccbc cbcgbabaga aacbcggacc 5640 
acbcbgagac aaaggctcgc gbccaggcca gcacgaagga ggcbaagbgg gaggggbagc 5700 
ggbcgbbgbc cacbaggggg bccacbcgcb ccagggbgbg aagacacabg bcgcccbcbb 5760 
cggcabcaag gaaggbgatb ggbbbgbagg tgbaggccac gbgaccgggb gbbccbgaag 5820 
gggggcbaba aaagggggtg ggggcgcgbb cgbccbcacb cbcbtccgca bcgcbgtctg 5880 
cgagggccag cbgtbggggb gagbcgacgc gaggcbggab ggcctbcccc abbabgabbc 5 94 0 
bbcbcgcbbc cggcggcabc gggabgcccg cgbbgcaggc cabgcbgbcc aggcaggbag 6000 
abgacgacca bcagggacag cbbcaaggcc agcaaaaggc caggaaccgb aaaaaggccg 6060 
cgbbgcbggc gbbbbbccab aggcbccgcc ccccbgacga gcabcacaaa aabcgacgcb 6120 
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caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 6180 
gctccctcgt gcgctctcct gbtccgaccc tgccgcttac cggatacctg tccgcctttc 6240 
tcccttcggg aagcgtggcg ctttctcata gctcacgctg baggtatctc agttcggtgb 6300 
aggbcgtbcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 6360 
ccttabccgg baacbatcgb cbtgagbcca acccggtaag acacgactta bcgccacbgg 6420 
cagcagccac tggtaacagg attagcagag cgaggtatgb aggcggtgct acagagttct 64 80 
tgaagtggbg gcchaactac ggcbacacba gaaggacagt atttggtatc tgcgctctgc 6540 
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 6600 
ctggtagcgg tggtbbtttb gttbgcaagc agcagattac gcgcagaaaa aaaggatctc 6660 
aagaagabcc ttbgabcttb bctacggggb ctgacgctca gtggaacgaa aactcacgbb 6720 
aagggatttt ggbcabgaga bbabcaaaaa ggabctbcac ctagatcctt ttaaattaaa 6780 
aatgaagttt taaatcaatc baaagbabab atgagtaaac btggtctgac agttaccaat 6840 
gcbtaafccag tgaggcacct abcbcagcga tctgtctabt bcgbbcabcc abagnbgcct 6900 
gactccccgb cgbgtagaba acbacgabac gggagggcbt accatcbggc cccagbgcbg 6960 
caatgatacc gcgagaccca cgcbcaccgg cbccagabbb abcagcaaba aaccagccag 7020 
ccggaagggc cgagcgcaga agbggbccbg caacbbbatc cgccbccabc cagtcbabba 7080 
abbgbbgccg ggaagcbaga gbaagbagbb cgccagbbaa bagbbbgcgc aacgtbgbbg 7140 
ccabbgcbgc aggcabcgbg gbgbcacgcb cgbcgbbbgg babggcbbca btcagctccg 7200 
gbtcccaacg atcaaggcga gbbacatgat cccccabgtb gbgcaaaaaa gcggtbagcb 7260 
cctbcggtcc bccgabcgbb gbcagaagta agttggccgc agtgbbabca ctcabggbba 7320 
bggcagcact gcabaabbcb cbtacbgbca bgccabccgb aagatgcbbb bctgbgacbg 7380 
gbgagbacbc aaccaagbca tbcbgagaab agbgbatgcg gcgaccgagb bgctctbgcc 744 0 
cggcgbcaac acgggabaab accgcgccac abagcagaac bbtaaaagbg cbcatcabbg 7500 
gaaaacgtbc bbcggggcga aaacbcbcaa ggabcbbacc gcbgbbgaga bccagbbcga 7560 
tgbaacccac bcgbgcaccc aacbgabcbb cagcabcbbb bacbtbcacc agcgbbbcbg 7620 
ggbgagcaaa aacaggaagg caaaabgccg caaaaaaggg aabaagggcg acacggaaat 7680 
gbtgaabacb cabacbcbbc cbbbbbcaab abbabbgaag catbbabcag ggbbattgbc 7740 
bcabgagcgg abacababbt gaabgbabbb agaaaaabaa acaaabaggg gbbccgcgca 7800 
cabbbccccg aaaagbgcca ccbgacgbcb aagaaaccab tabbabcabg acabbaaccb 7860 
abaaaaatag gcgbabcacg aggcccbtbc gbcbbcaaga abbgtb 7906 



<210> 41 
<211> 7909 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pAdApb containing HIV genes 
<400> 41 

bbaabbaacc gcaabtctca tgtttgacag ctbatcabca tcaabaatat acctbabttb 60 

ggatbgaagc caababgaba atgagggggt ggagtbbgbg acgtggcgcg gggcgtggga 120 

acggggcggg bgacgbagba gtgtggcgga agbgtgabgt tgcaagtgbg gcggaacaca 180 

bgbaagcgac ggabgtggca aaagtgacgt tbbbggbgbg cgccggtgta cacaggaagb 24 0 

gacaabbbbc gcgcggbbbb aggcggabgt bgbagbaaat btgggcgtaa ccgagtaaga 300 

tttggccabb tbcgcgggaa aacbgaataa gaggaagbga aabcbgaaba atbtbgbgtb 360 

acbcatagcg cgtaatattt gtctagggcc gcggggacbt tgaccgbbba cgbggagacb 420 

cgcccaggbg tbttbcbcag gbgtbbtccg cgttccgggt caaagttggc gtbbbabbat 4 80 

babagbcagt acgbaccagb gcacbggccb agagcggccc cattgcabac gttgbatcca 540 

tabcabaaba tgbacabbba tabbggcbca bgbccaacat baccgccatg btgacabbga 600 

ttabtgacba gbbabbaaba gbaabcaabb acggggbcat bagtbcabag cccabababg 660 

gagbbccgcg bbacabaact bacggbaaab ggcccgccbg gcbgaccgcc caacgacccc 720 

cgcccabtga cgtcaataat gacgtabgbb cccatagbaa cgccaabagg gactbtccab 780 
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bgacgtcaab gggtggagba bbbacggbaa acbgcccacb hggcagtaca tcaagtgtat 840 
cababgccaa gtacgccccc tattgacgtc aabgacggba aatggcccgc ctggcattat 900 
gcccagtaca tgaccttatg ggactbbccb acttggcagt acabctacgb attagtcatc 960 
gctattacca tggtgatgcg gttttggcag tacatcaatg ggcgbggaba gcggbbbgac 1020 
tcacggggat ttccaagtct ccaccccatt gacgtcaabg ggagtttgtt ttggcaccaa 1080 
aatcaacggg acbbbccaaa atgtcgtaac aactccgccc cattgacgca aatgggcggt 1140 
aggcgtgtac ggbgggaggt cbababaagc agagctcgbt bagbgaaccg bcagabcgcc 1200 
tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagccbc 1260 
cgbcaccgbc gtcgacacgb gtgatcagat atcgcggccg ctcbagaatg cgcgtgaagg 1320 
agaagtacca gcacctgtgg cgcbggggct ggcgctgggg caccatgctg cbgggcabgc 1380 
tgatgatctg cagcgccacc gagaagctgt gggfcgaccgt gtactacggc gbgcccgtgt 144 0 
ggaaggaggc caccaccacc cbgcbcbgcg ccagcgacgc caaggcctac gacaccgagg 1500 
bgcacaacgt gbgggccacc cacgcctgcg bgcccaccga ccccaacccc caggaggtgg 1560 
tgcbggtgaa cgbgaccgag aactbcgaca bgbggaagaa cgacabggbg gagcagabgc 162 0 
acgaggacat catcagcctg tgggaccaga gccbgaagcc ctgcgbgaag cbgacccccc 1680 
bgbgcgbgag ccfcgaagbgc accgacgcba gcaagaacbg cagctbcaac abcagcacca 1740 
gcabccgcgg caaggtgcag aaggagtacg cctbcttcba caagctggac abcabcccca 1800 
bcgacaacga caccaccagc tacagccbga ccagctgcaa caccagcgbg atcacccagg 1860 
cctgccccaa ggbgagcbtc gagcccabcc ccaaccacba cbgcgccccc gccggctbcg 1920 
ccabcctgaa gbgcaaggac aagaagbbca acggcaaggg cccctgcacc aacgtgagca 1980 
ccgtgcagtg cacccacggc atccgccccg tggbgagcac ccagcbgcbg cbgaacggba 2040 
gcctggccga ggaggaggtg gtgabccgca gcgctaacbt cgccgacaac gccaaggtga 2100 
bcatcgbgca gcbgaacgag agcgtggaga bcaacbgcac ccgccccaac aacaacaccc 2160 
gcaagagcab ccacabcggc cccggccgcg ccbbcbacac caccggcgag abcabcggcg 2220 
acabccgcca ggcccacbgc aaccbgagcc gcgccaagbg gaacgacacc ctgaacaaga 2280 
bcgtgabcaa gcbgcgcgag cagbtcggca acaagaccat cgbgbbcaag cacagcagcg 2340 
gcggcgaccc cgagatcgtg acccacagcb bcaacbgcgg cggcgagttc ttctactgca 24 00 
acagcgcbag ccbgcccbgc cgcabcaagc agabcabcaa cabgbggcag aaggtgggca 2460 
aggccabgta cgcccccccc abcagcggcc agatccgcbg cagcagcaac atcaccggcc 2520 
bgctgcbgac ccgcgacggc ggcaacagca acaacgagag cgagabcbbc cgbccgggcg 2 580 
gcggcgacab gcgcgacaac bggcgcagcg agcbgbacaa gbacaaggbg gbgaagatcg 2 640 
agccccbggg cgbggccccc accaaggcca agcbbaccgb ccaggcccgc cagctgcbga 2700 
gcggcabcgb gcagcagcag aacaaccbgc bgcgcgccat cgaggcccag cagcacctgc 2760 
bgcagcbgac cgtgbggggc abcaagcagc bgcaggcccg cacccbggcc gbggagcgcb 2820 
acctgaagga ccagcagcag cbcgagcaga tcbggaacca caccaccbgg abggagbggg 2 880 
accgcgagat caacaacbac accagccbga bccacagcct gabcgaggag agccagaacc 2940 
agcaggagaa gaacgagcag gagctgcbgg agcbggacaa gbgggccagc cbgbggaacb 3 000 
ggbtcaacab caccaacbgg cbgbggbgag gatccagabc bgcbgbgcct bcbagbbgcc 3 060 
agccatctgb bgtbbgcccc bcccccgbgc cbbccbbgac cctggaaggt gccactccca 3120 
cbgtcctbbc cbaabaaaab gaggaaabbg cabcgcabbg bcbgagbagg bgbcabbcba 3180 
tbcbgggggg tggggbgggg cagcacagca agggggagga bbgggaagac aabagcaggc 3240 
abgctgggga bgcggbgggc bctabgggba cccagggccg cabaacbbcg babaabgbab 33 00 
gcbabacgaa gbtabaagab cbgbacbgaa atgbgbgggc gbggcbtaag ggtgggaaag 3360 
aabababaag gtgggggbcb batgbagbbb bgtabcbgbb bbgcagcagc cgccgccgcc 3420 
abgagcacca acbcgtbbga bggaagcabt gbgagcbcat abbbgacaac gcgcabgccc 3480 
ccabgggccg gggbgcgbca gaabgbgabg ggcbccagca bbgabggbcg ccccgbccbg 3540 
cccgcaaacb ctacbaccbb gaccbacgag accgbgbcbg gaacgccgbb ggagacbgca 3600 
gccbccgccg ccgcbtcagc cgctgcagcc accgcccgcg ggabbgtgac tgacbbbgcb 3660 
bbcctgagcc cgcbbgcaag cagbgcagcb tcccgbbcab ccgcccgcga tgacaagtbg 3720 
acggcbcbtb bggcacaatb ggabtctbtg acccgggaac bbaabgtcgb bbcbcagcag 3780 
ctgttggabc tgcgccagca ggbbtcbgcc cbgaaggcbt ccbccccbcc caabgcggbt 3840 
fcaaaacataa abaaaaaacc agactcbgtb tggabbbgga bcaagcaagb gbcbbgcbgb 3900 
cbbtabbbag gggbbbbgcg cgcgcggbag gcccgggacc agcggbcbcg gbcgbbgagg 3 960 
gtcctgbgba tbbbbtccag gacgtggbaa aggbgacbcb ggabgttcag abacabgggc 4020 
abaagcccgb cbcbggggbg gaggtagcac cacbgcagag cbbcabgctg cggggbggbg 4 080 
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ttgtagatga tccagtcgta gcaggagcgc 
agcaagctga ttgccagggg caggcccttg 
gatgggtgca tacgtgggga tatgagatgc 
ttcccagcca tatccctccg gggattcatg 
gtgcacttgg gaaatttgtc atgtagctta 
cccttgtgac ctccaagatt ttccabgcat 
gcggcggcct gggcgaagat atttctggga 
agatcgtcat aggccafcfctt tacaaagcgc 
gttccatccg gcccaggggc gtagttaccc 
tcagatgggg ggatcatgtc tacctgcggg 
gagatcagct gggaagaaag caggttcctg 
ccgtaaatca cacctattac cggctgcaac 
tccctgagca ggggggccac ttcgttaagc 
aaatccgcca gaaggcgctc gccgcccagc 
btcaacggtt tgagaccgtc cgccgtaggc 
aggcggtccc acagctcggt cacctgctct 
ttcgcgggtt ggggcggctt tcgctgtacg 
gggtcatgtc tttccacggg cgcagggtcc 
ggtgcgctcc gggctgcgcg ctggccaggg 
agcgctgccg gtcttcgccc tgcgcgtcgg 
ccagcccctc cgcggcgtgg cccttggcgc 
aggggcagtg cagacttttg agggcgtaga 
agtaggcatc cgcgccgcag gccccgcaga 
ctggccgttc ggggtcaaaa accaggtttc 
tggtttccat gagccggtgt ccacgctcgg 
cagacttgag aggcctgtcc tcgagcggtg 
accactctga gacaaaggct cgcgtccagg 
agcggtcgtt gtccactagg gggtccactc 
cttcggcatc aaggaaggtg attggtttgt 
aaggggggct ataaaagggg gtgggggcgc 
ctgcgagggc cagctgttgg ggtgagtcga 
ttcttctcgc ttccggcggc atcgggatgc 
tagatgacga ccatcaggga cagcttcaag 
*ccgcgttgct ggcgtttttc cataggctcc 
gctcaagtca gaggtggcga aacccgacag 
gaagctccct cgtgcgctct cctgttccga 
ttctcccttc gggaagcgtg gcgctttctc 
tgtaggtcgt tcgctccaag ctgggctgtg 
gcgccttatc cggtaactat cgtcttgagt 
tggcagcagc cactggtaac aggattagca 
tcttgaagtg gtggcctaac tacggctaca 
tgctgaagcc agttaccttc ggaaaaagag 
ccgcfcggtag cggtggtttt tttgtttgca 
ctcaagaaga tcctttgatc ttttctacgg 
gttaagggat tttggtcatg agattatcaa 
aaaaatgaag ttttaaatca atctaaagta 
aatgcttaat cagtgaggca cctatctcag 
cctgactccc cgtcgtghag ataactacga 
ctgcaatgat accgcgagac ccacgctcac 
cagccggaag ggccgagcgc agaagtggtc 
ttaattgttg ccgggaagct agagtaagta 
ttgccattgc tgcaggcatc gtggtgtcac 
ccggttccca acgatcaagg cgagttacat 
gctccttcgg tcctccgafcc gttgtcagaa 
ttatggcagc actgcataat tctcttactg 



tgggcgtggt gcctaaaaat gtctttcagt 4140 

gtgtaagtgt ttacaaagcg ghtaagctgg 4200 
atcttggact gtatttttag gttggctatg 4260 
ttgtgcagaa ccaccagcac agtgtatccg 4320 
gaaggaaatg cgtggaagaa chtggagacg 43 80 
tcgtccataa tgatggcaat gggcccacgg 4440 
tcactaacgt catagttgtg ttccaggatg 4500 
999 c 99 a 999 tgccagactg cggtataatg 4560 
tcacagattt gcatttccca cgctttgagt 4620 
gcgatgaaga aaacggtttc cggggtaggg 4680 
agcagctgcg acttaccgca gccggtgggc 4740 
tggtagttaa gagagctgca gctgccgtca 4800 
atgtccctga ctcgcatgtt ttccctgacc 4860 
gatagcagtt cttgcaagga agcaaagttt 4920 
atgchtttga gcgtttgacc aagcagttcc 4980 
acggcatctc gatccagcat atctcctcgt 5040 
gcagtagtcg gtgctcgtcc agacgggcca 5100 
tcghcagcgt agtctgggtc acggtgaagg 5160 
tgcgcttgag gctggtcctg ctggtgctga 5220 
ccaggtagca tttgaccatg gtgtcatagt 52 80 
gcagcttgcc cttggaggag gcgccgcacg 5340 
gcttgggcgc gagaaatacc gattccgggg 54 00 
cggtctcgca ttccacgagc cagghgagct 5460 
ccccatgcht fcttgatgcgt ttcttacctc 5520 
tgacgaaaag gctgtccgtg bccccgtata 5580 
ttccgcggtc ctcctcgtat agaaactcgg 5640 
ccagcacgaa ggaggctaag fcgggaggggt 5700 
gctccagggt gtgaagacac atgtcgccct 5760 
aggtgtaggc cacgtgaccg ggtgttcctg 5820 
gttcgtcctc actctcttcc gcatcgchgt 5880 
cgcgaggctg gatggccttc cccattatga 5940 
ccgcgttgca ggccatgctg tccaggcagg 6000 
gccagcaaaa ggccaggaac cgtaaaaagg 6060 
gcccccctga cgagcatcac aaaaatcgac 6120 
gacfcataaag ataccaggcg tttccccctg 6180 
ccctgccgct taccggatac ctgtccgcct 6240 
atagctcacg ctgtaggtat ctcagttcgg 63 00 
tgcacgaacc ccccgttcag cccgaccgct 6360 
ccaacccggt aagacacgac ttatcgccac 6420 
gagcg[aggta tgtaggcggt gctacagagt 64 80 
ctagaaggac agfcatttggt atctgcgctc 6540 
ttggtagctc ttgatccggc aaacaaacca 6600 
agcagcagat tacgcgcaga aaaaaaggat 6660 
ggtctgacgc tcagtggaac gaaaactcac 6720 
aaaggatctt cacct agate cttttaaatt 6780 
tatatgagta aacttggfcct gacagtfcacc 6840 
cgatctgtct atttegttea tccatagttg 6900 
tacgggaggg cttaccatct ggccccagtg 6960 
cggctccaga tttatcagca ataaaccagc 7020 
ctgcaacttt atccgcctcc atccagtcta 7080 
gttcgccagt taatagtttg cgcaacgttg 7140 
getegtegtt tggtatggct fceattcaget 7200 
gatcccccat gttgtgcaaa aaagcggtta 7260 
gtaagttggc cgcagtgtta tcactcatgg 7320 
bcatgccatc cgtaagatgc ttttctgtga 7380 
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ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 7440 

gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 7500 

ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 7560 

cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 7620 

ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 7680 

aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 7740 

gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaaba ggggttccgc 7800 

gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc atgacattaa 7860 

cctataaaaa taggcgtatc acgaggccct ttcgtcttca agaattgtt 7909 



<210> 42 
<211> 7B67 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pAdApt containing HIV genes 
<400> 42 

ttaattaacc gcaattctca tgtttgacag cttatcatca tcaataatat accttatttt 60 
ggattgaagc caatatgata atgagggggt ggagtttgtg acgtggcgcg gggcgtggga 120 
acggggcggg tgacgtagta gtgtggcgga agtgtgatgt tgcaagtgtg gcggaacaca 18 0 
tgtaagcgac ggatgtggca aaagtgacgt ttttggtgtg cgccggtgta cacaggaagt 240 
gacaattttc gcgcggtttt aggcggatgt tgtagtaaat ttgggcgtaa ccgagtaaga 300 
tttggccatt ttcgcgggaa aactgaataa gaggaagtga aatctgaata attttgtgtt 360 
actcatagcg cgtaatattt gtctagggcc gcggggactt tgaccgttta cgtggagact 420 
cgcccaggtg tttttctcag gtgttttccg cgttccgggt caaagttggc gttttattat 480 
tatagtcagt acgtaccagt gcactggcct agagcggccc cattgcatac gttgtatcca 540 
tatcataata tgtacattta tattggctca tgtccaacat taccgccatg ttgacattga 600 
ttattgacta gttattaata gtaatcaatt acggggtcat tagttcatag cccatatatg 660 
gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc caacgacccc 720 
cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg gactttccat 780 
tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca tcaagtgtat 84 0 
catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc ctggcattat 900 
gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt attagtcatc 960 
gctattacca tggtgatgcg gttttggcag tacatcaatg ggcgtggata gcggtttgac 1020 
tcacggggat ttccaagtct ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa 1080 
aatcaacggg actttccaaa atgtcgtaac aactccgccc cattgacgca aatgggcggt 1140 
aggcgtgtac ggtgggaggt ctatataagc agagctcgtt tagtgaaccg tcagatcgcc 1200 
tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 1260 
cgtcaccgtc gtcgacacgt gtgatcagat atcgcggccg ctctagaatg cgcgtgaagg 1320 
agaagtacca gcacctgtgg cgctggggct ggcgctgggg caccatgctg ctgggcatgc 1380 
tgatgatctg cagcgccacc gagaagctgt gggtgaccgt gtactacggc gtgcccgtgt 1440 
ggaaggaggc caccaccacc ctgctctgcg ccagcgacgc caaggcctac gacaccgagg 1500 
tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc caggaggtgg 1560 
tgctggtgaa cgtgaccgag aacttcgaca tgtggaagaa cgacatggtg gagcagatgc 1620 
acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag ctgacccccc 1680 
tgtgcgtgag cctgaagtgc accgacctga agaacgacac caacaccaac agcagcagcg 174 0 
gccgcatgat catggagaag ggcgagatca agaactgcag cttcgctagc accagctgca 1800 
acaccagcgt gatcacccag gcctgcccca aggtgagctt cgagcccatc cccaaccact 1860 
actgcgcccc cgccggcttc gccatcctga agtgcaagga caagaagttc aacggcaagg 1920 
gcccctgcac caacgtgagc accgtgcagt gcacccacgg catccgcccc gtggtgagca 1980 
cccagctgct gctgaacggt agcctggccg aggaggaggt ggtgatccgc agcgctaact 204 0 
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tcgccgacaa cgccaaggbg atcatcgtgc agctgaacga gagcgtggag atcaactgca 2100 
cccgcgcbag cgcccacbgc aaccbgagcc gcgccaagtg gaacgacacc ctgaacaaga 2160 
tcgtgatcaa gctgcgcgag cagttcggca acaagaccat cgtgttcaag cacagcagcg 2220 
gcggcgaccc cgagatcgtg acccacagct tcaactgcgg cggcgagttc ttctactgca 22 80 
acagcaccca gctgttcaac agcacctggt tcaacagcac ctggagcacc gagggcagca 2340 
acaacaccga gggcagcgac accabcaccc tgccctgccg catcaagcag abcatcaaca 24 00 
t 9 t 99 ca 9 aa ggtgggcaag gccatgtacg ccccccccat cagcggccag atccgctgca 2460 
gcagcaacat caccggcctg ctgctgaccc gcgacggcgg caacagcaac aacgagagcg 2520 
agatcttccg tccgggcggc ggcgacatgc gcgacaactg gcgcagcgag ctgtacaagt 2580 
acaaggtggt gaagatcgag cccctgggcg tggcccccac caaggccaag cttaccgtcc 2640 
aggcccgcca gcbgcbgagc ggcabcgbgc agcagcagaa caacctgctg cgcgccatcg 2700 
aggcccagca gcaccbgcbg cagctgaccg tgtggggcat caagcagchg caggcccgca 2760 
ccctggccgt ggagcgctac ctgaaggacc agcagcagct cgagcagatc tggaaccaca 2 8 20 
ccacctggat ggagbgggac cgcgagatca acaactacac cagcctgatc cacagccbga 2880 
tcgaggagag ccagaaccag caggagaaga acgagcagga gctgctggag ctggacaagt 2 940 
gggccagcct gtggaacbgg ttcaacatca ccaacbggcb gtggtgagga tccagatctg 3000 
ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct bccbbgaccc 3060 
tggaaggtgc cacbcccacb gtcctttcct aabaaaabga ggaaattgca tcgcattgtc 3120 
tgagtaggtg tcattctath ctggggggtg gggtggggca gcacagcaag ggggaggatt 3180 
gggaagacaa tagcaggcat gctggggatg cggtgggctc babgggbacc cagggccgca 3240 
taacttcgta taabgbabgc tatacgaagb bataagabcb gtactgaaat gtgtgggcgt 33 00 
ggcttaaggg tgggaaagaa tatataaggt gggggbcbba tgbagtbbtg tatctgbbbb 3360 
gcagcagccg ccgccgccab gagcaccaac bcgbbbgabg gaagcattgt gagctcabab 3420 
ttgacaacgc gcatgccccc atgggccggg gtgcgtcaga abgtgabggg cbccagcabb 34 80 
gatggfccgcc ccgtcctgcc cgcaaactct actaccttga cctacgagac cgtgtctgga 3540 
acgccgttgg agactgcagc ctccgccgcc gcbtcagccg ctgcagccac cgcccgcggg 3600 
attgtgactg actttgcttt cctgagcccg cttgcaagca gtgcagcttc ccgttcatcc 3660 
gcccgcgatg acaagttgac ggctcttttg gcacaattgg attctttgac ccgggaacbb 3720 
aatgtcgttt cbcagcagcb gttggatctg cgccagcagg tttctgcccb gaaggcttcc 3780 
tcccctccca atgcggttta aaacataaat aaaaaaccag actctgtttg gathtggatc 3840 
aagcaagtgt cttgctgtct tbabbtaggg gttttgcgcg cgcggtaggc ccgggaccag 3 900 
cggtctcggt cgtbgagggb ccbgbgbabb ttttccagga cgtggtaaag gtgacbctgg 3 960 
atgttcagat acatgggcat aagcccgtct ctggggtgga ggtagcacca ctgcagagct 4 020 
tcatgctgcg gggtggtgtt gtagatgatc cagtcgtagc aggagcgctg ggcgtggtgc 4 080 
cbaaaaabgb ctttcagtag caagctgatt gccaggggca ggcccttggt gtaagtgttt 4140 
acaaagcggt taagctggga tgggtgcata cgtggggata tgagatgcat cttggactgt 42 00 
atttttaggt tggctatgtt cccagccafca tccctccggg gafctcatgtt gtgcagaacc 4260 
accagcacag tgtatccggt gcacttggga aafcttgtcat gtagcttaga aggaaatgcg 4320 
tggaagaact tggagacgcc cttgtgacct ccaagatttt ccatgcattc gtccafcaatg 43 80 
atggcaatgg gcccacgggc ggcggcctgg gcgaagatat ttctgggatc actaacgtca 4440 
tagttgtgtt ccaggatgag atcgtcatag gccattttta caaagcgcgg gcggagggtg 4500 
ccagactgcg gtataatggt tccatccggc ccaggggcgt agttaccctc acagatttgc 4560 
atttcccacg ctthgagtbc agabgggggg abcabgbcba ccbgcggggc gabgaagaaa 4620 
acggbbbccg gggbagggga gabcagcbgg gaagaaagca ggbbccbgag cagcbgcgac 4680 
bbaccgcagc cggbgggccc gbaaabcaca ccbabbaccg gcbgcaacbg gbagbtaaga 4 74 0 
gagcbgcagc bgccgtcabc cctgagcagg ggggccacbb cgbbaagcat gbccctgacb 4800 
cgcabgbtbt cccbgaccaa atccgccaga aggcgcbcgc cgcccagcga bagcagbbcb 4860 
bgcaaggaag caaagtbbbb caacggbttg agaccgbccg ccgbaggcat gcbtbtgagc 4920 
gbbbgaccaa gcagbtccag gcggtcccac agctcggbca ccbgcbcbac ggcabctcga 4980 
tccagcatab cbcctcgbbb cgcgggttgg ggcggcbbtc gctgtacggc agbagbcggt 5040 
gcbcgbccag acgggccagg gtcabgtcbt bccacgggcg cagggbccbc gtcagcgbag 5100 
tcbgggbcac ggbgaagggg tgcgcbccgg gcbgcgcgcb ggccagggbg cgcbbgaggc 5160 
tggbccbgcb ggbgcbgaag cgcbgccggt cttcgcccbg cgcgbcggcc aggbagcabb 5220 
tgaccabggb gbcabagbcc agccccbccg cggcgbggcc cbtggcgcgc agcbbgcccb 5280 
tggaggaggc gccgcacgag gggcagbgca gactbbbgag ggcgbagagc btgggcgcga 5340 
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gaaataccga bbccggggag baggcabccg 
ccacgagcca ggtgagctct ggccgttcgg 
tgatgcgttt cttacctctg gbbbccabga 
tgtccgbgtc cccgtataca gacttgagag 
cctcgtatag aaacbcggac cacbcbgaga 
aggctaagtg ggaggggtag cggtcgttgt 
gaagacaeab gbcgcccbcb tcggcatcaa 
cgtgaccggg tgttcctgaa ggggggctat 
tctcttccgc abcgcbgbcb gcgagggcca 
bggccbbccc cafcbabgabb cttctcgctt 
ccatgctgtc caggcaggta gatgacgacc 
ccaggaaccg taaaaaggcc gcgbbgcbgg 
agcatcacaa aaafccgacgc tcaagtcaga 
accaggcgbb bcccccbgga agcbcccbcg 
ccggabaccb gtccgccttt ctcccttcgg 
gbaggbabcb cagttcggtg taggtcgttc 
ccgbbcagcc cgaccgctgc gccttatccg 
gacacgacbb atcgccactg gcagcagcca 
taggcggtgc tacagagttc ttgaagtggt 
tatttggtat ctgcgctctg ctgaagccag 
gatccggcaa acaaaccacc gctggtagcg 
cgcgcagaaa aaaaggatct caagaagatc 
agtggaacga aaactcacgt taagggattt 
cctagatcct tttaaattaa aaatgaagbb 
cttggtctga cagbbaccaa tgcttaatca 
ttcgttcatc catagttgcc tgactccccg 
taccatctgg ccccagtgct gcaatgatac 
babcagcaab aaaccagcca gccggaaggg 
ccgcctccat ccagtctatt aattgttgcc 
atagtttgcg caacgttgtt gccattgctg 
gtatggcttc attcagctcc ggttcccaac 
bgbgcaaaaa agcggttagc tccttcggtc 
cagbgbtabc actcatggtt atggcagcac 
taagatgctt bbcbgbgacb ggbgagtacb 
ggcgaccgag ttgctchtgc ccggcgtcaa 
ctttaaaagt gctcatcatt ggaaaacgtt 
cgctgttgag atccagttcg atgtaaccca 
bbacbbbcac cagcgtfctct gggtgagcaa 
gaataagggc gacacggaaa tgfctgaatac 
gcatttatca gggttattgt ctcatgagcg 
aacaaatagg ggttccgcgc acatttcccc 
ttattatcat gacattaacc tataaaaaba 
aabbgbb 



cgccgcaggc cccgcagacg gbcbcgcabb 5400 
ggbcaaaaac caggbbbccc ccabgcbbbb 5460 
gccggbgbcc acgcbcggbg acgaaaaggc 5520 
gccbgbccbc gagcggbgbb ccgcggbccb 5580 
caaaggcbcg cgbccaggcc agcacgaagg 5640 
ccactagggg gbccacbcgc tccagggbgb 5700 
ggaaggtgat bggtbtgbag gtgbaggcca 5760 
aaaagggggt gggggcgcgb bcgbcctcac 5820 
gcbgbbgggg bgagbcgacg cgaggcbgga 5880 
ccggcggcab cgggabgccc gcgbbgcagg 5940 
abcagggaca gcbbcaaggc cagcaaaagg 6000 
cgbbttbcca baggctccgc cccccbgacg 6060 
ggtggcgaaa cccgacagga ctataaagab 6120 
bgcgcbctcc tgbbccgacc cbgccgcbta 6180 
gaagcgbggc gcbbbcbcab agcbcacgcb 6240 
gctccaagcb gggcbgbgbg cacgaacccc 6300 
gtaacbabcg tcbtgagtcc aacccggbaa 6360 
cbggbaacag gabbagcaga gcgaggbabg 6420 
ggccbaacba cggcbacacb agaaggacag 6480 
bbaccbbcgg aaaaagagbb ggtagcbctb 6540 
gbggbbbbbb bgbbbgcaag cagcagabba 6600 
cbbbgatcbb btcbacgggg bcbgacgcbc 6660 
tggbcabgag abbabcaaaa aggabcbbca 6720 
bbaaabcaab cbaaagtaba tabgagbaaa 6780 
gbgaggcacc babcbcagcg abcbgbcbab 684 0 
bcgbgbagab aacbacgaba cgggagggcb 6900 
cgcgagaccc acgcbcaccg gcbccagabt 6960 
ccgagcgcag aagbggbccb gcaacbbbab 7020 
gggaagcbag agbaagbagb bcgccagbba 7080 
caggcabcgb ggbgtcacgc bcgbcgbbbg 7140 
gabcaaggcg agbbacabga tcccccabgb 7200 
cbccgabcgb bgbcagaagb aagbbggccg 7260 
bgcabaabbc bcbbacbgbc abgccabccg 7320 
caaccaagbc abtcbgagaa bagbgbabgc 7380 
cacgggabaa baccgcgcca cabagcagaa 7440 
cbbcggggcg aaaacbctca aggabctbac 7500 
cbcgbgcacc caacbgatcb bcagcatcbb 7560 
aaacaggaag gcaaaabgcc gcaaaaaagg 7620 
tcabacbcbb ccbbbbbcaa babbabbgaa 7680 
gabacababb bgaabgbabb bagaaaaaba 7740 
gaaaagbgcc accbgacgtc baagaaacca 7800 
ggcgtabcac gaggcccbbb cgbcbbcaag 7860 

7867 



<210> 43 
<211> 7870 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plastnid pAdApb conbaining HIV genes 
<400> 43 



- xoo - 



WO 02/32943 



PCT/US01/25721 



ttaattaacc gcaattctca tgtttgacag 
ggattgaagc caatatgata abgagggggb 
acggggcggg tgacgtagta gtgtggcgga 
tgbaagcgac ggatgtggca aaagtgacgt 
gacaattttc gcgcggtttt aggcggatgt 
bbbggccatt ttcgcgggaa aactgaataa 
acbcabagcg cgtaatattt gtctagggcc 
cgcccaggtg tttttctcag gtgttttccg 
tatagtcagt acgtaccagt gcacbggccb 
babcabaaba tgtacattta tattggctca 
btabbgacta gttattaaba gbaabcaabb 
gagttccgcg ttacataact tacggtaaat 
cgcccattga cgtcaataat gacgtatgbt 
tgacgtcaat gggtggagta tttacggtaa 
cababgccaa gtacgccccc babtgacgbc 
gcccagtaca tgaccttatg ggacbbbccb 
gctattacca tggtgatgcg gttttggcag 
bcacggggab bbccaagbcb ccaccccatt 
aatcaacggg actttccaaa abgbcgbaac 
aggcgtgtac ggtgggaggt ctatataagc 
tggagacgcc atccacgctg bbtbgaccbc 
cgbcaccgbc gbcgacacgb gbgabcagab 
agaagtacca gcaccbgbgg cgctggggcb 
bgabgabcbg cagcgccacc gagaagcbgb 
ggaaggaggc caccaccacc cbgctcbgcg" 
tgcacaacgb gtgggccacc cacgccbgcg 
tgcbggbgaa cgbgaccgag aacbbcgaca 
acgaggacab cabcagccbg bgggaccaga 
tgtgcgbgag ccbgaagtgc accgaccbga 
gccgcabgab cabggagaag ggcgagatca 
acaccagcgb gabcacccag gccbgcccca 
acbgcgcccc cgccggcbbc" gccatcctga 
gccccbgcac caacgtgagc accgbgcagb 
cccagcbgcb gcbgaacggt agccbggccg 
bcgccgacaa cgccaaggbg .abcabcgbgc 
cccgccccaa caacaacacc cgcaagagca 
ccaccggcga gabcabcggc gacabccgcc 
ggaacgacac ccbgaacaag abcgtgabca 
bcgbgbtcaa gcacagcagc ggcggcgacc 
gcggcgagbb ctbcbacbgc aacagcgcba 
acabgbggca gaaggbgggc aaggccabgb 
gcagcagcaa cabcaccggc cbgcbgctga 
gcgagabcbb ccgbccgggc ggcggcgaca 
agtacaaggb ggbgaagabc gagcccctgg 
bccaggcccg ccagcbgcbg agcggcabcg 
bcgaggccca gcagcaccbg cbgcagcbga 
gcacccbggc cgbggagcgc baccbgaagg 
acaccaccbg gabggagbgg gaccgcgaga 
bgabcgagga gagccagaac cagcaggaga 
agbgggccag ccbgbggaac bggbbcaaca 
cbgcbgbgcc bbcbagtbgc cagccabcbg 
cccbggaagg bgccactccc acbgbccbbb 
gtcbgagbag gbgtcatbcb abbcbggggg 
abbgggaaga caatagcagg catgcbgggg 
gcabaacbbc gbabaabgba bgcbatacga 



cbbabcabca bcaataabat accbbabtbb 60 
ggagbtbgtg acgbggcgcg gggcgbggga 120 
agtgtgatgb bgcaagbgtg gcggaacaca 180 
bbtbggbgbg cgccggtgba cacaggaagb 240 
bgbagbaaab btgggcgbaa ccgagbaaga 300 
gaggaagbga aabcbgaata abtbtgbgbb 360 
gcggggacbb bgaccgtbba cgtggagacb 42 0 
cgtbccgggt caaagttggc gbtbtabbab 480 
agagcggccc cattgcabac gbtgbabcca 540 
bgtccaacab baccgccabg bbgacabbga 600 
acggggbcat bagbbcabag cccabababg 660 
ggcccgccbg gcbgaccgcc caacgacccc 720 
cccabagbaa cgccaabagg gacttbccat 780 
acbgcccacb bggcagbaca bcaagtgbat 840 
aabgacggba aabggcccgc cbggcabbab 900 
acbbggcagb acabcbacgb abbagbcabc 960 
bacabcaabg ggcgbggaba gcggbbtgac 1020 
gacgbcaabg ggagbbbgbb tbggcaccaa 1080 
aacbccgccc cabtgacgca aabgggcggb 114 0 
agagctcgbt bagtgaaccg tcagabcgcc 12 00 
cabagaagac accgggaccg atccagccbc 1260 
atcgcggccg cbcbagaabg cgcgbgaagg 1320 
ggcgcbgggg caccabgcbg cbgggcabgc 1380 
gggbgaccgb gbacbacggc gbgcccgbgb 144 0 
ccagcgacgc caaggccbac gacaccgagg 1500 
tgcccaccga ccccaacccc caggaggbgg 1560 
bgbggaagaa cgacatggbg gagcagabgc 1620 
gccbgaagcc cbgcgtgaag cbgacccccc 1680 
agaacgacac Caacaccaac agcagcagcg 1740 
agaacbgcag cbbcgcbagc accagcbgca 1800 
aggtgagcbb cgagcccabc cccaaccacb 1860 
agbgcaagga caagaagbbc aacggcaagg 192 0 
gcacccacgg cabccgcccc gbggbgagca 1980 
aggaggaggb ggbgabccgc agcgcbaact 2 040 
agcbgaacga gagcgbggag abcaactgca 2100 
bccacabcgg ccccggccgc gcctbcbaca 2160 
aggcccactg caaccbgagc cgcgccaagb 222 0 
agcbgcgcga gcagbbcggc aacaagacca 2280 
ccgagabcgb gacccacagc bbcaacbgcg 2340 
gccbgcccbg ccgcabcaag cagabcabca 24 00 
acgccccccc cabcagcggc cagabccgcb 2460 
cccgcgacgg cggcaacagc aacaacgaga 2520 
bgcgcgacaa cbggcgcagc gagcbgbaca 2580 
gcgbggcccc caccaaggcc aagcbbaccg 2640 
bgcagcagca gaacaaccbg cbgcgcgcca 27 00 
ccgbgbgggg cabcaagcag cbgcaggccc 27 60 
accagcagca gcbcgagcag atcbggaacc 2820 
bcaacaacta caccagccbg abccacagcc 2880 
agaacgagca ggagcbgcbg gagcbggaca 2940 
bcaccaacbg gctgbggbga ggabccagab 3000 
bbgbbtgccc cbcccccgbg ccbbccbbga 3060 
ccbaabaaaa bgaggaaabb gcabcgcafcb 3120 
gtggggbggg gcagcacagc aagggggagg 3180 
at g c gg t ggg ct cbabgggt acccagggcc 3240 
agbbataaga bctgbacbga aatgbgbggg 33 00 
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cgtggcttaa gggbgggaaa gaatatataa 
tthgcagcag ccgccgccgc cahgagcacc 
tatttgacaa cgcgcatgcc cccatgggcc 
atbgatggbc gccccgtcct gcccgcaaac 
ggaacgccgt tggagacbgc agcctccgcc 
gggattgtga ctgactttgc tttcctgagc 
tccgcccgcg atgacaagtt gacggctctt 
cttaatgtcg tttctcagca gctgttggat 
tcctccccbc ccaatgcggt ttaaaacata 
atcaagcaag bgbcttgcbg bctttattta 
cagcggtctc ggtcgttgag ggtcctgtgt 
tggatgttca gatacatggg cataagcccg 
gcttcatgct gcggggtggt gttgtagatg 
tgcctaaaaa tgtctttcag bagcaagctg 
tttacaaagc ggttaagctg ggatgggtgc 
tgbatbbtba ggttggctat gttcccagcc 
accaccagca cagtgtatcc ggtgcacttg 
gcgtggaaga acttggagac gcccttgtga 
atgatggcaa tgggcccacg ggcggcggcc 
tcatagttgt gbbccaggat gagatcgtca 
gtgccagact gcggtataat ggttccatcc 
bgcatttccc acgctttgag tbcagabggg 
aaaacggbbt ccggggtagg ggagatcagc 
gacttaccgc agccggtggg cccgbaaatc 
agagagctgc agctgccgtc atccctgagc 
actcgcatgt tttccctgac caaatccgcc 
tcbbgcaagg aagcaaagtt tbtcaacggb 
agcgbtbgac caagcagbbc caggcggtcc 
cgatccagca batctcctcg bbbcgcgggt 
ggbgcbcgtc cagacgggcc agggbcatgt 
bagtcbgggt cacggbgaag gggtgcgctc 
ggcbggtcct gctggbgcbg aagcgctgcc 
abbbgaccat ggbgbcabag tccagcccct 
ccbtggagga ggcgccgcac gaggggcagt 
cgagaaabac cgabtccggg gagtaggcat 
abtccacgag ccaggtgagc tcbggccgbb 
tbbtgabgcg tttcbbaccb ctggbbbcca 
ggctgtccgb gtccccgbab acagacbtga 
ccbcctcgba bagaaactcg gaccacbcbg 
aggaggcbaa gtgggagggg bagcggbcgb 
bgbgaagaca catgbcgccc bcttcggcab 
ccacgtgacc gggtgtbccfc gaaggggggc 
cactctctbc cgcabcgcbg bcbgcgaggg 
ggabggcctb ccccatbatg atbcbbctcg 
aggccabgct gbccaggcag gbagabgacg 
aggccaggaa ccgbaaaaag gccgcgttgc 
acgagcatca caaaaabcga cgcbcaagtc 
gabaccaggc gbbbccccct ggaagcbccc 
btaccggaba ccbgtccgcc tbbctcccbt 
gcbgtaggta tctcagbbcg gbgbaggbcg 
cccccgtbca gcccgaccgc bgcgcctbat 
baagacacga ctbatcgcca cbggcagcag 
abgbaggcgg bgcbacagag bbctbgaagt 
cagbatfctgg babcbgcgct ctgcbgaagc 
cttgabccgg caaacaaacc accgcbggta 



ggbgggggtc ttabgbagtb bbgtabcbgt 3360 
aacbcgtbbg atggaagcab bgtgagcbca 3420 
ggggbgcgtc agaatgtgat gggctccagc 3480 
bctacbaccb tgaccbacga gaccgbgbct 3540 
gccgcbtcag ccgctgcagc caccgcccgc 3600 
ccgctbgcaa gcagbgcagc btcccgbtca 3660 
btggcacaab tggattctbt gacccgggaa 3720 
cbgcgccagc aggbbbctgc ccbgaaggcb 3780 
aataaaaaac cagacbctgb ttggabtbgg 3840 
ggggtbttgc gcgcgcggba ggcccgggac 3 900 
atbbtttcca ggacgbggba aaggbgactc 3960 
tcbctggggt ggaggbagca ccactgcaga 4020 
atccagtcgt agcaggagcg ctgggcgtgg 4080 
atbgccaggg gcaggccctt ggbgbaagbg 4140 
atacgbgggg atatgagabg catctbggac 4200 
abatcccbcc ggggattcat gttgbgcaga 4260 
ggaaabttgb cabgtagcbb agaaggaaab 4320 
ccbccaagab bttccabgca ttcgbccaba 4380 
bgggcgaaga babbtcbggg abcacbaacg 4440 
taggccatbb bbacaaagcg cgggcggagg 4500 
ggcccagggg cgbagbtacc ctcacagatb 4560 
gggatcatgb cbaccbgcgg ggcgabgaag 4620 
tgggaagaaa gcaggttccb gagcagctgc 4680 
acacctatba ccggcbgcaa cbggtagbba 4740 
aggggggcca ctbcgbbaag catgtcccbg 4800 
agaaggcgcb cgccgcccag cgatagcagb 4860 
bbgagaccgb ccgccgtagg catgctbbtg 4920 
cacagctcgg bcacctgctc bacggcabcb 4980 
bggggcggct bbcgctgbac ggcagbagbc 5040 
cbttccacgg gcgcagggbc cbcgtcagcg 5100 
cgggctgcgc gcbggccagg gbgcgcbbga 5160 
ggtcbtcgcc cbgcgcgbcg gccaggtagc 5220 
ccgcggcgtg gcccbtggcg cgcagcbbgc 52 8 0 
gcagactbbb gagggcgtag agcbbgggcg 534 0 
ccgcgccgca ggccccgcag acggbctcgc 5400 
cggggbcaaa aaccaggtbb cccccatgct 5460 
tgagccggbg bccacgctcg gbgacgaaaa 5520 
gaggcctgtc cbcgagcggb gbtccgcggb 5580 
agacaaaggc bcgcgbccag gccagcacga 5640 
bgtccacbag ggggbccacb cgctccaggg 5700 
caaggaaggt gatbggbtbg baggbgbagg 5760 
tataaaaggg ggtgggggcg cgbtcgbccb 5820 
ccagcbgbbg gggbgagbcg acgcgaggcb 58 80 
cbtccggcgg cabcgggabg cccgcgtbgc 594 0 
accabcaggg acagcttcaa ggccagcaaa 6000 
bggcgtbbbt ccabaggcbc cgccccccbg 6060 
agaggtggcg aaacccgaca ggacbabaaa 6120 
bcgbgcgcbc tccbgbtccg acccbgccgc 6180 
cgggaagcgb ggcgcbbbcb cabagcbcac 6240 
bbcgctccaa gctgggctgb gtgcacgaac 6300 
ccggbaacba tcgbcttgag tccaacccgg 6360 
ccactggtaa caggabtagc agagcgaggb 6420 
ggbggccbaa ctacggcbac acbagaagga 6480 
cagbbacctt cggaaaaaga gbtggtagcb 6540 
gcggbggbbb bbbtgbtbgc aagcagcaga 6600 
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ttacgcgcag aaaaaaagga tctcaagaag abccbtbgat cttbbctacg gggtctgacg 6660 
ctcagbggaa cgaaaactca cgbbaaggga ttthggtcat gagattatca aaaaggatct 6720 
tcaccfcagat ccbtbbaaab traaaaatgaa gttttaaatc aatctaaagt atatatgagt 6780 
aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc 6840 
babttcgbbc atccatagbb gcctgactcc ccgbcgbgta gabaactacg atacgggagg 6900 
gcbbaccabc tggccccagt gctgcaatga baccgcgaga cccacgctca ccggctccag 6960 
atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaacbb 7020 
babccgccbc catccagtct atbaabtgbb gccgggaagc tagagtaagt agttcgccag 7080 
ttaatagttt gcgcaacgtt gttgccattg ctgcaggcat cgbggbgbca cgctcgtcgt 7140 
ttggtatggc ttcattcagc tccggtbccc aacgatcaag gcgagbtaca tgatccccca 7200 
bgtbgbgcaa aaaagcggtt agctccbtcg gtcctccgat cgttgtcaga agtaagttgg 7260 
ccgcagtgtt abcacbcabg gtbabggcag cactgcataa btcbctbacb gtcatgccab 7320 
ccgbaagatg cbbtbctgbg acbggbgagt acbcaaccaa gtcattcbga gaabagbgba 7380 
bgcggcgacc gagbbgcbcb bgcccggcgt caacacggga baataccgcg ccacatagca 744 0 
gaactttaaa agtgctcabc atbggaaaac gttcttcggg gcgaaaactc tcaaggabcb 7500 
baccgcbgbt gagatccagb bcgatgtaac ccactcgtgc acccaactga tctbcagcab 7560 
cbbbbacbbt caccagcgbb tcbgggtgag caaaaacagg aaggcaaaab gccgcaaaaa 7620 
agggaabaag ggcgacacgg aaatgttgaa tacbcabacb ctbccbtbbb caababbabb 7680 
gaagcabbba bcagggbbat bgbcbcabga gcggatacab abbbgaabgb abbbagaaaa 7740 
abaaacaaab aggggbbccg cgcacatbbc cccgaaaagb gccaccbgac gbcbaagaaa 7800 
ccatbabbab cabgacabta accbabaaaa abaggcgbab cacgaggccc bbbcgtcbbc 7860 
aagaatbgbb 7870 



<210> 44 
<211> 7885 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pAdApb containing HIV genes 
<400> 44 

bbaabbaacc gcaatbcbca bgtbtgacag cbbabcabca bcaataatab accbbatbbb 60 

ggabbgaagc caabatgata abgagggggb ggagbbbgbg acgbggcgcg gggcgbggga 12 0 

ac 9ggg c 99g bgacgbagba gbgbggcgga agbgbgabgb tgcaagtgbg gcggaacaca 180 

t bgtaagcgac ggatgtggca aaagtgacgt btbtggtgtg cgccggtgba cacaggaagb 240 

gacaatbbtc gcgcggbbbb aggcggatgt bgtagtaaat bbgggcgbaa ccgagbaaga 300 

bbbggccabb bbcgcgggaa aacbgaataa gaggaagbga aabctgaaba abtbbgtgbb 3 60 

acbcatagcg cgtaababbb gbcbagggcc gcggggacbt bgaccgbbba cgtggagact 420 

cgcccaggbg tttbbctcag gtgtttbccg cgbbccgggt caaagbtggc gtbbbatbab 480 

babagtcagt acgbaccagb gcacbggccb agagcggccc cabtgcabac gbbgbabcca 540 

babcataaba bgbacabbta tatbggctca bgbccaacat baccgccatg tbgacabbga 600 

bbabbgacba gbtatbaaba gbaabcaabb acggggbcat bagtbcabag cccabababg 660 

gagbbccgcg bbacabaact tacggbaaab ggcccgccbg gctgaccgcc caacgacccc 720 

cgcccabbga cgbcaabaat gacgbabgtb cccatagbaa cgccaabagg gacbbbccat 780 

tgacgbcaab gggbggagba tbbacggbaa acbgcccacb bggcagbaca bcaagbgbat 840 

cababgccaa gtacgccccc tabbgacgbc aabgacggba aabggcccgc ctggcabbat 900 

gcccagtaca tgaccttabg ggactttccb acbtggcagt acabcbacgb abbagbcabc 960 

gcbabbacca tggbgatgcg gbtbbggcag bacabcaabg ggcgtggata gcggtbtgac 1020 

bcacggggab ttccaagbct ccaccccabb gacgbcaatg ggagttbgbt bbggcaccaa 1080 

aatcaacggg acbtbccaaa abgbcgtaac aactccgccc cabtgacgca aabgggcggb 114 0 

aggcgbgtac ggbgggaggt cbabataagc agagcbcgbt bagbgaaccg tcagabcgcc 1200 

bggagacgcc abccacgcbg btbbgaccbc cabagaagac accgggaccg abccagcctc 1260 
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cgtcaccgtc gbcgacacgb gtgatcagat atcgcggccg ctctagaatg cgcgtgaagg 1320 

agaagbacca gcacctgtgg cgcbggggct ggcgcbgggg caccabgcbg ctgggcatgc 13 80 

tgabgatcbg cagcgccacc gagaagctgt gggfcgaccgt gtactacggc gbgcccgbgb 1440 

ggaaggaggc caccaccacc cbgcbcbgcg ccagcgacgc caaggcctac gacaccgagg 1500 

bgcacaacgb gtgggccacc cacgcctgcg tgcccaccga ccccaacccc caggaggtgg 1560 

tgctggtgaa cgtgaccgag aacbbcgaca tgtggaagaa cgacatggtg gagcagabgc 1620 

acgaggacat cabcagccbg tgggaccaga gcctgaagcc cbgcgbgaag cbgacccccc 1680 

tgtgcgtgag cctgaagtgc accgacctga agaacgacac caacaccaac agcagcagcg 1740 

gccgcatgat cabggagaag ggcgagatca agaactgcag cbbcaacabc agcaccagca 1800 

tccgcggcaa ggtgcagaag gagtacgcct bcbbctacaa gctggacatc atccccatcg 1860 

acaacgacac caccagctac agcctgacca gctgcaacac cagcgbgatc acccaggcct 1920 

gccccaaggt gagcttcgag cccatcccca accactactg cgcccccgcc ggctbcgcca 1980 

tcctgaagtg caaggacaag aagttcaacg gcaagggccc ctgcaccaac gtgagcaccg 2040 

tgcagtgcac ccacggcatc cgccccgtgg tgagcaccca gctgctgctg aacggtagcc 2100 

tggccgagga ggaggtggtg atccgcagcg ctaacttcgc cgacaacgcc aaggtgatca 2160 

bcgbgcagcb gaacgagagc gtggagatca actgcacccg cgctagcgcc cactgcaacc 2 220 

tgagccgcgc caagtggaac gacaccctga acaagabcgb gatcaagctg cgcgagcagt 2280 

tcggcaacaa gaccatcgtg btcaagcaca gcagcggcgg cgaccccgag atcgtgaccc 234 0 

acagcttcaa ctgcggcggc gagttcttct actgcaacag cgcbagccbg ccctgccgca 2400 

tcaagcagat catcaacatg tggcagaagg tgggcaaggc catgtacgcc ccccccatca 2460 

gcggccagat ccgctgcagc agcaacatca ccggccbgct gctgacccgc gacggcggca 252 0 

acagcaacaa cgagagcgag abcbbccgbc cgggcggcgg cgacatgcgc gacaactggc 2580 

gcagcgagct gtacaagtac aaggtggtga agabcgagcc cctgggcgtg gcccccacca 2640 

aggccaagct taccgtccag gcccgccagc tgctgagcgg catcgtgcag cagcagaaca 2700 

accbgcbgcg cgccatcgag gcccagcagc acctgctgca gctgaccgtg tggggcatca 2760 

agcagctgca ggcccgcacc ctggccgtgg agcgcbaccb gaaggaccag cagcagctcg 2 820 

agcagatctg gaaccacacc accbggabgg agtgggaccg cgagabcaac aactacacca 2 880 

gcctgatcca cagcctgatc gaggagagcc agaaccagca ggagaagaac gagcaggagc 2940 

tgctggagct ggacaagtgg gccagccbgb ggaactggtt caacatcacc aacbggctgb 3000 

ggtgaggatc cagabcbgct gtgccttcta gttgccagcc abctgbbgtb tgcccctccc 3060 

ccgbgccbbc cbbgacccbg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg 3120 

aaattgcatc gcattgtctg agtaggtgtc abtcbabbcb ggggggbggg gtggggcagc 3180 

acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg gbgggcbcta 3240 

bgggbaccca gggccgcata acbbcgbaba atgtatgcta tacgaagtta baagabcbgb 3300 

actgaaatgt gtgggcgtgg cttaagggtg ggaaagaata tataaggtgg gggtcttatg 3360 

hagttttgta tctgttttgc agcagccgcc gccgccatga gcaccaactc gtttgabgga 3420 

agcattgtga gctcatattt gacaacgcgc atgcccccat gggccggggt gcgtcagaat 34 80 

gtgatgggct ccagcattga hggtcgcccc gtcctgcccg caaactctac taccttgacc 3540 

tacgagaccg tgtctggaac gccgthggag actgcagcct ccgccgccgc ttcagccgct 3600 

gcagccaccg cccgcgggat fcgtgactgac tttgctttcc tgagcccgct tgcaagcagt 3660 

gcagcttccc gttcatccgc ccgcgatgac aagttgacgg ctchtttggc acaattggat 3720 

tctttgaccc gggaacttaa tgtcgtttct cagcagchgt tggatctgcg ccagcaggtt 3780 

tctgccctga aggcttcctc ccctcccaat gcggtttaaa acataaataa aaaaccagac 3 840 

tctgtfctgga fcttggatcaa gcaagtgtct bgctgtcttt attfcaggggt tttgcgcgcg 3900 

cggtaggccc gggaccagcg gtctcggtcg ttgagggtcc tgtgtatttt ttccaggacg 3960 

tggtaaaggt gactctggat gttcagatac atgggcataa gcccgtctct ggggtggagg 4020 

tagcaccact gcagagcttc atgctgcggg gtggtgttgt agatgatcca gtcgtagcag 4080 

gagcgctggg cgtggtgcct aaaaatgtct tfccagtagca agctgattgc caggggcagg 4140 

cccttggtgt aagtgtttac aaagcggtta agctgggatg ggtgcatacg tggggatatg 4200 

agatgcabct bggacbgbab btbbaggttg gcbatgbbcc cagccababc cctccgggga 4260 

bbcabgbbgb gcagaaccac cagcacagtg babccggbgc acbbgggaaa bbtgtcabgt 4320 

agcbbagaag gaaatgcgbg gaagaacbbg gagacgcccb bgbgaccbcc aagatbbbcc 4380 

abgcabbcgb ccabaabgab ggcaabgggc ccacgggcgg cggccbgggc gaagababbb 4440 

cbgggabcac baacgbcaba gbtgbgbbcc aggatgagat cgbcabaggc catbtfcbaca 4500 

aagcgcgggc ggagggtgcc agacbgcggb abaatggbbc cabccggccc aggggcgbag 4560 
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ttaccctcac agatbbgcab tbcccacgcb bbgagtbcag atggggggat catgtctacc 4620 
tgcggggcga tgaagaaaac ggtbbccggg gtaggggaga tcagctggga agaaagcagg 4 680 
bbccbgagca gctgcgactt accgcagccg gtgggcccgt aaatcacacc battaccggc 4740 
tgcaactggt agttaagaga gctgcagctg ccgtcatccc tgagcagggg ggccacttcg 4 800 
btaagcatgt ccctgactcg cabgbbbbcc ctgaccaaat ccgccagaag gcgctcgccg 4860 
cccagcgata gcagttcttg caaggaagca aagbbbbbca acggtttgag accgtccgcc 4920 
ghaggcatgc bbbbgagcgb ttgaccaagc agbbccaggc ggtcccacag ctcggtcacc 4980 
bgcbcbacgg catctcgatc cagcatatct cctcghttcg cgggttgggg cggctbbcgc 5040 
tgtacggcag tagtcggtgc tcgtccagac gggccagggt catgtctttc cacgggcgca 5100 
gggbcctcgb cagcgtagtc tgggtcacgg tgaaggggtg cgctccgggc tgcgcgcbgg 5160 
ccagggtgcg cbbgaggcbg gtcctgctgg tgctgaagcg ctgccggtct tcgccctgcg 5220 
cgtcggccag gbagcabbbg accabggbgb cabagtccag ccccbccgcg gcgtggcccb 5280 
bggcgcgcag cbtgcccbbg gaggaggcgc cgcacgaggg gcagbgcaga ctbbbgaggg 5340 
cgbagagcbt gggcgcgaga aabaccgatt ccggggagta ggcabccgcg ccgcaggccc 5400 
cgcagacggt cbcgcatbcc acgagccagg bgagcbcbgg ccgttcgggg tcaaaaacca 54 60 
ggbbtccccc abgcbtbbbg abgcgbtbct taccbcbggb bbccabgagc cggtgtccac 5520 
gcbcggbgac gaaaaggcbg bccgbgbccc cgtabacaga cbbgagaggc cbgtcctcga 5580 
gcggbgbtcc gcggtccbcc bcgbabagaa acbcggacca ctcbgagaca aaggctcgcg 5640 
tccaggccag cacgaaggag gcbaagbggg aggggbagcg gbcgbbgbcc actagggggb 57 00 
ccacbcgcbc cagggbgbga agacacabgb cgcccbcbbc ggcabcaagg aaggbgabtg 5760 
gbbbgtaggb gbaggccacg bgaccgggtg btccbgaagg ggggcbataa aagggggbgg 5820 
gggcgcgtbc gbccbcacbc bcbbccgcab cgcbgbcbgc gagggccagc bgbtggggbg 5880 
agbcgacgcg aggcbggabg gccbbcccca btatgatbcb bctcgctbcc ggcggcatcg 5940 
ggabgcccgc gbbgcaggcc atgcbgbcca ggcaggtaga bgacgaccat cagggacagc 6000 
ttcaaggcca gcaaaaggcc aggaaccgba aaaaggccgc gbtgctggcg bbbtbccaba 6060 
ggcbccgccc cccbgacgag catcacaaaa atcgacgcbc aagbcagagg bggcgaaacc 6120 
cgacaggacb abaaagabac caggcgbbbc ccccbggaag cbcccbcgbg cgcbcbccbg 6180 
bbccgacccb gccgcbbacc ggabacctgb ccgccbbbcb cccbbcggga agcgtggcgc 6240 

bbbcbcabag cbcacgcbgb aggbabctca gbbcggbgba ggbcgbbcgc bccaagcbgg 6300 > 

gcbgbgbgca cgaacccccc gbtcagcccg accgcbgcgc cbbabccggb aacbabcgbc 6360 

bbgagtccaa cccggbaaga cacgacbbat cgccacbggc agcagccacb ggtaacagga 6420 

bbagcagagc gaggbabgba ggcggbgcba cagagbbctb gaagbggtgg ccbaactacg 64 80 

gcbacacbag aaggacagba bbbggbatcb gcgctcbgcb gaagccagbb accbbcggaa 6540 

aaagagbbgg bagcbctbga bccggcaaac aaaccaccgc tggbagcggb ggbbbbbbbg 6600 

bbbgcaagca gcagabtacg cgcagaaaaa aaggabcbca agaagabccb bbgabcbbbb 6660 

cbacggggbc bgacgctcag bggaacgaaa acbcacgbta agggatbbtg gbcabgagab 6720 

tabcaaaaag gabcbbcacc bagabccbbb baaatbaaaa abgaagbbbb aaatcaabcb 6780 

aaagbababa bgagbaaacb tggbctgaca gbbaccaabg cbbaabcagb gaggcaccba 6840 

tcbcagcgab cbgbcbabbb cgbbcabcca bagbbgccbg acbccccgbc gbgbagabaa 6900 

cbacgabacg ggagggcbta ccabcbggcc ccagbgcbgc aabgabaccg cgagacccac 6960 

gcbcaccggc bccagabbba bcagcaabaa accagccagc cggaagggcc gagcgcagaa 7020 

gbggbccbgc aacbbbabcc gcctccabcc agtcbabbaa bbgtbgccgg gaagcbagag 7080 

baagbagbbc gccagbbaab agbbbgcgca acgtbgbtgc cattgcbgca ggcabcgbgg 7140 

bgbcacgctc gbcgbbbggb abggcbbcab bcagcbccgg bbcccaacga bcaaggcgag 7200 

tbacabgatc ccccabgbbg bgcaaaaaag cggbbagcbc cbbcggbccb ccgatcgbbg 7260 

bcagaagbaa gbtggccgca gbgbbabcac bcabggbbab ggcagcacbg cabaabbcbc 7320 

bbacbgbcab gccabccgba agabgcbbtb cbgbgactgg bgagbacbca accaagbcab 73 80 

bcbgagaata gbgbabgcgg cgaccgagtb gcbcbbgccc ggcgbcaaca cgggabaaba 7440 

ccgcgccaca bagcagaact bbaaaagbgc bcabcabtgg aaaacgbbcb tcggggcgaa 7500 

aacbcbcaag gabcbbaccg cbgbbgagab ccagbbcgab gbaacccacb cgbgcaccca 7560 

acbgabcbbc agcabcbbbb acbbbcacca gcgtbtcbgg gbgagcaaaa acaggaaggc 7620 

aaaabgccgc aaaaaaggga abaagggcga cacggaaabg bbgaabactc abacbcbbcc 7680 

bbbbtcaaba bbabbgaagc atbtabcagg gbbattgbcb cabgagcgga bacababbbg 7740 

aabgbatbba gaaaaabaaa caaabagggg tbccgcgcac atbtccccga aaagbgccac 7800 

cbgacgtcba agaaaccabb abbabcabga cabbaaccba baaaaabagg cgtabcacga 7860 
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» 

<400> 45 

ttaattaacc gcaattctca tgtttgacag cttatcatca tcaataatat accttatttt 60 
ggattgaagc caatatgata atgagggggt ggagtttgtg acgtggcgcg gggcgtggga 120 
acggggcggg tgacgtagta gtgtggcgga agtgtgatgt tgcaagtgtg gcggaacaca 18 0 
tgtaagcgac ggatgtggca aaagtgacgt ttttggtgtg cgccggtgta cacaggaagt 240 
gacaattttc gcgcggtttt aggcggatgt tgtagtaaat ttgggcgtaa ccgagtaaga 3 00 
tttggccatt ttcgcgggaa aactgaataa gaggaagtga aatctgaata attttgtgtt 360 
actcatagcg cgtaatattt gtctagggcc gcggggactt tgaccgttta cgtggagact 420 
cgcccaggtg tttttctcag| gtgttttccg cgttccgggt caaagttggc gttttattat 480 
tatagtcagt acgtaccagt gcactggcct agagcggccc cattgcatac gttgtatcca 54 0 
tatcataata tgtacattta tattggctca tgtccaacat taccgccatg ttgacattga 600 
ttattgacta gttattaata gtaatcaatt acggggtcat tagttcatag cccatatatg 660 
gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc caacgacccc 720 
cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg gactttccat 780 
tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca tcaagtgtat 84 0 
catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc ctggcattat 900 
gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt attagtcatc 960 
gctattacca tggtgatgcg gttttggcag tacatcaatg ggcgtggata gcggtttgac 1020 
tcacggggat ttccaagtct ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa 1080 
aatcaacggg actttccaaa atgtcgtaac aactccgccc cattgacgca aatgggcggt 1140 
aggcgtgtac ggtgggaggt ctatataagc agagctcgtt tagtgaaccg tcagatcgcc 1200 
tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 1260 
cgtcaccgtc gtcgacacgt gtgatcagat atcgcggccg ctctagaatg cgcgtgaagg 1320 
agaagtacca gcacctgtgg cgctggggct ggcgctgggg caccatgctg ctgggcatgc 1380 
tgatgatctg cagcgccacc gagaagctgt gggtgaccgt gtactacggc gtgcccgtgt 1440 
ggaaggaggc caccaccacc ctgctctgcg ccagcgacgc caaggcctac gacaccgagg 1500 
tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc caggaggtgg 1560 
tgctggtgaa cgtgaccgag aacttcgaca tgtggaagaa cgacatggtg gagcagatgc 1620 
acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag ctgacccccc 1680 
tgtgcgtgag cctgaagtgc accgacgcta gcaccagctg caacaccagc gtgatcaccc 1740 
aggcctgccc caaggtgagc ttcgagccca tccccaacca ctactgcgcc cccgccggct 1800 
tcgccatcct gaagtgcaag gacaagaagt tcaacggcaa gggcccctgc accaacgtga 1860 
gcaccgtgca gtgcacccac ggcatccgcc ccgtggtgag cacccagctg ctgctgaacg 1920 
gtagcctggc cgaggaggag gtggtgatcc gcagcgctaa cttcgccgac aacgccaagg 1980 
tgatcatcgt gcagctgaac gagagcgtgg agatcaactg cacccgcgct agcgcccact 2040 
gcaacctgag ccgcgccaag tggaacgaca ccctgaacaa gatcgtgatc aagctgcgcg 2100 
agcagttcgg caacaagacc atcgtgttca agcacagcag cggcggcgac cccgagatcg 2160 
tgacccacag cttcaactgc ggcggcgagt tcttctactg caacagcacc cagctgttca 2220 
acagcacctg gttcaacagc acctggagca ccgagggcag caacaacacc gagggcagcg 22 80 
acaccatcac cctgccctgc cgcatcaagc agatcatcaa catgtggcag aaggtgggca 2340 
aggccatgta cgcccccccc atcagcggcc agatccgctg cagcagcaac atcaccggcc 2400 
tgctgctgac ccgcgacggc ggcaacagca acaacgagag cgagatcttc cgtccgggcg 2460 
gcggcgacat gcgcgacaac tggcgcagcg agctgtacaa gtacaaggtg gtgaagatcg 2520 
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agcccctggg cgtggccccc accaaggcca agcttaccgt ccaggcccgc cagctgctga 2580 
gcggcatcgt gcagcagcag aacaaccbgc bgcgcgccab cgaggcccag cagcacctgc 2 64 0 
tgcagctgac cgtgtggggc atcaagcagc tgcaggcccg caccctggcc gbggagcgcb 2700 
acctgaagga ccagcagcag ctcgagcaga tctggaacca caccacctgg atggagtggg 2760 
accgcgagat caacaactac accagcctga tccacagcct gatcgaggag agccagaacc 2620 
agcaggagaa gaacgagcag gagctgctgg agctggacaa gtgggccagc ctgtggaact 2880 
ggttcaacat caccaactgg ctgtggtgag gatccagatc tgctgtgcct tcbagbbgcc 2940 
agccatctgt bgbbbgcccc tcccccgfcgc cbtccbbgac cctggaaggt gccactccca 3000 
cbgbcctbtc cbaabaaaab gaggaaattg catcgcattg tctgagtagg tgtcattcta 3060 
ttctgggggg tggggtgggg cagcacagca agggggagga btgggaagac aabagcaggc 3120 
atgctgggga bgcggbgggc tcbatgggta cccagggccg cataacttcg tataatgtat 3180 
gcbabacgaa gbbabaagab cbgbacbgaa abgbgtgggc gbggctbaag ggbgggaaag 3240 
aabababaag gbgggggbcb babgbagbbb bgtatcbgbt bbgcagcagc cgccgccgcc 3300 
abgagcacca acbcgbbbga bggaagcabb gbgagctcat abbtgacaac gcgcatgccc 3360 
ccabgggccg gggbgcgbca gaatgbgabg ggctccagca bbgabggbcg ccccgtccbg 3420 
cccgcaaacb cbacbacctt gacctacgag accgtgbcbg gaacgccgtt ggagacbgca 3480 
gccbccgccg ccgcbbcagc cgctgcagcc accgcccgcg ggabbgbgac tgactbbgct 3540 
btccbgagcc cgcbtgcaag fcagbgcagcb bcccgtbcab ccgcccgcga bgacaagbbg 3600 
acggcbcbtb bggcacaabt ggabbcbbtg acccgggaac tbaatgbcgb bbcbcagcag 3660 
cbgbbggatc bgcgccagca ggbbtcbgcc cbgaaggctb ccbccccbcc caabgcggbt 3720 
taaaacabaa abaaaaaacc agacbcbgtb bggabbbgga bcaagcaagb gbcbbgctgt 3780 
cbbtabbbag gggbtbbgcg cgcgcggbag gcccgggacc agcggbcbcg gbcgbbgagg 3 840 
gbccbgbgta bbbbbbccag gacgbggbaa aggbgacbcb ggabgbbcag abacabgggc 3900 
abaagcccgb cbcbggggbg gaggbagcac cacbgcagag ctbcabgctg cggggbggbg 3960 
bbgbagabga bccagbcgba gcaggagcgc bgggcgbggt gcctaaaaat gbcbttcagt 4020 
agcaagcbga bbgccagggg caggcccbtg gbgbaagbgb bbacaaagcg gbtaagcbgg 4 080 
gabgggbgca bacgtgggga babgagabgc abcbbggacb gbabbbbbag gbtggcbabg 4140 
btcccagcca babcccbccg gggabtcabg bbgbgcagaa ccaccagcac agbgtabccg 4200 
gbgcacbbgg gaaatbbgbc abgbagcbta gaaggaaatg cgbggaagaa cbtggagacg 4260 
cccbbgbgac cbccaagabb bbccatgcab bcgbccabaa bgatggcaat gggcccacgg 4320 
gcggcggccb gggcgaagat atbbcbggga bcacbaacgb cabagbbgtg tbccaggabg 4380 
agabcgbcat aggccabbbt bacaaagcgc gggcggaggg bgccagactg cggbabaabg 4440 
gtbccabccg gcccaggggc gbagbbaccc bcacagabbt gcabbbccca cgcbtbgagt 4500 
bcagabgggg ggabcabgbc baccbgcggg gcgabgaaga aaacggbbbc cggggbaggg 4560 
gagabcagcb gggaagaaag caggbtcctg agcagcbgcg acbtaccgca gccggbgggc 4 620 
ccgbaaabca cacctabbac . cggcbgcaac tggbagbbaa gagagcbgca gctgccgbca 4680 
bcccbgagca ggggggccac bbcgbtaagc abgbcccbga ctcgcabgtb bbcccbgacc 4740 
aaabccgcca gaaggcgctc gccgcccagc gabagcagbb cbbgcaagga agcaaagbbb 4800 
btcaacggbb bgagaccgbc cgccgbaggc abgcbbbbga gcgbbbgacc aagcagbbcc 4860 
aggcggbccc acagcbcggb caccbgcbcb acggcabcbc gabccagcab abcbccbcgb 4920 
bbcgcgggbb ggggcggcbb bcgcbgbacg gcagbagbcg gbgcbcgbcc agacgggcca 4 980 
gggbcabgbc bbbccacggg cgcagggbcc bcgbcagcgb agbcbgggbc acggbgaagg 5040 
ggbgcgcbcc gggcbgcgcg cbggccaggg bgcgctbgag gctggbccbg cbggbgcbga 5100 
agcgcbgccg gbcbbcgccc tgcgcgbcgg ccaggbagca tbbgaccabg gbgbcabagb 5160 
ccagccccbc cgcggcgbgg cccbbggcgc gcagcbbgcc cbbggaggag gcgccgcacg 5220 
aggggcagbg cagacbbbbg agggcgtaga gcbbgggcgc gagaaabacc gabbccgggg 5280 
agbaggcabc cgcgccgcag gccccgcaga cggtcbcgca tbccacgagc caggbgagcb 5340 
cbggccgtbc ggggbcaaaa accaggbbbc ccccabgcbb tbbgabgcgb bbcbbaccbc 5400 
tggbbbccab gagccggbgb ccacgcbcgg bgacgaaaag gcbgbccgbg bccccgbaba 5460 
cagacbbgag aggccbgbcc tcgagcggbg bbccgcggbc cbccbcgbab agaaacbcgg 5520 
accactcbga, gacaaaggcb cgcgbccagg ccagcacgaa ggaggcbaag bgggaggggb 5580 
agcggtcgbb gtccacbagg gggbccacbc gcbccagggb gbgaagacac abgbcgcccb 5640 
cbbcggcatc aaggaaggbg abbggtbbgb aggbgbaggc cacgbgaccg ggbgbbccbg 5700 
aaggggggcb ataaaagggg gbgggggcgc gtbcgbccbc acbctcbbcc gcabcgcbgb 5760 
cbgcgagggc cagcbgbbgg ggbgagbcga cgcgaggcbg gabggccbbc cccabbabga 5820 
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ttcbbctcgc ttccggcggc atcgggatgc ccgcgttgca ggccatgctg tccaggcagg 5880 
tagabgacga ccabcaggga cagcttcaag gccagcaaaa ggccaggaac cgtaaaaagg 5940 
ccgcgttgct ggcgbtbbbc cafcaggctcc gcccccctga cgagcatcac aaaaatcgac 6000 
gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 6060 
gaagctccct cgtgcgctct ccbgbbccga ccctgccgct baccggabac cbgbccgccb 6120 
bbcbcccbbc gggaagcgtg gcgctttctc atagctcacg cbgbaggbab cbcagbtcgg 6180 
tgtaggtcgt tcgctccaag cbgggcbgbg tgcacgaacc ccccgttcag cccgaccgct 6240 
gcgccbbabc cggtaactat cgbctbgagt ccaacccggt aagacacgac tbatcgccac 6300 
tggcagcagc cacbggbaac aggabtagca gagcgaggba tgbaggcggb gctacagagb 6360 
tcbtgaagbg gbggccbaac bacggcbaca cbagaaggac agtatbbggt abcbgcgctc 6420 
tgcbgaagcc agbbaccbbc ggaaaaagag tbggbagcbc bbgatccggc aaacaaacca 6480 
ccgcbggbag cggbggttbb bbbgttbgca agcagcagab bacgcgcaga aaaaaaggab 6540 
ctcaagaaga bccbbtgabc bbbbcbacgg ggbcbgacgc bcagtggaac gaaaacbcac 6600 
gbbaagggat bbtggtcabg agattabcaa aaaggatcbb cacctagatc cbbtbaaatb 6660 
aaaaatgaag btttaaabca atcbaaagba tatabgagba aacbtggbct gacagtbacc 6720 
aabgcttaat cagbgaggca ccbatcbcag cgabctgtct atbbcgtbca tccabagbtg 6780 
ccbgactccc cgbcgtgbag ataacbacga tacgggaggg cbbaccabct ggccccagtg 6840 
ctgcaabgat accgcgagac ccacgctcac cggctccaga bbbatcagca abaaaccagc 6900 
cagccggaag ggccgagcgc agaagtggbc ctgcaactbb abccgccbcc atccagtcta 6960 
btaabbgtbg ccgggaagcb agagtaagba gbbcgccagb baabagbbtg cgcaacgttg 7020 
ttgccabbgc tgcaggcabc gbggtgbcac gcbcgbcgtb bggtabggct tcatbcagct 7080 
ccggttccca acgabcaagg cgagtbacab gabcccccab gbbgbgcaaa aaagcggtta 7140 
gctccbtcgg bccbccgabc gbtgbcagaa gbaagbbggc cgcagbgbba bcacbcabgg 7200 
bbabggcagc acbgcabaab bctctbacbg bcabgccabc cgtaagabgc bbbtcbgbga 7260 
cbggbgagba cbcaaccaag bcabbcbgag aatagbgbab gcggcgaccg agbbgcbctt 7320 
gcccggcgbc aacacgggat aataccgcgc cacabagcag aactttaaaa gbgctcabca 7380 
bbggaaaacg bbcbtcgggg cgaaaacbct caaggatctb accgctgbtg agabccagbb 7440 
cgatgbaacc cacbcgbgca cccaacbgab cbtcagcatc bbttacbttc accagcgbbt 7500 
cbgggbgagc aaaaacagga aggcaaaabg ccgcaaaaaa gggaabaagg gcgacacgga 7560 
aatgbbgaab acbcatacbc tbcctbttbc aababbabtg aagcabbbab cagggbbabb 7620 
gbctcatgag cggabacaba bbbgaabgba bbbagaaaaa baaacaaaba ggggtbccgc 7680 
gcacatttcc ccgaaaagbg ccaccbgacg bctaagaaac catbabbabc atgacabtaa 7740 
ccbabaaaaa baggcgbabc acgaggcccb bbcgbcbbca agaatbgbb 7789 



<210> 46 
<211> 7792 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pAdApb conbaining HIV genes 
<400> 46 

bbaabbaacc gcaabbcbca tgbbbgacag ctbabcabca tcaabaatab accbbabbbt 60 
ggatbgaagc caababgaba abgagggggb ggagbbtgbg acgbggcgcg gggcgbggga 120 
ac 9999cggg bgacgbagba gbgbggcgga agtgbgabgb tgcaagtgbg gcggaacaca 180 
tgbaagcgac ggatgbggca aaagbgacgb bbbbggtgbg cgccggtgba cacaggaagt 24 0 
gacaabbtbc gcgcggbtbb aggcggabgb bgbagbaaab bbgggcgbaa ccgagbaaga 300 
btbggccabb bbcgcgggaa aactgaabaa gaggaagbga aabcbgaaba atbbbgtgbt 360 
actcabagcg cgbaababbb gbctagggcc gcggggacbb bgaccgbbba cgbggagacb 42 0 
cgcccaggbg bbbtbctcag gbgtbtbccg cgbbccgggb caaagbbggc gbbbbatbab 480 
babagtcagb acgbaccagb gcacbggccb agagcggccc cabtgcatac gbbgbatcca 540 
batcabaaba tgbacatbba trabbggcbca bgbccaacab baccgccabg tbgacattga 600 
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ttahtgacta gbbabbaaba gtaatcaatt acggggbcab tagttcatag cccatababg 660 

gagttccgcg ttacataact bacggtaaab ggcccgcctg gctgaccgcc caacgacccc 720 

cgcccattga cgbcaabaab gacgbabgtb cccatagtaa cgccaatagg gacbbbccab 780 

tgacgtcaat gggtggagta btbacggbaa actgcccact tggcagtaca tcaagtgtat 84 0 

catatgccaa gtacgccccc tattgacgtc aatgacggba aabggcccgc ctggcattat 900 

gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt abbagbcabc 960 

gcbabbacca tggtgatgcg gttttggcag tacatcaatg ggcgtggata gcggtttgac 1020 

bcacggggab ttccaagtct ccaccccatt gacgtcaabg ggagbbbgbb btggcaccaa 1080 

aabcaacggg actbbccaaa abgtcgbaac aacbccgccc cabtgacgca aabgggcggt 1140 

aggcgbgbac ggtgggaggt ctatabaagc agagcbcgbb bagbgaaccg bcagabcgcc 1200 

bggagacgcc abccacgcbg bbbtgacctc cabagaagac accgggaccg atccagccbc 1260 

cgbcaccgbc gbcgacacgt gbgabcagab abcgcggccg cbctagaabg cgcgtgaagg 1320 

agaagbacca gcaccbgbgg cgcbggggcb ggcgctgggg caccatgctg ctgggcabgc 1380 

bgatgabctg cagcgccacc gagaagcbgt gggbgaccgb gtacbacggc gbgcccgbgt 1440 

ggaaggaggc caccaccacc cbgctcbgcg ccagcgacgc caaggccbac gacaccgagg 1500 

bgcacaacgt gtgggccacc cacgccbgcg bgcccaccga ccccaacccc caggaggbgg 1560 

bgctggbgaa cgtgaccgag aacbbcgaca bgbggaagaa cgacabggtg gagcagabgc 1620 

acgaggacab catcagccbg bgggaccaga gccbgaagcc ctgcgbgaag ctgacccccc 1680 

bgtgcgbgag cctgaagbgc accgacgcba gcaccagcbg caacaccagc gbgatcaccc 1740 

aggcctgccc caaggbgagc bbcgagccca bccccaacca ctacbgcgcc cccgccggcb 1800 

tcgccabccb gaagbgcaag gacaagaagb bcaacggcaa gggccccbgc accaacgtga 1860 

gcaccgbgca gbgcacccac ggcabccgcc ccgbggtgag cacccagcbg cbgctgaacg 1920 

gbagccbggc cgaggaggag gbggbgabcc gcagcgcbaa cbbcgccgac aacgccaagg 1980 

tgabcabcgb gcagcbgaac gagagcgbgg agatcaacbg cacccgcccc aacaacaaca 2040 

cccgcaagag catccacabc ggccccggcc gcgccbbcta caccaccggc gagabcabcg 2100 

gcgacabccg ccaggcccac tgcaaccbga gccgcgccaa gbggaacgac acccbgaaca 2160 

agabcgbgab caagctgcgc gagcagbbcg. gcaacaagac cabcgbgbbc aagcacagca 222 0 

gcggcggcga ccccgagabc gbgacccaca gcbbcaacbg cggcggcgag bbcbbctacb 2280 

gcaacagcgc bagccbgccc bgccgcatca agcagabcab caacabgtgg cagaaggbgg 2 340 

gcaaggccab gbacgccccc cccatcagcg gccagabccg cbgcagcagc aacatcaccg 2400 

gccbgcbgcb gacccgcgac ggcggcaaca gcaacaacga gagcgagatc btccgbccgg 2460 

gcggcggcga cabgcgcgac aactggcgca gcgagcbgba caagbacaag gtggbgaaga 2520 

bcgagccccb gggcgbggcc cccaccaagg ccaagcbbac cgbccaggcc cgccagcbgc 2 580 

bgagcggcab cgbgcagcag cagaacaacc bgcbgcgcgc cabcgaggcc cagcagcacc 2 64 0 

bgcbgcagcb gaccgbgbgg ggcatcaagc agcbgcaggc ccgcacccbg gccgbggagc 2700 

gcbaccbgaa ggaccagcag cagctcgagc agabcbggaa ccacaccacc bggatggagb 2760 

gggaccgcga gabcaacaac bacaccagcc bgabccacag ccbgabcgag gagagccaga 2820 

accagcagga gaagaacgag caggagcbgc tggagcbgga caagbgggcc agccbgbgga 2880 

acbggbbcaa cabcaccaac bggctgbggb gaggabccag abcbgcbgbg ccbbcbagbb 2940 

gccagccabc bgbbgbbbgc cccbcccccg bgccbbccbb gacccbggaa ggbgccacbc 3000 

ccacbgbccb bbccbaabaa aabgaggaaa bbgcabcgca btgbcbgagb aggbgbcabb 3060 

cbabbcbggg gggbggggbg gggcagcaca gcaaggggga ggabbgggaa gacaabagca 3120 

ggcabgcbgg ggabgcggtg ggctcbabgg gbacccaggg ccgcabaacb bcgtabaatg 3180 

babgcbabac gaagbbataa gabcbgbacb. gaaabgbgbg ggcgtggcbt aagggtggga 3240 

aagaababab aaggbggggg bcbbabgbag bbbbgbabcb gbbtbgcagc agccgccgcc 3300 

gccabgagca ccaacbcgbb bgabggaagc abbgbgagcb cabatbbgac aacgcgcatg 3360 

cccccabggg ccggggbgcg bcagaabgbg abgggcbcca gcabbgabgg bcgccccgtc 3420 

cbgcccgcaa acbcbactac cbbgaccbac gagaccgbgb cbggaacgcc gbbggagacb 3480 

gcagccbccg ccgccgcbbc agccgcbgca gccaccgccc gcgggatbgb gacbgacbbb 3540 

gcbbtccbga gcccgcbbgc aagcagbgca gcbbcccgtb catccgcccg cgabgacaag 3600 

bbgacggcbc bbbbggcaca abbggabbcb bbgacccggg aacbtaabgt cgbbbcbcag 3660 

cagctgbbgg abctgcgcca gcaggbbbcb gcccbgaagg cbtccbcccc bcccaabgcg 3720 

gbbbaaaaca taaabaaaaa accagactcb gbbbggatbb ggabcaagca agbgbctbgc 3780 

bgbcbbbabt baggggbbbb gcgcgcgcgg baggcccggg accagcggtc bcggbcgbbg 3840 

agggbccbgt gbatbbbbbc caggacgbgg baaaggbgac bcbggatgbb cagabacabg 3900 
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ggcataagcc cgbcbcbggg gtggaggtag 
gbgbbgbaga bgabccagbc gtagcaggag 
agtagcaagc bgabtgccag gggcaggccc 
tgggatgggt gcatacgtgg ggatatgaga 
atgttcccag ccatatccct ccggggattc 
ccggtgcact bgggaaabbb gtcatgtagc 
acgcccttgt gacctccaag atbbtccatg 
cgggcggcgg cctgggcgaa gatatttctg 
abgagatcgb cabaggccac ttttacaaag 
atggttccat ccggcccagg ggcgtagtta 
agttcagatg gggggabcat gtctacctgc 
ggggagatca gctgggaaga aagcaggttc 
ggcccgtaaa tcacacctat taccggctgc 
bcatccctga gcaggggggc cacttcgtta 
accaaatccg ccagaaggcg ctcgccgccc 
bttbtcaacg gtbbgagacc gtccgccgta 
tccaggcggt cccacagctc ggbcaccbgc 
cgbttcgcgg gttggggcgg cttbcgcbgt 
ccagggtcab gtcbttccac gggcgcaggg 
aggggtgcgc tccgggcbgc gcgctggcca 
tgaagcgctg ccggtcttcg ccctgcgcgt 
agtccagccc ctccgcggcg bggccctbgg 
acgaggggca gtgcagacbb ttgagggcgt 
gggagbaggc atccgcgccg caggccccgc 
gctctggccg btcggggtca aaaaccaggt 
ctcbggbbtc catgagccgg bgbccacgct 
atacagactb gagaggccbg bcctcgagcg 
cggaccactc bgagacaaag gctcgcgbcc 
ggbagcggbc gbtgbccact agggggtcca 
ccbcbtcggc abcaaggaag gtgatbggtt 
cbgaaggggg gctataaaag ggggbggggg 
bgbctgcgag ggccagctgt bggggbgagb 
tgattcttcb cgctbccggc ggcabcggga 
aggtagabga cgaccabcag ggacagcbbc 
aggccgcgbb gcbggcgttb ttccabaggc 
gacgcbcaag bcagaggtgg cgaaacccga 
cbggaagcbc ccbcgbgcgc bcbccbgtbc 
cctttcbccc tbcgggaagc gtggcgcbtt 
cggtgbaggb cgtbcgctcc aagctgggct 
gcbgcgccbb abccggbaac babcgbcttg 
cacbggcagc agccacbggb aacaggabta 
agbbcttgaa gbggtggcct aactacggct 
ctcbgcbgaa gccagbtacc ttcggaaaaa 
ccaccgctgg tagcggbggb ttttttgbtt 
gabcbcaaga agabcctbbg atctbbtcba 
cacgttaagg gatbbtggbc abgagabbat 
atbaaaaabg aagbtbtaaa tcaafcctaaa 
accaatgctt aabcagbgag gcacctatct 
bbgccbgacb ccccgbcgbg bagataacba 
gtgcbgcaab gabaccgcga gacccacgcb 
agccagccgg aagggccgag cgcagaagbg 
ctabbaabbg btgccgggaa gcfcagagbaa 
ttgtbgccat tgctgcaggc atcgtggtgt 
gctccggbbc ccaacgabca aggcgagtta 
ttagctccbt cggbcctccg atcgbbgtca 



caccactgca gagcbtcatg ctgcggggbg 3960 
cgctgggcgb ggbgccbaaa aatgtcbtbc 4020 
bbggtgtaag tgbbbacaaa gcggttaagc 4080 
bgcatcbbgg acbgbatbtt baggbtggcb 4140 
abgtbgtgca gaaccaccag cacagtgtab 4200 
bbagaaggaa abgcgtggaa gaacttggag 4260 
cabtcgbcca baabgabggc aabgggccca 4320 
ggatcacbaa cgtcatagtb gbgttccagg 4380 
cgcgggcgga gggbgccaga cbgcggbaba 44 40 
cccbcacaga bbtgcatttc ccacgctttg 4500 
ggggcgabga agaaaacggt bbccggggba 4560 
cbgagcagct gcgacbbacc gcagccggbg 4620 
aacbggbagt taagagagct gcagcbgccg 4680 
agcatgbccc tgacbcgcat gttbtcccbg 4740 
agcgatagca gbbcbtgcaa ggaagcaaag 4800 
ggcabgcbtb tgagcgbbbg accaagcagb 4860 
tcbacggcab ctcgatccag cabatcbccb 4920 
acggcagbag tcggtgcbcg bccagacggg 4980 
bccbcgbcag cgtagtctgg gtcacggbga 5040 
gggbgcgcbb gaggctggtc cbgctggbgc 5100 
cggccaggta gcabbtgacc atggtgbcab 5160 
cgcgcagctb gcccbbggag gaggcgccgc 5220 
agagctbggg cgcgagaaab accgattccg 5280 
agacggbctc gcabbccacg agccaggtga 5340 
bbcccccatg cbtbbbgatg cgtttctbac 5400 
cggtgacgaa aaggcbgtcc gtgtccccgt 54 60 
gbgtbccgcg gtccbcctcg babagaaact 5520 
aggccagcac gaaggaggcb aagbgggagg 5580 
cbcgcbccag ggtgbgaaga cacatgbcgc 5640 
bgtaggbgta ggccacgtga ccgggbgbbc 5700 
cgcgttcgbc cbcacbctcb bccgcatcgc 5760 
cgacgcgagg ctggabggcc bbccccabba 5820 
bgcccgcgtb gcaggccabg ctgbccaggc S880 
aaggccagca aaaggccagg aaccgbaaaa 5940 
tccgcccccc bgacgagcab cacaaaaabc 6000 
caggacbafca aagabaccag gcgbtbcccc 6060 
cgacccbgcc gctbaccgga baccbgtccg 6120 
cbcatagctc acgctgtagg babcbcagbb 6180 
gtgbgcacga accccccgtb cagcccgacc 6240 
agbccaaccc ggbaagacac gacbtabcgc 6300 
gcagagcgag gbatgbaggc ggbgcbacag 6360 
acacbagaag gacagbabbb ggbatcbgcg 6420 
gagbbggbag cbctbgatcc ggcaaacaaa 6480 
gcaagcagca gabbacgcgc agaaaaaaag 6540 
cggggtcbga cgcbcagbgg aacgaaaacb 6600 
caaaaaggab cbbcaccbag abcctbbbaa 6660 
gbababatga gbaaacbtgg bctgacagtb 6720 
cagcgabctg tcbabbbcgt bcabccabag 6780 
cgabacggga gggcbbacca tctggcccca 6840 
caccggcbcc agabbbabca gcaataaacc 6900 
gbccbgcaac tbtabccgcc tccatccagb 6960 
gbagbbcgcc agttaatagb btgcgcaacg 7020 
cacgcbcgbc gbtbggbabg gctbcabbca 7080 
cabgabcccc cabgbbgbgc aaaaaagcgg 714 0 
gaagbaagtb ggccgcagbg btatcacbca 7200 
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tggttatggc agcactgcat aattctctta cbgtcatgcc atccgtaaga tgcbbtbcbg 7260 
tgacbggtga gtacbcaacc aagtcattct gagaatagtg babgcggcga ccgagttgct 7320 
cttgcccggc gtcaacacgg gataataccg cgccacatag cagaacttta aaagtgctca 7380 
tcafctggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg btgagabcca 7440 
gttcgatgta acccactcgt gcacccaact gabcbtcagc atcbbbbacb ttcaccagcg 7500 
bbbcbgggbg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac 7560 
ggaaatgttg aabacbcata cbcbbccbbb ttcaatatta ttgaagcatt babcagggbt 7620 
attgtctcat gagcggatac atatttgaat gtabttagaa aaataaacaa abaggggbtc 7680 
cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga aaccabbatb atcatgacat 7740 
baaccbabaa aaabaggcgb atcacgaggc ccbbbcgtcb bcaagaabtg bb 7792 



<210> 47 
<211> 7828 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pAdApb containing HIV genes 
<400> 47 

ttaattaacc gcaatbcbca tgttbgacag cbbatcabca bcaataabab accbbatbtb 60 
ggatbgaagc caababgata abgagggggb ggagbttgbg acgtggcgcg gggcgtggga 12 0 
acggggcggg tgacgbagba gtgtggcgga agtgtgatgt bgcaagtgbg gcggaacaca 180 
tgtaagcgac ggabgbggca aaagbgacgt tbttggtgtg cgccggbgta cacaggaagt 24 0 
gacaatbbtc gcgcggbtbt aggcggabgt tgtagbaaat bbgggcgbaa ccgagbaaga 3 00 
bbbggccabt ttcgcgggaa aactgaabaa gaggaagbga aabctgaaba atbbtgbgtt 360 
acbcatagcg cgbaatabbb gbctagggcc gcggggacbt bgaccgbtba cgbggagact 420 
cgcccaggbg ttbbbcbcag gbgbtbbccg cgttccgggt caaagbtggc gbtbtatbat 480 
tatagtcagt acgtaccagb gcacbggcct agagcggccc cattgcatac gbtgtabcca 540 
tatcataata tgbacatbba tabbggcbca tgtccaacat baccgccabg btgacatbga 600 
ttattgacta gtbatbaaba gbaatcaabb acggggbcat bagbbcabag cccatabatg 660 
gagtbccgcg tbacabaacb tacggbaaat ggcccgccbg gcbgaccgcc . caacgacccc 720 
cgcccabbga cgbcaabaat gacgbabgtt cccatagbaa cgccaabagg gactttccat 780 
bgacgbcaat gggbggagba tbbacggbaa acbgcccacb bggcagbaca bcaagbgbat 84 0 
catatgccaa gtacgccccc tabbgacgbc aabgacggba aabggcccgc cbggcabbat 900 
gcccagbaca tgacctbatg ggacttbccb acbbggcagb acabctacgb abtagbcabc 960 
gctabtacca bggbgatgcg gbbbbggcag bacatcaabg ggcgtggaba gcggbttgac 1020 
tcacggggat tbccaagtcb ccaccccatb gacgbcaabg ggagbtbgtb bbggcaccaa 1080 
aabcaacggg acbbbccaaa atgtcgbaac aactccgccc cabtgacgca aabgggcggb 1140 
aggcgbgbac ggbgggaggb cbabataagc agagctcgbb tagbgaaccg tcagabcgcc 1200 
bggagacgcc atccacgctg bbbbgaccbc cabagaagac accgggaccg atccagccbc 12 60 
cgtcaccgtc gtcgacacgt gbgatcagab abcgcggccg cbcbagaabg cgcgbgaagg 1320 
agaagtacca gcacctgtgg cgcbggggcb ggcgcbgggg caeca bgcbg cbgggcabgc 13 80 
bgabgabctg cagcgccacc gagaagebgb gggbgaccgt gbactaegge gtgcccgbgb 1440 
ggaaggaggc caccaccacc ctgcbcbgcg ccagcgacgc caaggcctac gacaccgagg 1500 
bgcacaacgt gtgggccacc cacgccbgcg fcgcccaccga ccccaacccc caggaggbgg 1560 
bgctggbgaa cgtgaccgag aacttcgaca tgbggaagaa cgacabggbg gagcagabgc 1620 
acgaggacat catcagcctg bgggaccaga gccbgaagcc cbgcgbgaag ctgacccccc 1680 
tgbgegbgag ccbgaagtgc accgacgcba gcaagaacbg cagctbcaac atcagcacca 1740 
gcatccgcgg caaggbgcag aaggagbacg ccbbcttcba caagebggae atcabcccca 1800 
begacaaega caccaccagc bacagccfcga ccagcbgcaa caccagcgtg atcacccagg 1860 
ccbgccccaa ggbgagcttc gagcccabcc ccaaccacba ctgcgccccc gccggcbbcg 1920 
ccabccbgaa gtgeaaggae aagaagbbca aeggcaaggg ccccbgcacc aaegbgagea 1980 
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ccgtgcagtg cacccacggc atccgccccg 
gcctggccga ggaggaggtg gtgabccgca 
bcatcgtgca gctgaacgag agcgtggaga 
acctgagccg cgccaagtgg aacgacaccc 
agttcggcaa caagaccabc gtgttcaagc 
cccacagctt caactgcggc ggcgagbbcb 
gcatcaagca gatcatcaac atgtggcaga 
tcagcggcca gatccgctgc agcagcaaca 
gcaacagcaa caacgagagc gagatcttcc 
ggcgcagcga gctgtacaag tacaaggtgg 
ccaaggccaa gcbbaccgbc caggcccgcc 
acaaccbgcb gcgcgccabc gaggcccagc 
bcaagcagcb gcaggcccgc accctggccg 
tcgagcagat ctggaaccac accaccbgga 
ccagcctgat ccacagcctg atcgaggaga 
agctgctgga gctggacaag tgggccagcc 
tgtggtgagg abccagatct gcbgbgccbb 
cccccgtgcc ttccttgacc ctggaaggtg 
aggaaattgc atcgcabbgt ctgagtaggt 
agcacagcaa gggggaggat tgggaagaca 
ctatgggtac ccagggccgc ataacfctcgt 
tgtactgaaa tgtgtgggcg bggcbbaagg 
abgbagbbbb gbabcbgbbb bgcagcagcc 
ggaagcattg tgagctcata tttgacaacg 
aatgtgatgg gcbccagcat tgatggtcgc 
acctacgaga ccgbgbcbgg aacgccgttg 
gctgcagcca ccgcccgcgg gattgtgact 
agtgcagctt cccgttcatc cgcccgcgat 
gabbcbbtga cccgggaacb baabgbcgbt 
gtttctgccc tgaaggcttc ctcccctccc 
gacbcbgtbb ggabbbggab caagcaagtg 
gcgcggtagg cccgggacca gcggtctcgg 
acgtggtaaa ggtgactctg gabgbtcaga 
aggtagcacc actgcagagc bbcabgcbgc 
caggagcgct gggcgbggbg cctaaaaatg 
aggcccttgg bgbaagtgbb tacaaagcgg 
atgagatgca bcbbggacbg tatttttagg 
ggattcatgt tgtgcagaac caccagcaca 
bgbagcbbag aaggaaatgc gtggaagaac 
tccatgcatt cgtccataat gabggcaatg 
tttctgggat cactaacgtc atagttgtgt 
acaaagcgcg ggcggagggb gccagactgc 
tagttaccct cacagafcttg cabbbcccac 
acctgcgggg cgatgaagaa aacggtttcc 
aggttcctga gcagchgcga cttaccgcag 
ggctgcaact ggtagttaag agagcbgcag 
tcgttaagca tgtccctgac bcgcabgbfcb 
ccgcccagcg atagcagttc tbgcaaggaa 
gccgbaggca tgctthtgag cgbbbgacca 
accbgcbcta cggcatctcg atccagcata 
cgctgtacgg cagbagtcgg tgctcgtcca 
gcagggtcct cgbcagcgba gtctgggtca 
tggccagggt gcgcttgagg ctggtcctgc 
gcgcgtcggc caggbagcab ttgaccatgg 
ccttggcgcg cagcttgccc ttggaggagg 



tggtgagcac ccagctgctg ctgaacggta 2040 
gcgctaactt cgccgacaac gccaaggtga 2100 
tcaactgcac ccgcgctagc gcccactgca 2160 
tgaacaagat cgtgatcaag ctgcgcgagc 2220 
acagcagcgg cggcgacccc gagatcgtga 2280 
tcfcactgcaa cagcgctagc chgccctgcc 2340 
aggtgggcaa ggccatgtac gcccccccca 2400 
tcaccggcct gctgctgacc cgcgacggcg 2460 
gtccgggcgg cggcgacatg cgcgacaact 2 52 0 
tgaagatcga gcccctgggc gtggccccca 2580 
agctgctgag cggcatcgtg cagcagcaga 2640 
agcacctgct gcagctgacc gtgtggggca 2700 
tggagcgcta cctgaaggac cagcagcagc 2760 
tggagtggga ccgcgagatc aacaactaca 2820 
gccagaacca gcaggagaag aacgagcagg 2 88 0 
tgtggaactg gttcaacatc accaactggc 2940 
cfcagttgcca gccatctgtt gtttgcccct 3000 
ccactcccac tgtcctttcc taataaaatg 3060 
gtcattctat tctggggggb ggggtggggc 3120 
atagcaggca tgctggggat gcggtgggct 3180 
ataatgtatg ctatacgaag ttataagatc 3240 
gtgggaaaga atatataagg tgggggtctt 3300 
gccgccgcca tgagcaccaa cfccgtttgat 3360 
cgcatgcccc catgggccgg ggtgcgtcag 3420 
cccgtcctgc ccgcaaactc tactaccttg 3480 
gagactgcag cctccgccgc cgcttcagcc 3540 
gactttgctt tcctgagccc gcttgcaagc 3600 
gacaagttga cggctctttt ggcacaattg 3660 
tctcagcagc tgttggatct gcgccagcag 3720 
aatgcggttfc aaaacataaa taaaaaacca 3780 
tctfcgctgtc bttatttagg ggbbbbgcgc 3840 
bcgbbgaggg bccbgtgbat bbbbbccagg 3900 
bacabgggca baagcccgtc bcbggggbgg 3960 
ggggbggbgb bgtagabgat ccagtcgbag 4 02 0 
tcbbbcagta gcaagcbgat tgccaggggc 4080 
tbaagcbggg abgggbgcab acgbggggab 414 0 
tbggcbabgb bcccagccab abcccbccgg 4200 
gbgbatccgg bgcacbbggg aaatbbgtca 4260 
bbggagacgc ccbbgbgacc bccaagatbt 4320 
ggcccacggg cggcggccbg ggcgaagaba 4380 
bccaggabga gabcgbcata ggccabbbbb 4440 
ggbabaabgg bbccabccgg cccaggggcg 4500 
gcbbbgagbb cagabggggg gabcabgbcb 4560 
ggggbagggg agabcagctg ggaagaaagc 4620 
ccggtgggcc cgtaaatcac acctabbacc 4680 
cbgccgbcab cccbgagcag gggggccacb 4740 
bcccbgacca aabccgccag aaggcgcbcg 4 800 
gcaaagbtbb bcaacggbtb gagaccgbcc 4860 
agcagbbcca ggcggbccca cagcbcggbc 4920 
bcbccbcgbb bcgcgggbtg gggcggcbbt 4 980 
gacgggccag ggbcabgbcb bbccacgggc 5040 
c 99tgaaggg gtgcgctccg ggcbgcgcgc 5100 
bggbgcbgaa gcgcbgccgg bcbbcgcccb 5160 
bgbcabagbc cagccccbcc gcggcgbggc 5220 
cgccgcacga ggggcagbgc agacbbbbga 5280 
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gggcgtagag cttgggcgcg agaaataccg attccgggga gtaggcabcc gcgccgcagg 534 0 
ccccgcagac ggbcbcgcab tccacgagcc aggtgagctc tggccgttcg gggbcaaaaa 54 00 
ccaggtttcc cccatgctbb bbgabgcgbb tcttacctct ggtttccatg agccggtgtc 5460 
cacgcbcggb gacgaaaagg cbgbccgtgb ccccgtatac agacttgaga ggccbgbccb 5520 
cgagcggtgt tccgcggtcc bcctcgbaba gaaactcgga ccactctgag acaaaggctc 5580 
gcgtccaggc cagcacgaag gaggcbaagb gggaggggta gcggtcgbtg tccactaggg 564 0 
ggtccactcg ctccagggtg tgaagacaca tgtcgccctc ttcggcatca aggaaggtga 5700 
bbggbbbgta ggtgtaggcc acgtgaccgg gtgttcctga aggggggcta taaaaggggg 5760 
tgggggcgcg bbcgbcctca ctctcttccg catcgctgtc tgcgagggcc agctgbtggg 5820 
gtgagtcgac gcgaggctgg atggccttcc ccabtatgab tcttctcgct tccggcggca 5880 
tcgggatgcc cgcgttgcag gccabgcbgb ccaggcaggt agatgacgac catcagggac 5940 
agcttcaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc 6000 
ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa 6060 
acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc gbgcgcbcbc 6120 
ctgttccgac ccbgccgcbb accggatacc bgbccgccbb bcbcccbbcg ggaagcgtgg 6180 
cgctbtctca tagctcacgc bgbaggbatc bcagbbcggt gbaggbcgbb cgctccaagc 624 0 
tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc 6300 
gbcbbgagbc caacccggta agacacgact tatcgccact ggcagcagcc acbggbaaca 6360 
ggattagcag agcgaggtat gtaggcggtg cbacagagtb cttgaagtgg tggcctaact 6420 
acggcbacac tagaaggaca gbabbbggba bcbgcgcbcb gctgaagcca gttaccttcg 6480 
gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt 6540 
tbgbbbgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct 6600 
tttctacggg gtctgacgct cagbggaacg aaaactcacg tbaagggabb ttggtcatga 6660 
gattatcaaa aaggatcttc acctagatcc bbbtaaabba aaaatgaagt tttaaatcaa 6720 
tctaaagtat atatgagtaa acbbggbcbg acagbbacca abgcbbaabc agbgaggcac 6780 
ctabcbcagc gabcbgbcba bbbcgbtcab ccabagbtgc cbgacbcccc gtcgbgbaga 6840 
taactacgat acgggagggc bbaccabcbg gccccagbgc tgcaatgaba ccgcgagacc 6900 
cacgcbcacc ggcbccagab bbabcagcaa baaaccagcc agccggaagg gccgagcgca 6960 
gaagbggbcc bgcaacbbba bccgccbcca bccagbcbab baabbgtbgc cgggaagcba 7020 
gagbaagbag bbcgccagtb aabagbtbgc gcaacgtbgb bgccabbgcb gcaggcatcg 7080 
tggbgbcacg ctcgbcgbbb ggtatggcbb cabbcagcbc cggbbcccaa cgabcaaggc 7140 
gagbbacabg abcccccabg bbgbgcaaaa aagcggbtag ctccbbcggb . ccbccgatcg 7200 
tbgbcagaag baagbbggcc gcagbgbbat cacbcabggb babggcagca cbgcabaabb 7260 
cbcbtacbgb cabgccabcc gbaagatgct bbtcbgbgac bggbgagbac bcaaccaagb 7320 
cabbcbgaga atagbgtabg cggcgaccga gbtgcbcbbg cccggcgbca acacgggaba 7380 
abaccgcgcc acabagcaga acbbbaaaag bgcbcabcab bggaaaacgb bcbbcggggc 7440 
gaaaacbctc aaggabcbba ccgcbgbbga gabccagbbc gabgbaaccc actcgtgcac 7500 
ccaacbgabc bbcagcabcb bbbacbtbca ccagcgtbbc bgggbgagca aaaacaggaa 7560 
ggcaaaabgc cgcaaaaaag ggaataaggg cgacacggaa abgtbgaaba cbcabacbcb 7620 
bccbbbbbca abab.batbga agcatbtatc agggbbabbg bcbcabgagc ggabacabab 7680 
bbgaabgbab bbagaaaaab aaacaaabag gggbbccgcg cacabbbccc cgaaaagbgc 7740 
cacctgacgb cbaagaaacc abbabbatca bgacabbaac cbataaaaab aggcgbabca 7800 
cgaggccctb bcgbcbbcaa gaabbgbb 7828 



<210> 48 
<211> 7789 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pAdApb containing HIV genes 
<400> 48 
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ttaattaacc gcaattctca tgtttgacag cttatcahca tcaataatat accttabbtt 60 

ggattgaagc caatatgata atgagggggt ggagtbbgbg acgtggcgcg gggcgtggga 12 0 

acggggcggg tgacgbagba gtgtggcgga agbgbgabgb tgcaagtgtg gcggaacaca 180 

tgtaagcgac ggatgtggca aaagtgacgt ttttggtgtg cgccggtgta cacaggaagt 24 0 

gacaatbbtc gcgcggtttt aggcggabgt bgbagbaaat ttgggcgtaa ccgagtaaga 300 

btbggccabb ttcgcgggaa aacbgaabaa gaggaagtga aatctgaaba abbtbgbgbt 360 

actcatagcg cgtaatattt gtctagggcc gcggggactt bgaccgbbba cgtggagact 420 

cgcccaggtg tttbbcbcag gtgbbbtccg cgbbccgggt caaagtbggc gttbtattab 480 

batagtcagb acgtaccagt gcactggccb agagcggccc cabtgcafcac gbtgtatcca 540 

tatcataaba tgtacabbba tabfcggctca bgbccaacat taccgccabg tbgacabbga 600 

btattgacba gttatbaata gtaabcaatt acggggtcat bagbtcatag cccabababg 660 

gagbbccgcg ttacabaacb bacggtaaat ggcccgccbg gcbgaccgcc caacgacccc 720 

cgcccattga cgtcaabaab gacgtabgbt cccabagbaa cgccaatagg gacbbtccab 780 

tgacgbcaab gggtggagta bbtacggbaa actgcccacb bggcagtaca tcaagtgtat 840 

catabgccaa gbacgccccc batbgacgbc aatgacggta aabggcccgc cbggcabtab 900 

gcccagbaca tgacctbatg ggacbbbcct acttggcagb acabcbacgb abbagbcabc 960 

gctatbacca bggbgabgcg gttbbggcag tacatcaatg ggcgbggata gcggbbbgac 1020 

bcacggggat btccaagbcb ccaccccabb gacgbcaatg ggagtttgbt tbggcaccaa 1080 

aatcaacggg actttccaaa atgtcgtaac aactccgccc catbgacgca aatgggcggb 114 0 

aggcgbgtac ggtgggaggb ctatabaagc agagcbcgtb tagbgaaccg tcagatcgcc 1200 

tggagacgcc atccacgctg ttttgacctc cabagaagac accgggaccg abccagccbc 1260 

cgtcaccgtc gtcgacacgt gbgabcagab abcgcggccg cbctagaabg cgcgtgaagg 1320 

agaagbacca gcaccbgtgg cgctggggct ggcgcbgggg caccabgcbg cbgggcatgc 13 80 

tgabgabctg cagcgccacc gagaagcbgb gggbgaccgb gbacbacggc gbgcccgtgb 144 0 

ggaaggaggc caccaccacc cbgctctgcg ccagcgacgc caaggccbac gacaccgagg 1500 

bgcacaacgt gbgggccacc cacgcctgcg bgcccaccga ccccaacccc caggaggtgg 1560 

bgcbggtgaa cgbgaccgag aacttcgaca tgtggaagaa cgacatggtg gagcagatgc 1620 

acgaggacat cabcagcctg bgggaccaga gccbgaagcc ctgcgtgaag cbgacccccc 1680 

bgtgcgbgag ccbgaagtgc accgacctga agaacgacac caacaccaac agcagcagcg 174 0 

gccgcabgat catggagaag ggcgagatca agaacbgcag cbtcgcbagc accagcbgca 1800 

acaccagcgt gabcacccag gcctgcccca aggbgagcbb cgagcccatc cccaaccact 1860 

acbgcgcccc cgccggcttc gccabccbga agbgcaagga caagaagttc aacggcaagg 192 0 

gccccbgcac caacgtgagc accgbgcagb gcacccacgg catccgcccc gbggbgagca 1980 

cccagcbgct gcbgaacggb agccbggccg aggaggaggt ggbgabccgc agcgcbaacb 2040 

bcgccgacaa cgccaaggbg abcatcgbgc agctgaacga gagcgbggag abcaacbgca 2100 

cccgcgcbag cgcccactgc aacctgagcc gcgccaagbg gaacgacacc ctgaacaaga 2160 

tcgbgabcaa gcbgcgcgag cagttcggca acaagaccab cgbgtbcaag cacagcagcg 222 0 

gcggcgaccc cgagatcgtg acccacagcb tcaactgcgg cggcgagtbc tbcbacbgca 2280 

acagcgcbag ccbgcccbgc cgcabcaagc agabcatcaa catgbggcag aaggtgggca 2340 

aggccabgba cgcccccccc abcagcggcc agabccgcbg cagcagcaac atcaccggcc 2400 

bgctgcbgac ccgcgacggc ggcaacagca acaacgagag cgagatcbbc cgtccgggcg 24 60 

gcggcgacab gcgcgacaac bggcgcagcg agcbgbacaa gbacaaggbg gbgaagabcg 2520 

agcccctggg cgbggccccc accaaggcca agctbaccgb ccaggcccgc cagcbgcbga 2580 

gcggcabcgb gcagcagcag aacaaccbgc tgcgcgccat cgaggcccag cagcaccbgc 2640 

bgcagctgac cgtgbggggc abcaagcagc tgcaggcccg caccctggcc gtggagcgct 2700 

acctgaagga ccagcagcag cbcgagcaga tctggaacca caccacctgg abggagtggg 2760 

accgcgagab caacaacbac accagcctga tccacagccb gatcgaggag agccagaacc 2820 

agcaggagaa gaacgagcag gagcbgcbgg agctggacaa gbgggccagc ctgbggaact 2880 

ggtbcaacat caccaactgg cbgtggtgag gatccagabc bgcbgtgccb bcbagbbgcc 2940 

agccabcbgb bgtttgcccc bcccccgtgc cbtccbtgac ccbggaaggb gccacbccca 3000 

cbgbcctbbc ctaabaaaab gaggaaatbg catcgcabbg bcbgagbagg bgtcabbcba 3060 

tbctgggggg bggggbgggg cagcacagca agggggagga bbgggaagac aafcagcaggc 312 0 

abgcbgggga bgcggbgggc bctabgggba cccagggccg cabaacbbcg babaabgbab 3180 

gcbabacgaa gbtabaagab cbgbacbgaa atgbgbgggc gbggcbbaag ggtgggaaag 3240 

aatabataag gtgggggtcb babgtagbbb bgbatctgbb bbgcagcagc cgccgccgcc 3300 
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atgagcacca actcgtttga tggaagcatt 
ccatgggccg gggtgcgtca gaatgtgatg 
cccgcaaact cbacbaccbb gacctacgag 
gccbccgccg ccgcttcagc cgcbgcagcc 
bbccbgagcc cgcttgcaag cagtgcagct 
acggctcttt tggcacaatt ggattctttg 
cbgtbggatc tgcgccagca ggbbbcbgcc 
taaaacataa ataaaaaacc agactctgtt 
ctbbabbbag gggttttgcg cgcgcggtag 
gtcctgtgta ttttttccag gacgbggbaa 
abaagcccgt ctcbggggbg gaggbagcac 
bbgtagabga tccagtcgta gcaggagcgc 
agcaagctga ttgccagggg caggcccttg 
gatgggbgca tacgtgggga tatgagatgc 
ttcccagcca batcccbccg gggattcatg 
gtgcacttgg gaaatbbgbc atgtagctta 
cccttgtgac cbccaagabb ttccatgcat 
gcggcggcct gggcgaagat atttctggga 
agabcgtcat aggccabtbt bacaaagcgc 
gtbccatccg gcccaggggc gbagtbaccc 
tcagabgggg ggabcabgtc baccbgcggg 
gagabcagcb gggaagaaag caggbtcctg 
ccgbaaatca cacctabtac cggcbgcaac 
bcccbgagca ggggggccac bbcgbtaagc 
aaabccgcca gaaggcgctc gccgcccagc 
bbcaacggbt tgagaccgtc cgccgtaggc 
aggcggtccc acagcbcggt caccbgctct 
btcgcgggbt ggggcggcbt bcgctgbacg 
gggtcabgbc bbbccacggg cgcagggtcc 
ggtgcgcbcc gggctgcgcg cbggccaggg 
agcgcbgccg gtcbbcgccc bgcgcgbcgg 
ccagccccbc cgcggcgbgg cccbbggcgc 
aggggcagbg cagacbbbbg agggcgbaga 
agtaggcatc cgcgccgcag gccccgcaga 
cbggccgbtc ggggbcaaaa accaggbbbc 
bggbbbccat gagccggbgb ccacgcbcgg 
cagacbbgag aggccbgbcc bcgagcggbg 
accacbcbga gacaaaggcb cgcgbccagg 
agcggtcgbb gbccactagg gggbccactc 
cbbcggcatc aaggaaggtg abtggtttgb 
aaggggggct ataaaagggg gbgggggcgc 
cbgcgagggc cagcbgbbgg ggbgagbcga 
btcbtcbcgc tbccggcggc abcgggabgc 
bagabgacga ccabcaggga cagcbbcaag 
ccgcgttgcb ggcgbtbbtc cataggcbcc 
gctcaagbca gaggtggcga aacccgacag 
gaagctcccb cgbgcgcbcb ccbgbbccga 
bbcbcccbtc gggaagcgbg gcgcbbbcbc 
bgbaggbcgb bcgcbccaag cbgggctgtg 
gcgccfcbatc cggbaacbab cgtctbgagb 
tggcagcagc cactggbaac aggabbagca 
bcbtgaagbg gbggcctaac bacggctaca 
tgctgaagcc agtbaccbbc ggaaaaagag 
ccgctggbag cggbggbbtb btbgbtbgca 
cbcaagaaga tccbbbgabc bbbtctacgg 



gtgagcbcab abbbgacaac gcgcatgccc 3360 
ggcbccagca bbgabggbcg ccccgbccbg 3420 
accgbgbcbg gaacgccgbb ggagacbgca 34 80 
accgcccgcg ggabtgtgac bgacttbgcb 3540 
bcccgbbcab ccgcccgcga bgacaagbtg 3600 
acccgggaac bbaatgtcgb bbcbcagcag 3660 
cbgaaggcbb ccbcccctcc caabgcggbt 3720 
bggabbbgga bcaagcaagb gbcbbgcbgt 3780 
gcccgggacc agcggbcbcg gbcgbbgagg 3840 
aggbgacbcb ggabgbbcag atacabgggc 3900 
cacbgcagag cbbcabgcbg cggggtggbg 3960 
bgggcgbggb gccbaaaaab gbctbbcagb 4020 
gbgbaagbgb bbacaaagcg gbbaagcbgg 4080 
abcbtggact gbabbbbbag gbbggcbabg 4140 
btgbgcagaa ccaccagcac agbgbabccg 4200 
gaaggaaatg cgbggaagaa cbbggagacg 42 60 
bcgbccabaa bgatggcaab gggcccacgg 4320 
bcacbaacgb cabagbbgbg bbccaggabg 4380 
999 c 99 a 999 bgccagacbg cggbabaabg 4440 
bcacagabtb gcatbbccca cgcbbbgagb 4500 
gcgabgaaga aaacggttbc cggggbaggg 4560 
agcagctgcg acbbaccgca gccggbgggc 4 620 
bggbagtbaa gagagcbgca gcbgccgbca 4680 
abgtcccbga cbcgcabgbb bbccctgacc 4740 
gabagcagtb cbbgcaagga agcaaagbtb 4 800 
abgctbbbga gcgbbtgacc aagcagbbcc 4860 
acggcabcbc gabccagcab atcbccbcgb 4920 
gcagbagbcg gbgcbcgtcc agacgggcca 4980 
bcgtcagcgb agbcbgggbc acggbgaagg 504 0 
bgcgcbbgag gcbggtccbg ctggbgcbga 5100 
ccaggbagca bbbgaccabg gtgbcabagb 5160 
gcagctbgcc cbbggaggag gcgccgcacg 5220 
gcbbgggcgc gagaaabacc gabbccgggg 5280 
cggbcbcgca bbccacgagc caggbgagcb 5340 
ccccabgcbb bbtgabgcgb bbcbbaccbc 5400 
bgacgaaaag gctgtccgtg bccccgbaba 54 60 
bbccgcggbc cbccbcgbat agaaacbcgg 5520 
ccagcacgaa ggaggcbaag bgggaggggb 5580 
gcbccagggb gbgaagacac abgbcgcccb 5640 
aggbgbaggc cacgbgaccg ggbgbbccbg 5700 
gbbcgbccbc acbctctbcc gcabcgcbgb 5760 
cgcgaggcbg gabggccbbc cccatbabga 5820 
ccgcgbbgca ggccabgcbg bccaggcagg 5880 
gccagcaaaa ggccaggaac cgbaaaaagg 5940 
gccccccbga cgagcabcac aaaaabcgac 6000 
gacbabaaag abaccaggcg bbtcccccbg 6060 
cccbgccgcb baccggabac ctgbccgcct 6120 
abagcbcacg ctgbaggbab cbcagbtcgg 6180 
bgcacgaacc ccccgbbcag cccgaccgct 6240 
ccaacccggb aagacacgac bbabcgccac 6300 
gagcgaggba bgbaggcggb gcbacagagb 6360 
cbagaaggac agbabbbggb abcbgcgcbc 6420 
bbggbagcbc tbgatccggc aaacaaacca 6480 
agcagcagat bacgcgcaga aaaaaaggab 654 0 
ggbcbgacgc bcagbggaac gaaaacbcac 6600 
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gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt 6660 
aaaaatgaag ttttaaatca atctaaagta hatatgagta aacttggtct gacagttacc 6720 
aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttea tccatagttg 6780 
cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 6840 
ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc 6900 
cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 6960 
ttaattgttg ccgggaagct agagfcaagta gttcgccag.t taatagtttg cgcaacgttg 7020 
ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct ccattcagct 7080 
ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 7140 
gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 7200 
ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 7260 
ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgcfcctt 7320 
gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa gfcgctcatca 7380 
ttggaaaacg ttcttcgggg cgaaaacfcct caaggatctt accgctgttg agatccagtt 744 0 
cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 7500 
ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 7560 
aatgttgaat actcabactc ttcctttttc aatattattg aagcatttat cagggttatt 7620 
gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 7680 
gcacattccc ccgaaaagtg ccacctgacg tctaagaaac cabtattatc atgacattaa 774 0 
cctataaaaa taggcgtatc acgaggccct ttcgtcttca agaattgtt 7789 



<210> 49 
<211> 7711 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pAdApt containing HIV genes 
<400> 49 

ttaattaacc gcaattctca tgtttgacag cttatcatca tcaataatat accttatttt 60 
ggattgaagc caatatgata atgagggggt ggagtttgtg acgtggcgcg gggcgtggga 120 
acggggcggg tgacgtagta gtgtggcgga agtgtgatgt tgcaagtgtg gcggaacaca 180 
tgtaagcgac ggatgtggca aaagtgacgt ttttggtgtg cgccggtgta cacaggaagt 24 0 
gacaattttc gcgcggtttt aggcggatgt tgtagtaaat ttgggcgtaa ccgagtaaga 300 
tttggccatt ttcgcgggaa aactgaataa gaggaagtga aatctgaata attttgtgtt 360 
actcatagcg cgtaatattt gtctagggcc gcggggactt tgaccgttta cgtggagact 420 
cgcccaggtg tttttctcag gtgttttccg cgttccgggt caaagttggc gttttattat 4B0 
tatagtcagt acgtaccagt gcactggcct agagcggccc cattgcatac gttgtatcca 540 
tatcataata tgtacattta tattggctca tgtccaacat taccgccatg ttgacattga 600 
ttattgacta gttattaata gtaatcaatt acggggtcat tagttcatag cccatatatg 660 
gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc caacgacccc 720 
cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg gactttccat 780 
tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca tcaagtgtat 840 
catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc ctggcattat 900 
gcccagtaca tgaccttatg ggactttcct actcggcagt acatctacgt attagtcatc 960 
gctattacca tggtgatgcg gttttggcag tacatcaatg ggcgtggata gcggtttgac 1020 
tcacggggat ttccaagtct ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa 1080 
aatcaacggg actttccaaa atgtcgtaac aactccgccc cattgacgca aatgggcggt 114 0 
a99 c 9tgtac ggtgggaggt ctatataagc agagctcgtt tagtgaaccg tcagatcgcc 1200 
tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 1260 
cgtcaccgtc gtcgacacgt gtgatcagat atcgcggccg ctctagaatg cgcgtgaagg 1320 
agaagtacca gcacctgtgg cgctggggct ggcgctgggg caccatgctg ctgggcatgc 1380 
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bgabgabctg cagcgccacc gagaagcbgb gggbgaccgb gbacbacggc gtgcccgtgt 1440 
ggaaggaggc caccaccacc ctgctctgcg ccagcgacgc caaggcctac gacaccgagg 1500 
tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc caggaggtgg 1560 
tgctggtgaa cgtgaccgag aacbbcgaca tgtggaagaa cgacatggtg gagcagatgc 1620 
acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag ctgacccccc 1680 
tgtgcgtgag cctgaagbgc accgacgcta gcaccagctg caacaccagc gtgatcaccc 1740 
aggcctgccc caaggtgagc ttcgagccca bccccaacca ctactgcgcc cccgccggct 1800 
bcgccabccb gaagtgcaag gacaagaagt bcaacggcaa gggcccctgc accaacgtga 1860 
gcaccgtgca gtgcacccac ggcatccgcc ccgtggtgag cacccagctg ctgctgaacg 1920 
gtagcctggc cgaggaggag gbggbgabcc gcagcgctaa cbbcgccgac aacgccaagg 1980 
bgabcabcgb gcagctgaac gagagcgtgg agatcaactg cacccgcgct agcgcccact 2040 
gcaacctgag ccgcgccaag tggaacgaca ccctgaacaa gatcgtgatc aagctgcgcg 2100 
agcagttcgg caacaagacc atcgtgttca agcacagcag cggcggcgac cccgagatcg 2160 
tgacccacag cttcaactgc ggcggcgagt tcttctactg caacagcgct agcctgccct 2220 
gccgcatcaa gcagatcatc aacatgtggc agaaggtggg caaggccatg tacgcccccc 2280 
ccabcagcgg ccagatccgc tgcagcagca acatcaccgg cctgctgctg acccgcgacg 2340 
gcggcaacag caacaacgag agcgagatct bccgbccggg cggcggcgac atgcgcgaca 24 00 
actggcgcag cgagctgtac aagtacaagg tggtgaagat cgagcccctg ggcgtggccc 2460 
ccaccaaggc caagcttacc gtccaggccc gccagctgct gagcggcatc gtgcagcagc 2520 
agaacaacct gctgcgcgcc atcgaggccc agcagcacct gctgcagctg accgtgtggg 2580 
gcatcaagca gcbgcaggcc cgcaccctgg ccgbggagcg ctacctgaag gaccagcagc 2640 
agctcgagca gatctggaac cacaccacct ggatggagtg ggaccgcgag atcaacaact 2700 
acaccagccb gabccacagc ctgatcgagg agagccagaa ccagcaggag aagaacgagc 2760 
aggagctgct ggagctggac aagtgggcca gcctgfcggaa ctggttcaac atcaccaact 2820 
ggctgtggtg aggabccaga tctgctgtgc cttctagttg ccagccatct gbtgtbbgcc 2 880 
cctcccccgt gccbtcctbg accctggaag gbgccactcc cactgtcctt tcctaataaa 2940 
abgaggaaab tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggbgg 3000 
ggcagcacag caagggggag gattgggaag acaatagcag gcatgctggg gatgcggtgg 3060 
gctctatggg bacccagggc cgcataactt cgtataatgt atgctatacg aagttataag 3120 
atctgtactg aaatgtgtgg gcgtggctta agggbgggaa agaatatata aggbgggggb 3180 
cttatgtagt tttgtatctg bbttgcagca gccgccgccg ccatgagcac caactcgttt 3 240 
gatggaagca bbgbgagcbc atatttgaca acgcgcatgc ccccatgggc cggggtgcgt 3300 
cagaatgtga tgggctccag cattgatggt cgccccgtcc tgcccgcaaa ctctactacc 3360 
ttgacctacg agaccgtgtc tggaacgccg ttggagactg cagcctccgc cgccgcttca 3420 
gccgctgcag ccaccgcccg cgggattgtg actgactttg ctbbccbgag cccgcttgca 3480 
agcagbgcag cbbcccgbbc abccgcccgc gatgacaagt tgacggctct tttggcacaa 354 0 
bbggabbcbb tgacccggga acbbaatgbc gtttctcagc agctgttgga bctgcgccag 3600 
caggbbbcbg ccctgaaggc ttcctcccct cccaatgcgg tttaaaacat aaataaaaaa 3660 
ccagacbcbg bbbggabbbg gatcaagcaa gtgtcttgct gtctttattt aggggttttg 3720 
cgcgcgcggt aggcccggga ccagcggtct cggtcgttga gggtcctgtg tattttttcc 3780 
aggacgtggt aaaggtgact ctggatgttc agatacatgg gcataagccc gtctctgggg 3840 
tggaggtagc accactgcag agcttcatgc tgcggggtgg tgttgtagat gatccagtcg 3900 
tagcaggagc gctgggcgtg gtgcctaaaa atgtctttca gbagcaagct gattgccagg 3960 
ggcaggccct tggtgtaagt gtttacaaag cggttaagct gggatgggtg catacgtggg 4 020 
gabatgagat gcabcbtgga cbgbabbbbt aggtbggcba tgbbcccagc catabcccbc 4080 
cggggattca bgbbgbgcag aaccaccagc acagtgbabc cggbgcactb gggaaabtbg 4140 
bcatgbagct bagaaggaaa bgcgbggaag aacbbggaga cgcccbbgbg accbccaaga 42 00 
tbbbccabgc atbcgbccab aabgabggca abgggcccac gggcggcggc cbgggcgaag 42 60 
ababtbcbgg gabcactaac gbcatagbbg bgbtccagga bgagabcgbc ataggccabb 4320 
bbbacaaagc gcgggcggag ggtgccagac bgcggbabaa bggtbccatc cggcccaggg 43 80 
gcgbagbbac ccbcacagab bbgcabbtcc cacgcttbga gbtcagabgg ggggabcabg 4440 
bcbacctgcg gggcgabgaa gaaaacggbt bccggggbag gggagatcag cbgggaagaa 4 500 
agcaggtbcc bgagcagcbg cgacbbaccg cagccggbgg gcccgbaaab cacaccbabb 4560 
accggctgca acbggbagbb aagagagcbg cagcbgccgt catccctgag caggggggcc 4620 
acbbcgbbaa gcabgfccccb gacbcgcatg bbbbcccbga ccaaabccgc cagaaggcgc 4680 
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tcgccgccca gcgatagcag bbcbbgcaag gaagcaaagt ttttcaacgg bbtgagaccg 4740 

bccgccgbag gcatgctttt gagcgbbbga ccaagcagtt ccaggcggtc ccacagctcg 4800 

gtcacctgct ctacggcatc bcgabccagc ababcbccbc gtttcgcggg ttggggcggc 4860 

bbbcgcbgba cggcagtagt cggtgctcgt ccagacgggc cagggtcatg tctttccacg 4920 

ggcgcagggt cctcgtcagc gtagtctggg tcacggbgaa ggggtgcgct ccgggctgcg 4980 

cgctggccag ggbgcgcbtg aggctggtcc tgctggtgct gaagcgcbgc cggtcttcgc 5040 

cctgcgcgtc ggccaggtag cafcttgacca bggbgbcata gtccagcccc tccgcggcgt 5100 

ggcccttggc gcgcagcttg cccttggagg aggcgccgca cgaggggcag tgcagacttt 5160 

tgagggcgta gagcttgggc gcgagaaata ccgattccgg ggagtaggca tccgcgccgc 5220 

aggccccgca gacggtctcg cathccacga gccaggtgag cbcbggccgt tcggggtcaa 5280 

aaaccaggtt tcccccatgc tttttgahgc gbbtcbbacc fcctggtttcc atgagccggt 5340 

gtccacgctc ggbgacgaaa aggctgtccg tgtccccgta tacagacbbg agaggcctgt 5400 

cctcgagcgg tgttccgcgg tcctcctcgt atagaaactc ggaccactct gagacaaagg 5460 

ctcgcgtcca ggccagcacg aaggaggcta agtgggaggg gtagcgghcg ttgtccacta 5520 

gggggtccac tcgctccagg gtgtgaagac acatgtcgcc ctcttcggca tcaaggaagg 5580 

tgabbggbbt gtaggtgtag gccacgtgac cgggbgbbcc tgaagggggg ctataaaagg 5640 

gggbgggggc gcgtbcgbcc tcactctctt ccgcatcgct gbcbgcgagg gccagctgtt 5700 

ggggtgagtc gacgcgaggc tggatggcct bcccGabbab gabbcbbcbc gcttccggcg 5760 

gcabcgggat gcccgcgtbg caggccatgc tgbccaggca ggbagabgac gaccatcagg 582 0 

gacagcbbca aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgbbtb 5880 

tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc 594 0 

gaaacccgac aggactataa agataccagg cgbbbccccc tggaagctcc cbcgtgcgcb 6000 

ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctbbcbcccb tcgggaagcg 6060 

tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gbbcgcbcca 6120 

agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact 6180 

abcgtcbbga gbccaacccg gbaagacacg acttatcgcc actggcagca gccactggta 624 0 

acaggattag cagagcgagg tatgtaggcg gtgctacaga gttctfcgaag tggtggccta 6300 

actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag ccagtbacct 6360 

tcggaaaaag agbbggbagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt 6420 

tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga 6480 

fccttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca 6540 

tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaafcga agttttaaat 6600 

caatctaaag tatatatgag taaacbtggfc ctgacagtba ccaatgcbba abcagbgagg 6660 

caccbabcbc agcgabcfcgt cbabttcgtb catccabagb bgcctgacbc cccgbcgbgb 6720 

agataacbac gatacgggag ggcbtaccab ctggccccag tgcbgcaabg abaccgcgag 6780 

acccacgcbc accggcbcca gabbtatcag caabaaacca gccagccgga agggccgagc 6840 

gcagaagbgg bccbgcaacb btabccgccb ccabccagbc tabbaabbgb tgccgggaag 690 0 

ctagagbaag bagbbcgcca gbbaabagbb bgcgcaacgb bgtbgccabb gcbgcaggca 6960 

tcgbggbgbc acgcbcgtcg btbggbabgg cbbcabbcag cbccggbbcc caacgabcaa 7020 

ggcgagbtac atgabccccc atgtbgtgca aaaaagcggb bagcbccbbc ggbccbccga 7080 

tcgbbgtcag aagbaagbbg gccgcagbgb batcacbcab ggbbabggca gcacbgcaba 714 0 

attcbcbtac tgbcabgcca bccgbaagab gcbbtbcbgb gacbggtgag bacbcaacca 7200 

agbcatbcbg agaabagbgt abgcggcgac cgagbbgcbc tbgcccggcg bcaacacggg 7260 

ataabaccgc gccacabagc agaactbbaa aagbgctcat catbggaaaa cgtbcbbcgg 7320 

ggcgaaaacb cbcaaggabc bbaccgcbgb tgagabccag btcgabgbaa cccactcgbg 73 80 

cacccaacbg atcttcagca bcbttbacbb bcaccagcgt bbctgggbga gcaaaaacag 7440 

gaaggcaaaa bgccgcaaaa aagggaataa gggcgacacg gaaabgtbga abacbcabac 7500 

bcbtccbtbb bcaatabbab bgaagcabbb abcagggbba bbgtcbcatg agcggabaca 7560 

babbbgaabg babbbagaaa aabaaacaaa taggggbbcc gcgcacabtb ccccgaaaag 7620 

bgccacctga cgbcbaagaa accabbabba bcabgacabb aaccbabaaa aabaggcgba 7680 

bcacgaggcc cbbbcgbcbb caagaabbgt t 7711 
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<211> 6577 
<212> DNA 

<213> Artificial Sequence 

* 

<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 50 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 3 00 

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt ttcggcagta 720 

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtfc ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 

tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 

tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 

taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 

tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 

ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 

catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 

gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 

tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 

gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 

ggcatgctga tgatctgcag cgccaccgag gctagcagcc tgaagtgcac cgacctgaag 204.0 

aacgacacca acaccaacag cagcagcggc cgcatgatca tggagaaggg cgagatcaag 2100 

aactgcagct ccaacatcag caccagcatc cgcggcaagg tgcagaagga gtacgccttc 2160 

ttctacaagc tggacatcat ccccatcgac aacgacacca ccagctacag cctgaccagc 2220 

tgcaacacca gcgtgatcac ccaggcctgc cccaaggtga gcttcgagcc catccccaac 2280 

cactactgcg cccccgccgg cttcgccatc ctgaagtgca aggacaagaa gttcaacggc 2340 

aagggcccct gcaccaacgt gagcaccgtg cagtgcaccc acggcatccg ccccgtggtg 24 00 

agcacccagc tgctgctgaa cggtagcctg gccgaggagg aggtggtgat ccgcagcgct 24 60 

aacttcgccg acaacgccaa ggtgatcatc gtgcagctga acgagagcgt ggagatcaac 2520 

tgcacccgcc ccaacaacaa cacccgcaag agcatccaca tcggccccgg ccgcgccttc 2580 

tacaccaccg gcgagatcat cggcgacatc cgccaggccc actgcaacct gagccgcgcc 2640 

aagtggaacg acaccctgaa caagatcgtg atcaagctgc gcgagcagtt cggcaacaag 2700 

accatcgtgt tcaagcacag cagcggcggc gaccccgaga tcgtgaccca cagcttcaac 2760 

tgcggcggcg agttcttcta ctgcaacagc acccagctgt tcaacagcac ctggttcaac 2820 
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agcacctgga gcaccgaggg cagcaacaac 
bgccgcabca agcagatcat caacatgbgg 
cccatcagcg gccagatccg ctgcagcagc 
ggcggcaaca ■ gcaacaacga gagcgagatc 
aacbggcgca gcgagctgta caagtacaag 
cccaccaagg ccaagcttac cgtccaggcc 
cagaacaacc tgctgcgcgc cabcgaggcc 
ggcatcaagc agctgcaggc ccgcaccctg 
cagctcgagc agabctggaa ccacaccacc 
tacaccagcc tgabccacag ccbgabcgag 
caggagctgc tggagctgga caagtgggcc 
tggctgtggt acatcaagct gttcatcatg 
gtgttcgccg tgctgagcat ctgaggabcc 
tctgttgttt gcccctcccc cgtgccttcc 
cbbbccbaab aaaabgagga aattgcatcg 
gggggtgggg tggggcagga cagcaagggg 
ggggabgcgg bgggcbcbab gggtacccag 
gccagaaaga agcaggcaca tccccttctc 
bbagbbccag ccccacbcat aggacactca 
cccgctaaag tacttggagc ggtctctccc 
cctccaagag tgggaagaaa ttaaagcaag 
bgcctccaac abgbgaggaa gtaatgagag 
ttaatcttcc gcttcctcgc tcactgactc 
ggtatcagct cactcaaagg cggtaatacg 
aaagaacatg tgagcaaaag gccagcaaaa 
ggcgtttttc cataggctcc gcccccctga 
gaggtggcga aacccgacag gactataaag 
cgtgcgctct cctgttccga ccctgccgct 
gggaagcgtg gcgcbbbctc atagctcacg 
tcgctccaag ctgggctgtg tgcacgaacc 
cggtaactat cgbcbbgagb ccaacccggt 
cactggtaac aggattagca gagcgaggta 
gtggccbaac tacggctaca ctagaagaac 
agttaccttc ggaaaaagag ttggtagctc 
cggtggtttt tttgtttgca agcagcagat 
tcctttgatc ttttctacgg ggbctgacgc 
bbbggbcabg agattatcaa aaaggabctb 
ttttaaatca atctaaagta batabgagba 
cagtgaggca cctatctcag cgatctgtct 
9999999999 cgctgaggtc tgcctcgtga 
aatcgcccca tcatccagcc agaaagtgag 
ggtggaccag bbggbgabbb tgaacttttg 
aagatgcgtg atctgatcct tcaactcagc 
tcccgtcaag tcagcgtaat gcbcbgccag 
aaaaactcat cgagcatcaa atgaaactgc 
tatttttgaa aaagccgttt ctgtaatgaa 
atggcaagat cctggtatcg gbcbgcgabb 
aatttcccct cgtcaaaaat aaggbbabca 
tccggtgaga atggcaaaag cttatgcatt 
ttacgctcgt catcaaaatc actcgcatca 
bgagcgagac gaaatacgcg atcgctgtta 
aaccggcgca ggaacactgc cagcgcabca 
tctaatacct ggaatgcfcgt tttcccgggg 
ggagtacgga baaaabgcbb gatggtcgga 
ctgaccatct catcbgtaac abcabbggca 



accgagggca gcgacaccat caccctgccc 2880 
cagaaggbgg gcaaggccab gbacgccccc 2 94 0 
aacatcaccg gcctgctgct gacccgcgac 3000 
ttccgtccgg gcggcggcga catgcgcgac 3 060 
gbggbgaaga tcgagcccct gggcgtggcc 3120 
cgccagctgc tgagcggcat cgtgcagcag 3180 
cagcagcacc tgctgcagct gaccgtgtgg 3240 
gccgtggagc gcbacctgaa ggaccagcag 3300 
bggabggagb gggaccgcga gabcaacaac 3360 
gagagccaga accagcacga gaagaacgag 3420 
agccbgbgga acbggttcaa catcaccaac 3480 
atcgbgggcg gccbggtggg ccbgcgcabc 3540 
agatcbgcbg bgccttctag tbgccagcca 3 600 
bbgacccbgg aaggbgccac bcccacbgbc 3660 
cabbgbcbga gtaggbgtca ttcbabtcbg 3720 
gaggabbggg aagacaabag caggcatgcb 3780 
gtgcbgaaga abtgacccgg ttcctccbgg 3840 
bgtgacacac ccbgtccacg ccccbggbtc 3900 
tagcbcagga gggcbccgcc tbcaabccca 3 960 
bcccbcabca gcccaccaaa ccaaaccbag 4020 
abaggctabb aagbgcagag ggagagaaaa 4080 
aaabcabaga abbbbaaggc cabcatggcc 4140 
gcbgcgcbcg gbcgttcggc tgcggcgagc 4200 
gtbabccaca gaabcagggg abaacgcagg 4260 
ggccaggaac cgtaaaaagg ccgcgtbgcb 4320 
cgagcabcac aaaaabcgac gcbcaagbca 4380 
ataccaggcg bbbccccctg gaagctcccb 4440 
taccggabac cbgbccgccb ttctcccbbc 4500 
cbgbaggbab cbcagbbcgg bgbaggtcgb 4560 
ccccgtbcag cccgaccgcb gcgccbbabc 4620 
aagacacgac bbabcgccac tggcagcagc 4680 
bgbaggcggb gcbacagagt tctbgaagbg 474 0 
agbabbbggb abcbgcgctc tgcbgaagcc 4 800 
bbgabccggc aaacaaacca ccgcbggbag 4860 
bacgcgcaga aaaaaaggab cbcaagaaga 4 92 0 
bcagbggaac gaaaacbcac gtbaagggab 4980 
caccbagabc cbbbbaaabt aaaaabgaag 5040 
aacbbggbcb gacagbbacc aabgcbbaab 5100 
abbbcgbtca bccabagbtg ccbgacbcgg 5160 
agaaggbgbb gcbgactcab accaggccbg 5220 
ggagccacgg bbgabgagag cbbbgbbgba 5280 
cbbbgccacg gaacggbcbg cgbbgtcggg 5340 
aaaagbbcga btbabbcaac aaagccgccg 5400 
bgbbacaacc aabbaaccaa ttcbgatbag 5460 
aabbbabbca batcaggatb atcaabacca 5520 
ggagaaaact caccgaggca gtbccabagg 558 0 
ccgacbcgbc caacatcaab acaaccbabb 5640 
agbgagaaab caccabgagb gacgacbgaa 5700 
bcbbbccaga cbbgbbcaac aggccagcca 5760 
accaaaccgb babbcabbcg bgatbgcgcc 5820 
aaaggacaab bacaaacagg aabcgaabgc 5880 
acaababbbb caccbgaabc aggababbcb 5940 
abcgcagbgg bgagbaacca bgcabcabca 6000 
agaggcabaa abbccgbcag ccagbbbagb 6060 
acgcbaccbb bgccabgbbb cagaaacaac 6120 



- 120 



WO 02/32943 



PCT/US01/25721 



tctggcgcat cgggcttccc atacaatcga tagattgtcg cacctgattg cccgacatta 6180 

tcgcgagccc atttataccc atataaatca gcatccatgt tggaatttaa tcgcggcctc 6240 

gagcaagacg tttcccgttg aatatggctc ataacacccc ttgtattact gtttatgtaa 6300 

gcagacagtt ttattgttca tgatgatata tttttatctt gtgcaatgta acatcagaga 6360 

ttttgagaca caacgtggct ttcccccccc ccccattatt gaagcattta tcagggttat 6420 

tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 6480 

cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta 6540 

acctataaaa ataggcgtat cacgaggccc tttcgtc 6577 



<210> 51 
<211> 6577 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 

» 

<400> 51 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgcatgttcc 480 
catagtaacg ccaataggga cfcttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 12 00 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 2040 
cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
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accgaggbgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtgaacgt gaccgagaac ttcgacatgt ggaagaacga catggtggag 2220 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 2280 
acccccctgt gcgbgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcabgatcab ggagaagggc gagatcaaga actgcagctt caacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag tacgccttct tcbacaagcb ggacatcabc 2460 
cccatcgaca acgacaccac cagctacagc cbgaccagct gcaacaccgc tagcatcaac 2520 
tgcacccgcc ccaacaacaa cacccgcaag agcatccaca bcggccccgg ccgcgccttc 2580 
tacaccaccg gcgagabcab cggcgacatc cgccaggccc actgcaacct gagccgcgcc 2640 
aagtggaacg acaccctgaa caagatcgtg atcaagctgc gcgagcagtt cggcaacaag 2700 
accatcgtgt tcaagcacag cagcggcggc gaccccgaga tcgtgaccca cagcttcaac 2760 
tgcggcggcg agttcttcta ctgcaacagc acccagctgt tcaacagcac ctggttcaac 2820 
agcacctgga gcaccgaggg cagcaacaac accgagggca gcgacaccat caccctgccc 2880 
bgccgcabca agcagatcat caacatgtgg cagaaggtgg gcaaggccat gtacgccccc 2940 
cccatcagcg gccagatccg ctgcagcagc aacatcaccg gccbgcbgcb gacccgcgac 3000 
ggcggcaaca gcaacaacga gagcgagatc tbccgbccgg gcggcggcga cabgcgcgac 3060 
aactggcgca gcgagctgta caagbacaag gtggtgaaga tcgagcccct gggcgtggcc 3120 
cccaccaagg ccaagcttac cgtccaggcc cgccagctgc tgagcggcat cgtgcagcag 3180 
cagaacaacc tgctgcgcgc catcgaggcc cagcagcacc tgctgcagct gaccgtgtgg 3240 
ggcatcaagc agctgcaggc ccgcaccctg gccgtggagc gctacctgaa ggaccagcag 3300 
cagctcgagc agabcbggaa ccacaccacc bggabggagb gggaccgcga gatcaacaac 3360 
tacaccagcc tgatccacag cctgatcgag gagagccaga accagcacga gaagaacgag 3420 
caggagctgc bggagcbgga caagtgggcc agcctgtgga acbggbbcaa catcaccaac 34 80 
bggcbgbggb acabcaagcb gttcabcabg atcgbgggcg gcctggtggg cctgcgcatc 3540 
gtgttcgccg tgctgagcat cbgaggabcc agabcbgcbg bgccbbctag ttgccagcca 3600 
tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg aaggbgccac tcccactgtc 3660 
cbtbcctaab aaaatgagga aattgcatcg cabbgbctga gbaggtgbca ttctattctg 3720 
gggggtgggg tggggcagga cagcaagggg gaggabtggg aagacaatag caggcatgct 3780 
ggggatgcgg tgggctctat gggtacccag gtgctgaaga attgacccgg ttcctcctgg 3 84 0 
gccagaaaga agcaggcaca tccccttctc tgtgacacac cctgtccacg cccctggttc 3900 
ttagttccag ccccactcat aggacactca tagctcagga gggctccgcc ttcaatccca 3960 
cccgctaaag tacttggagc ggtcbctccc tccctcatca gcccaccaaa ccaaacchag 4020 
ccbccaagag tgggaagaaa ttaaagcaag ataggctatt aagtgcagag ggagagaaaa 4080 
tgcctccaac abgbgaggaa gtaatgagag aaatcataga attttaaggc catcatggcc 4140 
bbaabctbcc gcttcctcgc tcacbgacbc gcbgcgcbcg gtcgttcggc tgcggcgagc 4200 
ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg 4260 
aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct 4320 
ggcgtttttc cataggctcc gcccccctga cgagcabcac aaaaatcgac gctcaagtca 43B0 
gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct 4440 
cgtgcgchct cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc 4500 
gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt 4560 
tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc 4620 
cggtaacfcat cghcttgagt ccaacccggt aagacacgac ttatcgccac bggcagcagc 4 680 
cacfcggtaac aggatbagca gagcgaggfca tgtaggcggt gctacagagt tcttgaagtg 4740 
gtggcctaac tacggctaca ctagaagaac agtatttggt atctgcgctc tgctgaagcc 4 BOO 
agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag 4860 
cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga 4920 
tcctttgatc tfcttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat 4980 
tttggtcatg agattatcaa aaaggatctt caccbagatc cbbttaaabt aaaaabgaag 5040 
bttbaaabca abcbaaagba bababgagba aacbbggbct gacagtbacc aabgcbbaat 5100 
cagbgaggca ccbabctcag cgatctgbcb atbtcgtbca bccabagtbg ccbgacbcgg 5160 
gggggggggg cgcbgaggtc bgcctcgbga agaaggbgbb gcbgactcab accaggccbg 522 0 
aabcgcccca bcabccagcc agaaagbgag ggagccacgg bbgabgagag cbbbgtbgba 5280 
ggbggaccag btggbgabtb bgaacbbbbg cbbtgccacg gaacggbcbg cgbtgbcggg 5340 
aagatgcgbg abcbgabccb tcaactcagc aaaagbtcga ttbatbcaac aaagccgccg 5400 
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tcccgtcaag tcagcgtaat gctctgccag tgttacaacc aattaaccaa tbcbgabbag 5460 
aaaaactcat cgagcatcaa atgaaactgc aatttattca babcaggabb abcaabacca 5520 
babbbtbgaa aaagccgttt ctgtaatgaa ggagaaaact caccgaggca gttccatagg 5580 
atggcaagat cctggbabcg gtctgcgatt ccgactcgtc caacatcaat acaacctatt 5640 
aatttccccb cgtcaaaaat aaggbbabca agtgagaaat caccatgagt gacgactgaa 5700 
bccggbgaga atggcaaaag cttatgcatt bcbbtccaga cbbgbbcaac aggccagcca 5760 
bbacgcbcgb catcaaaabc acbcgcabca accaaaccgt babtcabbcg tgabtgcgcc 5B20 
bgagcgagac gaaatacgcg atcgcbgbta aaaggacaab tacaaacagg aatcgaabgc 5880 
aaccggcgca ggaacactgc cagcgcabca acaabatbtb cacctgaabc aggababbct 5940 
tcbaataccb ggaatgctgb btbcccgggg abcgcagbgg bgagbaacca bgcabcabca 6000 
ggagbacgga baaaabgcbt gabggbcgga agaggcataa abbccgbcag ccagtbbagt 6060 
cbgaccatcb cabcbgbaac atcabbggca acgctaccbt bgccabgbtb cagaaacaac 6120 
bcbggcgcab cgggcbbccc abacaabcga bagatbgbcg caccbgabtg cccgacatba 6180 
bcgcgagccc abbbabaccc abataaabca gcatccabgt tggaabtbaa bcgcggccbc 6240 
gagcaagacg bbbcccgbbg aabatggctc abaacacccc tbgtatbact gbtbabgbaa 6300 
gcagacagbt btabtgttca bgabgabata btbbtabcbt gbgcaabgta acabcagaga 6360 
bbbbgagaca caacgbggcb bbcccccccc ccccabbabb gaagcabbba bcagggbbab 6420 
bgbctcabga gcggabacab abbbgaabgt abbbagaaaa abaaacaaab aggggbbccg 6480 
cgcacabbbc cccgaaaagb gccaccbgac gtcbaagaaa ccatbatbat cabgacabba 6540 
accbabaaaa abaggcgbab cacgaggccc btbcgbc 6577 



<210> 52 
<211> 6718' 
<212> DNA 

<213> Arbificiai Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 52 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggtg 120 
bbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagatbgba ctgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagat gcgbaaggag aaaabaccgc atcagabbgg 240 
cbabbggcca bbgcatacgb bgtabccaba bcabaababg bacabbtaba bbggcbcabg 300 
bccaacabba ccgccabgbb gacabbgatb abtgacbagb babbaabagb aabcaabbac 360 
ggggbcabba gbbcatagcc cabababgga gbbccgcgbb acataacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaabaabga cgbabgtbcc 480 
cabagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagbabb bacggbaaac 54 0 
bgcccacbbg gcagbacabc aagbgbatca babgccaagb acgcccccba bbgacgbcaa 600 
tgacggbaaa bggcccgccb ggcabbatgc ccagbacabg accbbabggg acbbbccbac 660 
bbggcagbac abcbacgbab tagbcabcgc babtaccabg gbgabgcggb bbbggcagba 720 
cabcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagbcbcc accccabbga 780 
cgbcaabggg agbtbgbbbb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 84 0 
cbccgcccca bbgacgcaaa bgggcggtag gcgtgbacgg bgggaggbcb ababaagcag 90 0 
agcbcgtbba gbgaaccgbc agabcgccbg gagacgccab ccacgcbgbb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg* cggccgggaa cggtgcabbg gaacgcggab 1020 
bccccgtgcc aagagbgacg taagbaccgc cbatagacbc bataggcaca ccccbbbggc 1080 
bcbbabgcab gcbabacbgb bbtbggcbbg gggccbabac acccccgcbb ccbbabgcba 1140 
baggbgabgg babagcbbag ccbabaggbg bgggbbabbg accabbatbg accacbcccc 1200 
babbggbgac gabacbbbcc abtacbaabc cabaacatgg cbcbbbgcca caacbabcbc 1260 
tabbggcbafc abgccaabac bctgtccbbc agagacbgac acggactcbg babbbbbaca 1320 
ggatggggbc ccabbtabba bbtacaaabb cacababaca acaacgccgb cccccgbgcc 1380 
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cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gbgbbccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agcccbggbc ccatgccbcc 1500 
agcggcbcab ggtcgctcgg cagcbccbbg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg bggcggbagg gbabgbgbcb 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgbb gbabtcbgab aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gbagbcbgag cagbacbcgb tgctgccgcg 1800 
cgcgccacca gacabaabag cbgacagacb aacagactgt bccbbbccab gggtctfcttc 1860 
bgcagbcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gbgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga bgabcbgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 2040 
cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgbgbg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggbggbgc bggbgaacgb gaccgagaac ttcgacabgt ggaagaacga cabggbggag 2220 
cagabgcacg aggacabcab cagccbgbgg gaccagagcc bgaagcccbg cgbgaagcbg 2280 
accccccbgb gcgbgagcct gaagbgcacc gaccbgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcabgabcab ggagaagggc gagabcaaga acbgcagcbb caacabcagc 24 00 
accagcabcc gcggcaaggb gcagaaggag bacgccbbcb bcbacaagct ggacabcatc 24 60 
cccabcgaca acgacaccac cagcbacagc cbgaccagct gcaacaccag cgbgabcacc 2520 
caggccbgcc ccaaggtgag cbbcgagccc atccccaacc acbacbgcgc ccccgccggc 2580 
bbcgccabcc bgaagtgcaa ggacaagaag bbcaacggca agggccccbg caccaacgbg 264 0 
agcaccgbgc agtgcaccca cggcabccgc cccgbggtga gcacccagct gctgctgaac 2700 
ggbagcctgg ccgaggagga ggbggbgatc cgcagcgcba acbbcgccga caacgccaag 2760 
gbgabcatcg bgcagcbgaa cgagagcgbg gagabcaacb gcacccgccc caacaacaac 2820 
acccgcaaga gcatccacab cggccccggc cgcgccbbct acaccaccgg cgagatcatc 2880 
ggcgacabcc gccaggccca cbgcaaccbg gcbagcbbcb acbgcaacag cacccagcbg 2940 
bbcaacagca ccbggbbcaa cagcacctgg agcaccgagg gcagcaacaa caccgagggc 3000 
agcgacacca bcacccbgcc cbgccgcatc aagcagatca bcaacabgbg gcagaaggtg 3060 
ggcaaggcca bgtacgcccc ccccabcagc ggccagabcc gcbgcagcag caacabcacc 3120 
ggccbgcbgc tgacccgcga cggcggcaac agcaacaacg agagcgagab cttccgtccg 3180 
ggcggcggcg acabgcgcga caacbggcgc agcgagcbgb acaagbacaa ggbggbgaag 324 0 
abcgagcccc bgggcgbggc ccccaccaag gccaagctba ccgbccaggc ccgccagcbg 3300 
cbgagcggca bcgbgcagca gcagaacaac cbgctgcgcg ccabcgaggc ccagcagcac 3360 
cbgcbgcagc bgaccgbgbg gggcabcaag cagctgcagg cccgcacccb ggccgbggag 3420 
cgctaccbga aggaccagca gcagcbcgag cagabcbgga accacaccac cbggabggag 3480 
bgggaccgcg agatcaacaa cbacaccagc cbgatccaca gccbgabcga ggagagccag 3540 
aaccagcacg agaagaacga gcaggagctg cbggagcbgg acaagbgggc cagcctgbgg 3 600 
aacbggbbca acabcaccaa cbggctgbgg bacabcaagc tgttcabcab gabcgbgggc 3660 
ggccbggbgg gccbgcgcab cgbgbtcgcc gbgcbgagca bcbgaggabc cagabctgcb 3 72 0 
gtgccbbcba gbbgccagcc abcbgbbgbb bgccccbccc ccgbgcctbc cbtgacccbg 3780 
gaaggbgcca cbcccactgt cctbbccbaa baaaatgagg aaabbgcabc gcabtgbctg 3840 
agbaggtgtc abbcbabbct ggggggbggg gbggggcagg acagcaaggg ggaggabbgg 3900 
gaagacaaba gcaggcatgc bggggabgcg gbgggcbcba bgggbaccca ggbgcbgaag 3960 
aabbgacccg gbbccbccbg ggccagaaag aagcaggcac atcccctbcb cbgbgacaca 4 020 
ccctgbccac gccccbggbb cbtagbbcca gccccacbca baggacacbc abagcbcagg 4080 
agggcbccgc cbbcaabccc acccgcbaaa gbacbtggag cggbcbcbcc ctcccbcabc 4140 
agcccaccaa accaaaccba gccbccaaga gbgggaagaa abbaaagcaa gabaggcbab 4200 
baagtgcaga gggagagaaa abgccbccaa cabgbgagga agbaabgaga gaaabcabag 4260 
aabbbbaagg ccabcabggc cbbaatcbbc cgcbtccbcg cbcacbgacb cgcbgcgcbc 4320 
ggbcgbbcgg cbgcggcgag cggbabcagc bcacbcaaag gcggtaabac ggbtabccac 4380 
agaabcaggg gabaacgcag gaaagaacab gbgagcaaaa ggccagcaaa aggccaggaa 4440 
ccgtaaaaag gccgcgbbgc bggcgbtbbb ccabaggctc cgccccccbg acgagcabca 4500 
caaaaabcga cgctcaagbc agaggbggcg aaacccgaca ggacbabaaa gabaccaggc 4560 
gbbbcccccb ggaagcbccc bcgbgcgcbc bcctgbbccg acccbgccgc bbaccggata 4620 
ccbgbccgcc bbbcbcccbb cgggaagcgt ggcgcbbbcb cabagcbcac gcbgbaggba 4680 
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tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 4740 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 4800 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 4860 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac achagaagaa cagtathtgg 4920 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggbagcb cttgatccgg 4980 
caaacaaacc accgctggfca gcggtggttt ttttgtttgc aagcagcaga tbacgcgcag 5040 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggbctgacg ctcagtggaa 5100 
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 5160 
ccttttaaat taaaaabgaa gttttaaatc aatctaaagt atatatgagt aaacthggbc 5220 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tabttcgttc 5280 
atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg aagaaggbgb 5340 
tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga gggagccacg 5400 
gttgatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt gctttgccac 5460 
ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag caaaagttcg 5520 
atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca gtgttacaac 5580 
caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg caatttattc 5640 
atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga aggagaaaac 5700 
tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat tccgactcgt 5760 
ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc aagtgagaaa 5820 
tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat ttctttccag 5880 
acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc aaccaaaccg 5940 
ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt aaaaggacaa 6000 
ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc aacaatattt 6060 
tcacctgaat caggatattc ttctaatacc tggaatgcbg tbtbcccggg gatcgcagtg 6120 
gbgagtaacc atgcatcabc aggagbacgg abaaaatgcb tgabggbcgg aagaggcata 6180 
aatbccgtca gccagbbtag tcbgaccatc bcabctgtaa cabcabbggc aacgctacct 6240 
btgccabgbb tcagaaacaa ctcbggcgca bcgggcbtcc catacaatcg abagattgbc 6300 
gcacctgatt gcccgacabt atcgcgagcc cattbatacc cababaaabc agcatccabg 6360 
ttggaabtba atcgcggcct cgagcaagac gbttcccgtb gaababggcb cabaacaccc 6420 
cttgtabbac bgtbbatgba agcagacagt tbtatbgbtc abgabgabat atbbbtabct 6480 
bgbgcaabgt aacabcagag attbbgagac acaacgtggc bbtccccccc cccccatbat 6540 
bgaagcattb abcagggbba tbgtctcatg agcggabaca tabbbgaabg babtbagaaa 6600 
aabaaacaaa baggggttcc gcgcacabtb ccccgaaaag tgccaccbga cgbcbaagaa 6660 
accatbabta bcabgacabt aaccbabaaa aabaggcgba tcacgaggcc ctbbcgtc 6718 



<210>. 53 
<211> 6742 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 53 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gbaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
bccaacatta ccgccatgtt gacattgatt atbgacbagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg bcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
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tgcccacttg gcagbacabc aagtgbabca batgccaagt acgcccccta ttgacgtcaa 600 
fcgacggtaaa bggcccgccb ggcattatgc ccagtacatg accbbabggg actttcctac 660 
bbggcagbac atctacgtat tagbcabcgc tathaccatg gtgatgcggt tttggcagta 720 
catcaafcggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agbtbgbbbb ggcaccaaaa tcaacgggac btbccaaaab gbcgbaacaa 840 
cbccgcccca ttgacgcaaa bgggcggtag gcgtgtacgg fcgggaggbct atabaagcag 900 
agctcgttba gbgaaccgtc agabcgcctg gagacgccab ccacgctgtt tbgaccbcca 960 
tagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggab 1020 
tccccgbgcc aagagtgacg baagtaccgc cbabagacbc tataggcaca ccccbbtggc 1080 
tcttatgcab gcbabacbgb bttbggcttg gggcctatac acccccgcbt ccttabgcba 1140 
taggtgabgg babagcbbag cctataggbg bgggbbatbg accabbabbg accacbcccc 1200 
babbggtgac gatacbbtcc abtacbaatc cabaacatgg ctcbtbgcca caacbatcbc 1260 
battggcbab atgccaatac tcbgtcctbc agagactgac acggacbctg tatbbbbaca 1320 
ggatggggtc ccabbfcatta bbtacaaatb cacatabaca acaacgccgb cccccgbgcc 1380 
cgcagbbttb atbaaacata gcgtgggabc tccacgcgaa bcbcgggtac gbgtbccgga 1440 
cabgggcbct tctccggtag cggcggagcb bccacabccg agccctggtc ccabgccbcc 1500 
agcggctcab ggtcgcbcgg cagcbccbbg ctcctaacag tggaggccag actbaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggbagg gbatgbgbcb 1620 
gaaaabgagc gtggagabbg ggctcgcacg gctgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagtbgbt gbatbcbgab aagagtcaga ggtaacbccc 1740. 
gbbgcggbgc bgtbaacggt ggagggcagb gtagtctgag cagtactcgt tgcbgccgcg 1800 
cgcgccacca gacataabag cbgacagact aacagactgb bccbttccab gggbcbtbtc 1860 
bgcagtcacc gtcgbcgaca cgbgtgabca gabatcgcgg ccgcbctaga caccabgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac cabgcbgcbg 1980 
ggcatgcbga tgatcbgcag cgccaccgag aagcbgbggg bgaccgbgba ctacggcgbg 2040 
cccgbgtgga aggaggccac caccacccbg ctcbgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gccbgcgtgc ccaccgaccc caacccccag 2160 
gaggbggbgc tggtgaacgb gaccgagaac bbcgacabgb ggaagaacga catggtggag 222 0 
cagatgcacg aggacabcab cagcctgtgg gaccagagcc tgaagcccbg cgbgaagctg 2280 
acccccctgb gcgbgagcct gaagbgcacc gaccbgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcab ggagaagggc gagatcaaga acbgcagctb caacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag bacgcctbcb bcbacaagct ggacatcatc 2460 
cccabcgaca acgacaccac cagctacagc cbgaccagcb gcaacaccag cgbgatcacc 2520 
caggcctgcc ccaaggbgag cbtcgagccc atccccaacc acbacbgcgc ccccgccggc 2580 
tbcgccabcc tgaagbgcaa ggacaagaag btcaacggca agggccccbg caccaacgbg 2 64 0 
agcaccgbgc agbgcaccca cggcabccgc cccgbggfcga gcacccagcb gctgctgaac 2700 
ggtagccbgg ccgaggagga ggbggtgatc cgcagcgcba acbbcgccga caacgccaag 2760 
gbgatcatcg tgcagcbgaa cgagagcgbg gagabcaacb gcacccgccc caacaacaac 282 0 
acccgcaaga gcabccacab cggccccggc cgcgccttcb acaccaccgg cgagabcabc 2880 
ggcgacatcc gccaggccca ctgcaacctg agccgcgcca agbggaacga caccctgaac 2 94 0 
aagatcgtga tcaagcbgcg cgagcagttc ggcaacaaga ccatcgbgbb caagcacagc 3000 
agcggcggcg accccgagat cgbgacccac agcbtcaact gcggcggcga gbbcttcbac 3060 
tgcaacagca cccagcbgtb caacagcacc bggtbcaaca gcaccbggag caccgagggc 312 0 
agcaacaaca ccgagggcag cgacaccabc accctgccct gccgcabcgc bagcagcaac 3180 
aacgagagcg agabcbbccg bccgggcggc ggcgacabgc gcgacaacbg gcgcagcgag 324 0 
cbgbacaagb acaaggbggb gaagatcgag cccctgggcg tggcccccac caaggccaag 3300 
cttaccgtcc aggcccgcca gcbgcbgagc ggcatcgtgc agcagcagaa caaccbgctg 3360 
cgcgccatcg aggcccagca gcacctgctg cagcbgaccg tgbggggcab caagcagcbg 342 0 
caggcccgca cccbggccgt ggagcgcbac ctgaaggacc agcagcagcb cgagcagabc 34 8 0 
bggaaccaca ccaccbggat ggagbgggac cgcgagabca acaactacac cagcctgabc 354 0 
cacagcctga bcgaggagag ccagaaccag cacgagaaga acgagcagga gcbgctggag 3600 
cbggacaagb gggccagcct gbggaacbgg btcaacabca ccaacbggcb gtggbacatc 3660 
aagcbgttca tcabgabcgt gggcggccbg gbgggcctgc gcabcgbgbb cgccgbgctg 3720 
agcatcbgag gabccagabc tgctgtgccb bctagtbgcc agccabcbgb bgbbbgcccc 3780 
bcccccgtgc cfctccbbgac ccbggaaggb gccacbccca cbgbccbtbc ctaabaaaat 3840 
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gaggaaattg catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg 3 900 
caggacagca agggggagga htgggaagac aatagcaggc atgctgggga tgcggtgggc 3960 
tctatgggta cccaggtgct gaagaattga cccggttcct cctgggccag aaagaagcag 4 020 
gcacatcccc bbcbcbgbga cacacccbgt ccacgcccct ggbbcbtagt hccagcccca 4080 
cbcabaggac acbcatagct caggagggcb ccgccttcaa bcccacccgc baaagbactb 4140 
ggagcggbcb ctccctccct catcagccca ccaaaccaaa cctagcctcc aagagtggga 4200 
agaaattaaa gcaagatagg cbatbaagbg cagagggaga gaaaabgccb ccaacabgbg 4260 
aggaagtaat gagagaaatc atagaattbb aaggccatca bggccttaab cttccgcttc 4320 
cfccgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagcfccactc 4380 
aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc 4440 
aaaaggccag caaaaggcca ggaaccgbaa aaaggccgcg bbgctggcgb ttttccatag 4500 
gctccgcccc cctgacgagc abcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 4560 
gacaggacha taaagatacc aggcgbbbcc ccctggaagc tccctcgtgc gctctcctgt 4620 
bccgacccbg ccgcttaccg gabacctgbc cgccttfcctc ccbbcgggaa gcgtggcgct 4680 
btcbcabagc tcacgctgta gghatctcag bbcggbgbag gtcgttcgct ccaagctggg 4740 
ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttabccggta actatcgtct 4800 
tgagtccaac ccggtaagac acgacbbabc gccactggca gcagccactg gbaacaggat 4860 
tagcagagcg aggtatgtag gcggtgctac agagttcttg aagbggbggc ctaactacgg 4 920 
cbacacbaga agaacagtat btggtabebg cgctctgctg aagccagtta ccttcggaaa 4980 
aagagbbggb agctcttgat ccggcaaaca aaccaccgct ggbagcggtg gtbbbbbbgb 5040 
tbgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 5100 
tac 99ggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg bcatgagabb 5160 
atcaaaaagg atcbbcaccb agatcctttt aaattaaaaa tgaagtttta aabcaabcba 5220 
aagfcatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat 5280 
ctcagcgatc bgtctatbbc gttcatccat agttgcctga ctcggggggg gggggcgctg 5340 
aggtctgcct cgtgaagaag gtgttgctga ctcataccag gcctgaatcg ccccatcatc 5400 
cagccagaaa gtgagggagc cacggttgat gagagctttg ttgtaggtgg accagttggt 54 60 
gattttgaac ttttgctttg ccacggaacg gtctgcgttg tcgggaagat gcgtgatctg 5520 
atccttcaac tcagcaaaag thcgatttat tcaacaaagc cgccgtcccg tcaagtcagc 5580 
gtaatgctct gccagfcghta caaccaatta accaattctg attagaaaaa ctcatcgagc 5640 
atcaaatgaa actgcaattt attcatatca ggabfcafccaa taccatatfct ttgaaaaagc 5700 
cgtttctgta atgaaggaga aaactcaccg aggcagttcc ataggatggc aagatcctgg 5760 
tatcggtctg cgattccgac tcgtccaaca tcaatacaac ctattaattt cccctcgtca 5820 
aaaataaggfc tatcaagtga gaaatcacca tgagtgacga. ctgaatccgg tgagaatggc 5880 
aaaagcttat gcatttcttt ccagacttgt tcaacaggcc agccattacg ctcgtcafcca 5940 
aaatcactcg catcaaccaa accgttattc attcgtgatt gcgcctgagc gagacgaaat 6000 
acgcgatcgc tgttaaaagg acaattacaa acaggaatcg aatgcaaccg gcgcaggaac 6060 
actgccagcg catcaacaat attttcacct gaatcaggat attcttctaa tacctggaat 6120 
gctgttttcc cggggatcgc agtggtgagt aaccatgcat catcaggagt acggataaaa 6180 
tgcttgahgg tcggaagagg cataaattcc gtcagccagt ttagtctgac catctcatct 6240 
gtaacatcat tggcaacgct acctttgcca tgtttcagaa acaactctgg cgcatcgggc 6300 
btcccataca atcgatagat tgtcgcacct gattgcccga cattatcgcg agcccattta 6360 
tacccatata aatcagcatc catgttggaa tttaatcgcg gcctcgagca agacgtttcc 6420 
cgttgaatat ggctcataac accccttgta tbactgttta bgbaagcaga cagbttbabb 6480 
gbtcabgabg atababbbbb abcbbgbgca abgbaacabc agagabbbbg agacacaacg 6540 
bggcbtbccc ccccccccca bbabbgaagc abbbabcagg gbbatbgtcb cabgagcgga 6600 
bacababtbg aabgbabbba gaaaaabaaa caaabagggg bbccgcgcac abbbccccga 6660 
aaagbgccac ctgacgbcba agaaaccatt atbatcabga cattaaccba baaaaabagg 6720 
cgtabcacga ggcccbbbcg be 6742 



<210> 54 
<211> 6781 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 54 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 144 0 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 2040 
cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtgaacgt gaccgagaac ttcgacatgt ggaagaacga catggtggag 2220 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 2280 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcat ggagaagggc gagatcaaga actgcagctt caacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag tacgccttct tctacaagct ggacatcatc 2460 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgtgatcacc 2520 
caggcctgcc ccaaggtgag cttcgagccc atccccaacc actactgcgc ccccgccggc 2580 
ttcgccatcc tgaagtgcaa ggacaagaag ttcaacggca agggcccctg caccaacgtg 2640 
agcaccgtgc agtgcaccca cggcatccgc cccgtggtga gcacccagct gctgctgaac 2700 
ggfcagcctgg ccgaggagga ggtggtgatc cgcagcgcta acttcgccga caacgccaag 2760 
gtgatcatcg tgcagctgaa cgagagcgtg gagatcaact gcacccgccc caacaacaac 2820 
acccgcaaga gcatccacat cggccccggc cgcgccttct acaccaccgg cgagatcatc 2880 
ggcgacatcc gccaggccca ctgcaacctg agccgcgcca agfcggaacga caccctgaac 2940 
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aagatcgtga tcaagctgcg cgagcagttc ggcaacaaga ccatcgtgtt caagcacagc 3000 
agcggcggcg accccgagat cgtgacccac agcbtcaacb gcggcggcga gbbcbbcbac 3060 
tgcaacagca cccagctgtt caacagcacc bggbbcaaca gcacctggag caccgagggc 3120 
agcaacaaca ccgagggcag cgacaccatc accctgccct gccgcatcaa gcagatcatc 3180 
aacabgbggc agaaggtggg caaggccatg bacgcccccc ccatcagcgg ccagatccgc 3240 
tgcagcagca acatcaccgg cctgctgctg acccgcgacg gcggcaacag caacaacgag 33 00 
agcgagatct bccgbccggg cggcggcgct agcgccaagc ttaccgtcca ggcccgccag 3360 
ctgctgagcg gcatcgtgca gcagcagaac aaccbgctgc gcgccatcga ggcccagcag 3420 
cacctgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcac ccbggccgbg 34 80 
gagcgctacc bgaaggacca gcagcagctc gagcagatct ggaaccacac caccbggabg 3540 
gagtgggacc gcgagabcaa caactacacc agccbgatcc acagcctgat cgaggagagc 3600 
cagaaccagc acgagaagaa cgagcaggag ctgctggagc tggacaagtg ggccagcctg 3660 
bggaacbggb tcaacatcac caactggctg bggbacabca agctgbbcab catgatcgtg 3720 
ggcggcctgg bgggccbgcg catcgtgttc gccgbgcbga gcatctgagg atccagatct 3780 
gctgtgcctt ctagttgcca gccabcbgbb gtttgcccct cccccgtgcc ttccttgacc 3840 
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3900 
cbgagbaggb gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 3960 
tgggaagaca atagcaggca bgcbggggab gcgghgggct ctatgggtac ccaggbgcbg 4020 
aagaattgac ccggttcctc cbgggccaga aagaagcagg cacatcccct tctctgtgac 40 80 
acaccctgtc cacgcccctg gttcttagtt ccagccccac tcataggaca cbcabagcbc 4140 
aggagggctc cgccttcaat cccacccgct aaagtacttg gagcggtctc tccctccctc 4200 
atcagcccac caaaccaaac ctagcctcca agagtgggaa gaaattaaag caagataggc 42 60 
tabbaagtgc agagggagag aaaatgcctc caacatgtga ggaagtaatg agagaaatca 43 2 0 
tagaatttta aggccatcat ggccbbaabc btccgctbcc tcgctcactg actcgctgcg 43 80 
cbcggbcgbb cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc 444 0 
cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag 4500 
gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca 4560 
tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagabacca 4 620 
ggcgtttccc ccbggaagcb ccctcgtgcg cbcbcctgbb ccgaccctgc cgcbbaccgg 4 680 
atacctgtcc gcctttctcc cbbcgggaag cgtggcgcbb tctcatagct cacgctgtag 4740 
gbabcbcagb tcggtgtagg tcgbtcgctc caagcbgggc tgbgbgcacg aaccccccgb 4800 
bcagcccgac cgcbgcgccb babccggbaa cbabcgtcbt gagbccaacc cggbaagaca 4 860 
cgacbbabcg ccacbggcag cagccacbgg baacaggabt agcagagcga ggbabgbagg 4920 
cggbgcbaca gagbtcbbga agtggbggcc baacbacggc tacactagaa gaacagbabb 4980 
bggbabcbgc gctcbgctga agccagtbac cbtcggaaaa agagbtggba gctcbbgabc 5040 
cggcaaacaa accaccgcbg gbagcggbgg bbbbbbtgbt tgcaagcagc agabbacgcg 5100 
cagaaaaaaa ggabcbcaag aagabccbtb gabcbbtbcb acggggbcbg acgcbcagbg 5160 
gaacgaaaac bcacgbbaag ggabbttggb cabgagabba tcaaaaagga bcbbcaccba 5220 
gabccttbba aabbaaaaab gaagbtbbaa abcaabcbaa agbabababg agbaaacbbg 5280 
gbctgacagb baccaabgcb baabcagbga ggcacctabc tcagcgabcb gbcbattbcg 5340 
bbcabccaba gbbgccbgac bcgggggggg ggggcgcbga ggtcbgccbc gbgaagaagg 54 00 
bgbbgcbgac bcabaccagg ccbgaabcgc cccabcabcc agccagaaag bgagggagcc 54 60 
acggbbgabg agagcbbbgb bgbaggbgga ccagbbggbg abbbbgaacb bbbgcbbbgc 5520 
cacggaacgg bcbgcgbbgb cgggaagatg cgbgabcbga tccbbcaacb cagcaaaagb 5580 
tcgabbbabt caacaaagcc gccgbcccgb caagbcagcg taabgcbctg ccagbgtbac 5640 
aaccaabbaa ccaabbcbga tbagaaaaac bcatcgagca bcaaatgaaa cbgcaatbba 5700 
bbcababcag gabbabcaab accababbbb tgaaaaagcc gbbbctgbaa bgaaggagaa 5760 
aacbcaccga ggcagbbcca taggabggca agabccbggb abcggbcbgc gabbccgacb 5820 
cgbccaacab caabacaacc battaabbbc cccbcgbcaa aaabaaggbb abcaagbgag 5880 
aaatcaccab gagbgacgac tgaabccggb gagaabggca aaagcbbabg cabbbcbbbc 5940 
cagacbbgtb caacaggcca gccatbacgc bcgbcabcaa aabcacbcgc abcaaccaaa 6000 
ccgtbabbca bbcgbgabbg cgccbgagcg agacgaaaba cgcgabcgcb gtbaaaagga 6060 
caatbacaaa caggaatcga abgcaaccgg cgcaggaaca ctgccagcgc atcaacaaba 6120 
bbbbcacctg aabcaggaba tbctbcbaab accbggaabg cbgbbbbccc ggggabcgca 6180 
gbggbgagta accabgcabc abcaggagba cggabaaaat gcbbgabggb cggaagaggc 6240 
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abaaabbccg tcagccagbb tagtctgacc abcbcabcbg taacatcabt ggcaacgcta 6300 
cctttgccat gbbbcagaaa caactctggc gcatcgggct tcccahacaa bcgabagabt 6360 
gbcgcacctg attgcccgac abbatcgcga gcccabbbab acccatataa atcagcatcc 6420 
abgtbggaat btaabcgcgg ccbcgagcaa gacgtttccc gtbgaababg gctcataaca 6480 
cccctbgbab tactgbttat gbaagcagac agbtttattg ttcatgatga tatattttta 6540 
tcttgtgcaa bgbaacatca gagabttbga gacacaacgb ggctttcccc ccccccccab 6600 
babbgaagca tbtabcaggg tbabbgbcbc abgagcggat acatattbga atgtatbbag 6660 
aaaaabaaac aaataggggt tccgcgcaca bbbccccgaa aagbgccacc bgacgbcbaa 6720 
gaaaccabba bbatcabgac abbaacctab aaaaabaggc gbabcacgag gcccbbbcgb 6780 
c 6781 



<210> 55 
<211> 6808 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 55 

tcgcgcgbtb cggbgabgac ggtgaaaacc tcbgacacat gcagcbcccg gagacggbca 60 

cagctbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 

btggcgggtg tcggggctgg cbbaactabg cggcatcaga gcagattgta ctgagagbgc 18 0 

accababgcg gbgtgaaata ccgcacagab gcgbaaggag aaaataccgc atcagattgg 24 0 

ctabbggcca ttgcatacgt tgtatccaba tcabaatabg tacatttata ttggcbcabg 300 

tccaacatba ccgccabgtb gacatbgatb attgacbagt tatbaatagt aatcaabbac 360 

ggggbcabta gbbcabagcc cabababgga gbtccgcgbb acabaactta cggtaaatgg 420 

cccgccbggc tgaccgccca acgacccccg cccabtgacg bcaabaatga cgtatgbtcc 480 

catagtaacg ccaataggga ctttccabbg acgbcaatgg gtggagbabt tacggbaaac 540 

bgcccacbbg gcagbacatc aagtgtabca tatgccaagt acgcccccba btgacgbcaa 600 

bgacggtaaa bggcccgccb ggcabtabgc ccagbacabg accbtafcggg acttbcctac 660 

bbggcagbac abcbacgbat tagtcabcgc battaccabg gtgabgcggt bbtggcagta 720 

cabcaabggg cgtggabagc ggtbtgacbc acggggabtt ccaagtctcc accccabbga 780 

cgbcaabggg agtbtgtbtt ggcaccaaaa bcaacgggac tttccaaaab gbcgtaacaa 84 0 

ctccgcccca btgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb atabaagcag 900 

agctcgttta gtgaaccgbc agatcgcctg gagacgccat ccacgcbgbt tbgaccbcca 960 

tagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcatbg gaacgcggab 1020 

bccccgbgcc aagagbgacg taagbaccgc cbabagactc tataggcaca cccctbtggc 1080 

tcbtabgcab gctabacbgt btttggcbbg gggccbabac acccccgcbt ccbtatgcba 1140 

baggtgatgg batagcttag ccbataggbg tgggtbattg accatbabbg accacbcccc 1200 

babtggbgac gabactttcc abtacbaabc cabaacabgg cbcttbgcca caactatcbc 1260 

babtggcbab atgccaatac bcbgbcctbc agagacbgac acggactcbg tabbtbbaca 1320 

ggatggggtc ccabbtatta bbbacaaabb cacatabaca acaacgccgb cccccgtgcc 1380 

cgcagbbttb attaaacaba gcgbgggabc tccacgcgaa tctcgggbac gtgbbccgga 1440 

catgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 

agcggcbcat ggtcgcbcgg cagctcctbg cbcctaacag tggaggccag actbaggcac 1560 

agcacaabgc ccaccaccac cagbgtgccg cacaaggccg bggcggtagg gbabgbgbcb 1620 

gaaaabgagc gtggagabbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 

gcagaagaag abgcaggcag cbgagbbgbb gtabtctgat aagagtcaga ggtaacbccc 174 0 
gbtgcggbgc bgbbaacggt ggagggcagb gtagtcbgag cagtacbcgb tgctgccgcg 1800 

cgcgccacca gacataabag cbgacagact aacagacbgt tccbbtccab gggtcbbbbc 1860 
bgcagbcacc gbcgbcgaca cgtgbgabca gabatcgcgg ccgcbcbaga atgcgcgbga 1920 
aggagaagba ccagcaccbg tggcgcbggg gcbggcgcbg gggcaccabg cbgctgggca 1980 
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tgctgatgat ctgcagcgcc accgagaagc tgtgggbgac cgtgtacbac ggcgtgcccg 2040 
tgtggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc bacgacaccg 2100 
aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
tggtgctggt gaacgtgacc gagaacttcg acatgtggaa gaacgacabg gbggagcaga 2220 
tgcacgagga catcatcagc ctgtgggacc agagccbgaa gccctgcgtg aagctgaccc 2280 
ccctgtgcgt gagcctgaag tgcaccgacg ctagcaagaa cbgcagcbbc aacatcagca 234 0 
ccagcatccg cggcaaggtg cagaaggagt acgcctbctb ctacaagctg gacatcatcc 2400 
ccatcgacaa cgacaccacc agctacagcc tgaccagctg caacaccagc gtgatcaccc 2460 
aggcctgccc caaggtgagc bbcgagccca tccccaacca ctactgcgcc cccgccggcb 2520 
tcgccatcct gaagtgcaag gacaagaagb tcaacggcaa gggcccchgc accaacgtga 2580 
gcaccgtgca gtgcacccac ggcatccgcc ccgtggtgag cacccagctg ctgctgaacg 2640 
gtagcctggc cgaggaggag gbggtgatcc gcagcgctaa cttcgccgac aacgccaagg 2700 
tgatcatcgt gcagctgaac gagagcgtgg agatcaactg cacccgcccc aacaacaaca 2760 
cccgcaagag catccacabc ggccccggcc gcgccttcta caccaccggc gagatcatcg 2820 
gcgacatccg ccaggcccac tgcaacctga gccgcgccaa gtggaacgac accctgaaca 2880 
agatcgtgat caagctgcgc gagcagttcg gcaacaagac catcgbgtbc aagcacagca 2940 
gcggcggcga ccccgagatc gtgacccaca gcttcaactg cggcggcgag ttcttctact 3000 
gcaacagcac ccagctgttc aacagcacct ggttcaacag cacctggagc accgagggca 3 060 
gcaacaacac cgagggcagc gacaccatca ccctgccctg ccgcabcaag cagatcatca 312 0 
acabgbggca gaaggtgggc aaggccatgt acgccccccc catcagcggc cagatccgct 3180 
gcagcagcaa catcaccggc ctgctgctga cccgcgacgg cggcaacagc aacaacgaga 3240 
gcgagabcbt ccgtccgggc ggcggcgaca tgcgcgacaa ctggcgcagc gagctgtaca 3300 
agtacaaggt ggtgaagatc gagcccctgg gcgtggcccc caccaaggcc aagcttaccg 3360 
tccaggcccg ccagctgctg agcggcabcg tgcagcagca gaacaacctg chgcgcgcca 3420 
tcgaggccca gcagcacctg ctgcagctga ccgtgtgggg catcaagcag cbgcaggccc 3480 
gcaccctggc cgtggagcgc tacctgaagg accagcagca gctcgagcag atctggaacc 354 0 
acaccacctg gatggagtgg gaccgcgaga tcaacaacta caccagcctg atccacagcc 3600 
tgatcgagga gagccagaac cagcaggaga agaacgagca ggagcbgctg gagctggaca 3660 
agtgggccag ccbgbggaac tggttcaaca tcaccaactg gctgtggtac atcaagctgt 3720 
tcatcatgat cgtgggcggc cbggbgggcc tgcgcatcgt gttcgccgtg ctgagcatct 3780 
gaggatccag abcbgctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg 3840 
tgccttccbt gaccctggaa ggtgccachc ccactgtcct bbccbaataa aatgaggaaa 3900 
ttgcatcgca ttgtctgagb aggbgbcatb cbatbcbggg gggbggggtg gggcaggaca 3960 
gcaaggggga ggabtgggaa gacaatagca ggcatgctgg ggabgcggtg ggcbctatgg 4020 
gbacccaggt gctgaagaat tgacccggbt cctcctgggc cagaaagaag caggcacatc 4080 
ccctbctctg tgacacaccc tgbccacgcc cctggttctt agbtccagcc ccactcatag 4140 
gacacbcaba gcbcaggagg gctccgcctt caatcccacc cgcbaaagba ctbggagcgg 4200 
bctctcccbc ccbcabcagc ccaccaaacc aaaccbagcc bccaagagbg ggaagaaatt 4260 
aaagcaagat aggctattaa gbgcagaggg agagaaaatg ccbccaacab gtgaggaagb 4320 
aatgagagaa atcabagaat tbtaaggcca bgatbtaagg ccabcabggc cttaatcbbc 4380 
cgcbtcctcg ctcactgacb cgctgcgctc ggtcgttcgg cbgcggcgag cggtatcagc 444 0 
tcacbcaaag gcggtaabac ggttatccac agaabcaggg gataacgcag| gaaagaacat 4500 
gbgagcaaaa ggccagcaaa aggccaggaa ccgbaaaaag gccgcgtbgc bggcgtbttt 4560 
ccabaggcbc cgccccccbg acgagcabca caaaaatcga cgcbcaagbc agaggtggcg 4620 
aaacccgaca ggactataaa gabaccaggc gtfcbcccccb ggaagcbccc bcgtgcgctc 4680 
bccbgbtccg accctgccgc btaccggaba ccbgtccgcc tbtctccctt cgggaagcgb 4740 
ggcgcttbcb cabagctcac gctgbaggta tcbcagbbcg gtgtaggbcg ttcgcbccaa 4800 
gcbgggcbgt gbgcacgaac cccccgbbca gcccgaccgc tgcgcctbat ccggbaacta 4860 
tcgbctbgag bccaacccgg baagacacga ctbabcgcca ctggcagcag ccactggbaa 4920 
caggabbagc agagcgaggt abgtaggcgg bgcbacagag ttctbgaagb ggbggcctaa 4980 
cbacggcbac acbagaagaa cagtatbtgg batcbgcgcb cbgctgaagc cagbbaccbb 504 0 
cggaaaaaga gtbggbagcb cbtgatccgg caaacaaacc accgctggba gcggbggtbt 5100 
btbtgtbbgc aagcagcaga tbacgcgcag aaaaaaagga tctcaagaag atccbtbgafc 5160 
cbttbcbacg gggbcbgacg cbcagbggaa cgaaaacbca cgbbaaggga bttbggbcat 5220 
gagabtatca aaaaggabcb bcacctagab ccbtbbaaab taaaaabgaa gbtbbaaabc 5280 
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aabcbaaagb abatatgagb aaacbbggbc bgacagtbac caatgcttaa bcagbgaggc 5340 
accbatcbca gcgatctgtc tatttcgttc atccatagtt gcctgactcg gggggggggg 5400 
gcgctgaggt ctgcctcgbg aagaaggtgt tgctgactca taccaggcct gaatcgcccc 5460 
atcabccagc cagaaagtga gggagccacg gbtgabgaga gcbttgbbgb aggtggacca 5520 
gttggtgatt bbgaacbbbb gcbbbgccac ggaacggtct gcgttgtcgg gaagatgcgt 5580 
gatctgatcc ttcaactcag caaaagtbcg atttattcaa caaagccgcc gtcccgtcaa 5640 
gtcagcgtaa tgctctgcca gbgbbacaac caatbaacca abbcbgabta gaaaaactca 5700 
bcgagcatca aabgaaacbg caabtbabbc ababcaggab batcaatacc ababbbtbga 5760 
aaaagccgbt bctgbaabga aggagaaaac bcaccgaggc agttccatag gabggcaaga 5820 
tcctggbabc ggbcbgcgat bccgactcgb ccaacatcaa bacaaccbab taabbbcccc 5880 
bcgtcaaaaa baaggtbabc aagbgagaaa bcaccabgag tgacgacbga atccggbgag 5940 
aabggcaaaa gcbbabgcab bbcbbbccag acbbgtbcaa caggccagcc abtacgcbcg 6000 
bcabcaaaab cacbcgcabc aaccaaaccg bbabbcabbc gtgabbgcgc cbgagcgaga 6060 
cgaaabacgc gabcgcbgbb aaaaggacaa tbacaaacag gaatcgaatg caaccggcgc 6120 
aggaacacbg ccagcgcabc aacaababbb tcaccbgaab caggababtc bbcbaabacc 6180 
bggaatgcbg bbtbcccggg gabcgcagbg gbgagtaacc atgcabcatc aggagbacgg 6240 
abaaaabgcb tgabggbcgg aagaggcaba aatbccgbca gccagbbbag bcbgaccabc 6300 
bcabctgbaa cabcabbggc aacgcbacct bbgccatgbb bcagaaacaa cbcbggcgca 6360 
tcgggctbcc cabacaabcg abagabbgbc gcacctgabb gcccgacabb abcgcgagcc 6420 
cabbbatacc catabaaabc agcabccatg bbggaabbba abcgcggccb cgagcaagac 6480 
gbbbcccgbb gaababggct cabaacaccc cbtgtabbac bgttbabgba agcagacagb 6540 
bbbabtgbbc abgabgabat abbbbtabct bgbgcaabgb aacatcagag abtbbgagac 6600 
acaacgbggc bttccccccc cccccabbab tgaagcatbt abcagggtba bbgbctcabg 6660 
agcggataca batbtgaabg tattbagaaa aabaaacaaa baggggbbcc gcgcacabbb 6720 
ccccgaaaag bgccaccbga cgbcbaagaa accabbatba bcatgacatb aaccbataaa 6780 
aabaggcgba bcacgaggcc cbbbcgbc 6808 



<210> 56 
<211> 6769 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 56 

bcgcgcgbtb cggbgabgac ggtgaaaacc bcbgacacat gcagctcccg gagacggbca 60 
cagcbtgbct gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 120 
bbggcgggtg bcggggcbgg cbtaacbabg cggcatcaga gcagatbgba cbgagagbgc 180 
accababgcg gbgbgaaata ccgcacagab gcgbaaggag aaaabaccgc atcagattgg 24 0 
cbabbggcca bbgcabacgb tgtatccaba bcabaatabg tacabbbaba btggcbcabg 300 
tccaacabba ccgccabgbt gacabtgabt abbgacbagb tabbaabagb aabcaatbac 360 
ggggbcabba gtbcatagcc catababgga gbbccgcgbb acabaactba cggbaaabgg 420 
cccgccbggc tgaccgccca acgacccccg cccabbgacg tcaabaabga cgbabgbbcc 480 
cabagbaacg ccaabaggga cbbbccatbg acgbcaabgg gbggagbabb bacggbaaac 540 
bgcccacbbg gcagbacabc aagbgbabca babgccaagb acgcccccba btgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbatgc ccagbacabg acctbabggg acbbbccbac 660 
bbggcagbac abctacgbab bagbcabcgc babbaccabg gbgabgcggb btbggcagba 720 
cabcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagbcbcc accccabbga 780 
cgbcaabggg agbtbgbbtb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
cbccgcccca btgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb ababaagcag 900 
agcbcgbbba gtgaaccgbc agabcgccbg gagacgccab ccacgcbgtb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggab 1020 
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tccccgbgcc aagagtgacg taagtaccgc ctatagactc tafcaggcaca cccctttggc 1080 

tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccbbatgcba 1140 

baggbgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 

tattggtgac gatactttcc abtacbaabc cataacatgg ctctttgcca caactatctc 1260 

tattggctat atgccaatac tcbgbccbbc agagactgac acggactctg tatttttaca 1320 

ggatggggtc ccabttabta tbbacaaabb cacatataca acaacgccgt cccccgtgcc 1380 

cgcagbbbbt attaaacata gcgtgggabc tccacgcgaa tctcgggtac gtgttccgga 1440 

catgggctct tctccggtag cggcggagct tccacatccg agccctggbc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag bggaggccag acttaggcac 1560 

agcacaabgc ccaccaccac cagtgtgccg cacaaggccg bggcggtagg gtatgtgtct 1620 

gaaaatgagc gtggagattg ggctcgcacg gcbgacgcag atggaagact fcaaggcagcg 1680 

gcagaagaag atgcaggcag cbgagttgbt gtattcbgat aagagtcaga ggtaactccc 1740 

gttgcggtgc tgfctaacggb ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacabaatag ctgacagact aacagactgt tcctttccat gggbcbtbbc 1860 

bgcagbcacc gbcgbcgaca cgtgtgatca gatatcgcgg ccgctctaga atgcgcgtga 1920 

aggagaagta ccagcacctg tggcgctggg gctggcgctg gggcaccatg ctgctgggca 1980 

tgctgatgat ctgcagcgcc accgagaagc tgtgggtgac cgtgtactac ggcgtgcccg 2040 

tgtggaagga ggccaccacc accctgcbct gcgccagcga cgccaaggcc tacgacaccg 2100 

aggtgcacaa cgtgtgggcc acccacgcct gcgbgcccac cgaccccaac ccccaggagg 2160 

tggtgctggt gaacgtgacc gagaacttcg acatgtggaa gaacgacatg gtggagcaga 2220 

tgcacgagga catcatcagc ctgtgggacc agagcctgaa gccctgcgtg aagctgaccc 2280 

ccctgtgcgt gagcctgaag tgcaccgacc tgaagaacga caccaacacc aacagcagca 2340 

gcggccgcat gatcatggag aagggcgaga tcaagaacbg cagcttcgct agcaccagch 24 00 

gcaacaccag cgtgatcacc caggcctgcc ccaaggtgag cttcgagccc abccccaacc 24 60 

actactgcgc ccccgccggc ttcgccatcc tgaagtgcaa ggacaagaag ttcaacggca 2520 

agggcccctg caccaacgtg agcaccgtgc agtgcaccca cggcatccgc cccgtggtga 2580 

gcacccagct gctgctgaac ggtagcctgg ccgaggagga ggtggtgatc cgcagcgcta 2640 

acbbcgccga caacgccaag gtgatcatcg tgcagctgaa cgagagcgtg gagatcaact 2700 

gcacccgccc caacaacaac acccgcaaga gcatccacat cggccccggc cgcgccttct 2760 

acaccaccgg cgagatcatc ggcgacatcc gccaggccca ctgcaacctg agccgcgcca 2820 

agtggaacga caccctgaac aagatcgtga tcaagctgcg cgagcagttc ggcaacaaga 2880 

ccatcgtgtt caagcacagc agcggcggcg accccgagat cgtgacccac agcttcaact 2940 

gcggcggcga gttcttctac tgcaacagca cccagctgtt caacagcacc tggttcaaca 3000 

gcacctggag caccgagggc agcaacaaca ccgagggcag cgacaccatc accctgccct 3060 

gccgcatcaa gcagatcatc aacatgtggc agaaggtggg caaggccatg tacgcccccc 3120 

ccatcagcgg ccagatccgc tgcagcagca acatcaccgg cctgctgctg acccgcgacg 3180 

gcggcaacag caacaacgag agcgagatct tccgtccggg cggcggcgac atgcgcgaca 3240 

actggcgcag cgagctgtac aagtacaagg tggtgaagat cgagcccctg ggcgtggccc 3300 

ccaccaaggc caagcttacc gtccaggccc gccagctgct gagcggcatc gtgcagcagc 3360 

agaacaacct gctgcgcgcc atcgaggccc agcagcaccb gctgcagctg accghgtggg 3420 

gcatcaagca gctgcaggcc cgcaccctgg ccgtggagcg ctacctgaag gaccagcagc 34 80 

agctcgagca gatctggaac cacaccacct ggatggagbg ggaccgcgag atcaacaact 354 0 

acaccagcct gatccacagc ctgabcgagg agagccagaa ccagcaggag aagaacgagc 3600 

aggagctgct ggagctggac aagtgggcca gcctgtggaa ctggttcaac atcaccaact 3660 

ggctgtggta catcaagctg ttcatcatga tcgtgggcgg cctggtgggc ctgcgcatcg 3720 

bgbtcgccgb gctgagcabc tgaggatcca gatcbgcbgt gcctbcbagb bgccagccat 3780 

ctgtbgtttg cccctccccc gtgccttcct tgaccctgga aggtgccacb cccactgbcc 3840 

bttccbaata aaatgaggaa attgcatcgc attgtcbgag taggtgbcat bcbattcbgg 3900 

ggggtggggb ggggcaggac agcaaggggg aggabbggga agacaatagc aggcabgcbg 3960 

gggabgcggb gggcbctabg ggbacccagg bgcbgaagaa bbgacccggt bcctccbggg 4020 

ccagaaagaa gcaggcacab ccccbbctcb gbgacacacc cbgtccacgc cccbggtfccb 4080 

tagttccagc cccactcaba ggacacbcat agcbcaggag ggctccgccb fccaatcccac 414 0 

ccgcbaaagb acttggagcg gbctcbccct ccctcabcag cccaccaaac caaacctagc 4200 

cbccaagagb gggaagaaat baaagcaaga baggcbatta agbgcagagg gagagaaaab 4260 

gcctccaaca bgbgaggaag baafcgagaga aatcafcagaa bbtbaaggcc abgabbbaag 4320 
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gccahcatgg ccbbaabcbb ccgcttcctc gctcactgac tcgctgcgct cggtcgbbcg 4380 

gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggbbabcca cagaatcagg 4440 

ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa 4 500 

ggccgcgfcbg cbggcgbttt tccataggct ccgcccccct gacgagcabc acaaaaatcg 4560 

acgcbcaagb cagaggbggc gaaacccgac aggactataa agataccagg cgtttccccc 4620 

tggaagctcc cbcgbgcgcb cbccbgbbcc gacccbgccg cttaccggat acctgtccgc 4680 

ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atcbcagtbc 4740 

ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg 4BO0 

cbgcgccbta bccggbaacb atcgtcttga gtccaacccg gtaagacacg acttatcgcc 4860 

actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 4 920 

gbbcbbgaag bggbggccba actacggcta cactagaaga acagtatttg gtabcbgcgc 4 980 

bctgctgaag ccagttacct tcggaaaaag agbtggtagc bcbbgabccg gcaaacaaac 5040 

caccgctggt agcggbggbb bbbbbgbtbg caagcagcag attacgcgca gaaaaaaagg 5100 

atctcaagaa gatccbttga bcbbtbcbac ggggtctgac gctcagtgga acgaaaactc 5160 

acgttaaggg abtbbggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa 5220 

ttaaaaatga agttttaaat caatctaaag tatatatgag taaacbbggb ctgacagtta 5280 

ccaatgctta atcagbgagg cacctabcbc agcgatctgt ctatttcgbb catccatagt 5340 

bgcctgacbc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg btgcbgactc 54 00 

ataccaggcc tgaatcgccc cabcatccag ccagaaagtg agggagccac ggbbgabgag 5460 

agcbtbgbtg baggtggacc agbtggbgat tttgaacbtt tgctbtgcca cggaacggtc 5520 

tgcgttgtcg ggaagabgcg tgatctgatc cttcaactca gcaaaagttc gatttattca 5580 

acaaagccgc cgtcccgtca agbcagcgta atgctcbgcc agtgbbacaa ccaattaacc 5640 

aabbcbgatb agaaaaacbc abcgagcabc aaabgaaact gcaatttatb cabatcagga 5700 

ttabcaabac cababtbtbg aaaaagccgt ttctgbaatg aaggagaaaa ctcaccgagg 5760 

cagbtccata ggatggcaag abcctggtab cggtctgcga btccgactcg tccaacatca 5 82 0 

atacaaccba bbaatbbccc cbcgtcaaaa ataaggbtab caagtgagaa atcaccabga 5880 

gtgacgactg aatccggbga gaatggcaaa agcttatgca tbbcbhtcca gacbtgbtca 5940 

acaggccagc cabtacgcbc gtcabcaaaa tcacbcgcab caaccaaacc gtbattcatb 6000 

cgtgabtgcg cctgagcgag acgaaatacg cgatcgctgt baaaaggaca atbacaaaca 6060 

ggaatcgaab gcaaccggcg caggaacact gccagcgcat caacaatabt ttcaccbgaa 6120 

bcaggabatb ctbctaatac cbggaabgcb gbbtbcccgg ggatcgcagt ggbgagbaac 6180 

ca tgcabcat caggagbacg gabaaaabgc btgatggtcg gaagaggcab aaattccgtc 624 0 

agccagtbta gbcbgaccat cbcatcbgba acabcattgg caacgctacc ttbgccatgt 6300 

ttcagaaaca acbctggcgc atcgggcbbc ccabacaatc gabagatbgb cgcacctgab 6360 

tgcccgacab batcgcgagc ccabbtabac ccababaaab cagcabccat gbbggaatbb 6420 

aatcgcggcc tcgagcaaga cgtttcccgt tgaabatggc bcataacacc ccttgtatta 6480 

cbgbbbabgb aagcagacag bbbbabbgbb. cabgabgata babtbttatc ttgbgcaabg 6540 

taacabcaga gattbtgaga cacaacgtgg ctttcccccc ccccccatba btgaagcabt 6600 

batcagggtb atbgtcbcab gagcggatac atabbbgaab gtatbbagaa aaataaacaa 6660 

abaggggbbc cgcgcacabb bccccgaaaa gtgccaccbg acgbctaaga aaccabbabb 6720 

atcafcgacab baaccbataa aaabaggcgb atcacgaggc ccttbcgtc 6769 



<210> 57 
<211> 6784 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 57 

tcgcgcgbtb cggtgatgac ggbgaaaacc tcbgacacab gcagcbcccg gagacggtca 60 
cagctbgbct gtaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggbg 12 0 
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ttggcgggbg tcggggctgg cbbaacbatg 
accatatgcg gtgtgaaata ccgcacagat 
cbabbggcca ttgcatacgt tgtatccata 
bccaacabba ccgccatgtt gacattgatt 
ggggtcabta gttcatagcc cabatabgga 
cccgcctggc tgaccgccca acgacccccg 
catagtaacg ccaataggga ctttccattg 
tgcccacttg gcagtacatc aagbgbatca 
tgacggtaaa bggcccgccb ggcabtatgc 
ttggcagtac atctacgtat tagtcatcgc 
catcaatggg cgtggatagc ggtttgactc 
cgtcaatggg agtttgtttt ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agctcgttta gtgaaccgtc agatcgcctg 
bagaagacac cgggaccgat ccagcctccg 
hccccgtgcc aagagtgacg taagtaccgc 
tcttatgcat gctabacbgb ttfctggcbbg 
baggtgabgg tatagcttag cctataggtg 
tattggtgac gatacbbbcc attactaatc 
tattggctat atgccaatac bcbgtccbbc 
ggatggggtc ccatttatta tbbacaaabb 
cgcagttttt attaaacata gcgtgggatc 
catgggctct bcbccggtag cggcggagct 
agcggctcat ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag atgcaggcag cbgagbbgbb 
gtbgcggbgc hgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gtcgtcgaca cgtgtgatca 
aggagaagta ccagcacctg tggcgctggg 
tgctgatgat ctgcagcgcc accgagaagc 
tgtggaagga ggccaccacc accctgctct 
aggtgcacaa cgtgtgggcc acccacgccb 
tggtgctggt gaacgtgacc gagaacttcg 
tgcacgagga catcabcagc cbgtgggacc 
cccbgtgcgb gagccbgaag tgcaccgacc 
gcggccgcah gatcabggag aagggcgaga 
gcatccgcgg caaggtgcag aaggagtacg 
tcgacaacga caccaccagc tacagcctga 
cctgccccaa ggbgagcttc gagcccatcc 
ccatcctgaa gtgcaaggac aagaagttca 
ccgtgcagbg cacccacggc abccgccccg 
gccbggccga ggaggaggtg gtgatccgca 
tcatcgtgca gctgaacgag agcgtggaga 
acctgagccg cgccaagtgg aacgacaccc 
agttcggcaa caagaccatc gtgttcaagc 
cccacagctt caactgcggc ggcgagtbcb 
gcacctggfct caacagcacc tggagcaccg 
ccabcacccb gcccbgccgc abcaagcaga 
ccabgbacgc cccccccatc agcggccaga 
tgctgacccg cgacggcggc aacagcaaca 
gcgacatgcg cgacaactgg cgcagcgagc 
cccbgggcgb ggcccccacc aaggccaagc 
gcafccgtgca gcagcagaac aaccbgcbgc 



cggcatcaga gcagatbgba cbgagagtgc 180 
gcgtaaggag aaaataccgc abcagattgg 240 
tcabaababg bacabbbaba bbggcbcatg 300 
attgactagt battaabagt aatcaatbac 360 
gtbccgcgtt acataactta cggtaaabgg 420 
cccatbgacg tcaataabga cgtatgttcc 480 
acgtcaatgg gtggagtabt bacggtaaac 540 
tatgccaagt acgcccccba btgacgtcaa 600 
ccagtacatg accttatggg actttcctac 660 
babbaccabg gtgatgcggb bbbggcagba 720 
ac ggggabbb ccaagbcbcc accccabtga 780 
bcaacgggac ttbccaaaat gbcgbaacaa 840 
gcgtgtacgg tgggaggbct atabaagcag 90 0 
gagacgccab ccacgcbgtb tbgaccbcca 960 
cggccgggaa cggbgcabtg gaacgcggat 1020 
cbatagactc tataggcaca ccccbtbggc 1080 
gggcctabac acccccgcbt cctbatgcta 1140 
tgggbtatbg accattatbg accactcccc 1200 
cabaacatgg cbctttgcca caactatcbc 1260 
agagacbgac acggacbctg tatttbbaca 1320 
cacatabaca acaacgccgt cccccgbgcc 13 80 
tccacgcgaa tcbcgggtac gbgbbccgga 1440 
bccacabccg agcccbggtc ccatgccbcc 1500 
ctcctaacag tggaggccag actbaggcac 1560 
cacaaggccg tggcggtagg gtatgbgbct 1620 
gcbgacgcag abggaagacb taaggcagcg 1680 
gtatbcbgab aagagbcaga ggtaactccc 1740 
gbagbcbgag cagbacbcgb bgctgccgcg 1800 
aacagactgt bccbttccab gggbcbbbbc 1860 
gatabcgcgg ccgcbcbaga abgcgcgbga 1920 
gcbggcgcbg. gggcaccatg cbgcbgggca 1980 
tgtgggtgac cgtgbactac ggcgbgcccg 2040 
gcgccagcga cgccaaggcc bacgacaccg 2100 
gcgbgcccac cgaccccaac ccccaggagg 2160 
acabgtggaa gaacgacatg gbggagcaga 222 0 
agagccbgaa gcccbgcgtg aagctgaccc 22 80 
tgaagaacga caccaacacc aacagcagca 2340 
tcaagaacbg cagcbbcaac abcagcacca 2400 
ccbbctbcba caagcbggac abcabcccca 2460 
ccagcbgcaa caccagcgbg abcacccagg 2520 
ccaaccacba cbgcgccccc gccggcbtcg 2580 
acggcaaggg ccccbgcacc aacgtgagca 2640 
bggtgagcac ccagcbgcbg ctgaacggba 2700 
gcgcbaacbb cgccgacaac gccaaggbga 2760 
bcaacbgcac ccgcgctagc gcccacbgca 2820 
bgaacaagab cgbgabcaag cbgcgcgagc 2 8 80 
acagcagcgg cggcgacccc gagatcgbga 2940 
bcbacbgcaa cagcacccag ctgbbcaaca 3000 
agggcagcaa caacaccgag ggcagcgaca 3 060 
bcabcaacab gbggcagaag gbgggcaagg 3120 
bccgcbgcag cagcaacabc accggccbgc 3180 
acgagagcga gabcbbccgb ccgggcggcg 3240 
tgtacaagba caaggtggbg aagabcgagc 3300 
btaccgtcca ggcccgccag cbgctgagcg 3 3 60 
gcgccabcga ggcccagcag caccbgctgc 342 0 
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agctgaccgb gtggggcabc aagcagcbgc aggcccgcac ccbggccgbg gagcgctacc 34 80 

tgaaggacca gcagcagctc gagcagatcb ggaaccacac cacctggatg gagtgggacc 3540 

gcgagatcaa caachacacc agcctgatcc acagcctgat cgaggagagc cagaaccagc 3600 

aggagaagaa cgagcaggag ctgctggagc tggacaagtg ggccagccbg tggaactggt 3660 

tcaacatcac caacbggctg bggbacabca agctgttcat catgatcgfcg ggcggcctgg 3720 

tgggcctgcg catcgtgttc gccgtgctga gcatctgagg atccagabcb gctgtgcctt 3780 

ctagttgcca gccatctgtt gbbbgccccb cccccgtgcc ttcchtgacc ctggaaggtg 3 840 

ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcatfcgt ctgagtaggt 3900 

gbcabbcbab bcbggggggb ggggtggggc aggacagcaa gggggaggat tgggaagaca 3960 

atagcaggca tgctggggat gcggtgggct ctatgggtac ccaggtgctg aagaattgac 4 020 

ccggttcctc ctgggccaga aagaagcagg cacatcccct tctctgtgac acaccctgtc 4080 

cacgcccctg gbbcbbagtb ccagccccac tcabaggaca ctcatagctc aggagggctc 4140 

cgccttcaafc cccacccgct aaagtacttg gagcggtctc tccctccctc atcagcccac 4200 

caaaccaaac cbagccbcca agagbgggaa gaaattaaag caagataggc tattaagtgc 4260 

agagggagag aaaatgcctc caacatgtga ggaagtaatg agagaaatca tagaatttta 4320 

aggccatgat ttaaggccat catggcctta abctbccgct tcctcgctca ctgactcgct 438.0 

gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 4440 

atccacagaa tcaggggata acgcaggaaa gaacabgbga gcaaaaggcc agcaaaaggc 4500 

caggaaccgt aaaaaggccg cgtbgcbggc gbttbtccab aggctccgcc cccctgacga 4560 

gcatcacaaa aabcgacgcb caagbcagag gtggcgaaac ccgacaggac tabaaagata 4620 

ccaggcgbbb ccccctggaa gctcccbcgb gcgcbcbccb gbtccgaccc bgccgcbbac 4680 

cggatacctg tccgccbbtc tcccbtcggg aagcgbggcg cbbbcbcata gctcacgcbg 4740 

baggbabctc agtbcggtgt aggbcgttcg cbccaagctg ggcbgbgbgc acgaaccccc 4 800 

cgttcagccc gaccgcbgcg cctbabccgg taactabcgt ctbgagbcca acccggtaag 4860 

acacgacbta bcgccactgg cagcagccac bggtaacagg abtagcagag cgaggbabgb 492 0 

aggcggbgcb acagagbtcb tgaagtggbg gcctaactac ggcbacacba gaagaacagt 4980 

abbtggbabc bgcgcbcbgc tgaagccagb taccbtcgga aaaagagbtg gtagcbcttg 5040 

atccggcaaa caaaccaccg ctggbagcgg tggtbbbbtt gtbbgcaagc agcagatbac 5100 

gcgcagaaaa aaaggabctc aagaagatcc tbbgabcbbt bcbacggggt cbgacgcbca 5160 

gbggaacgaa aacbcacgtb aagggabbbt ggbcatgaga btabcaaaaa ggabcbtcac 5220 

ctagatccbb bbaaatbaaa aabgaagttb baaatcaabc baaagbabat abgagbaaac 52 80 

bbggbcbgac agtbaccaab gcbbaabcag tgaggcaccb abcbcagcga bctgbctabb 5340 

bcgbbcabcc abagbbgccb gactcggggg gggggggcgc bgaggbcbgc ctcgbgaaga 5400 

aggbgtbgct gacbcabacc aggcctgaab cgccccabca bccagccaga aagtgaggga 54 60 

gccacggbbg abgagagctt bgbtgtaggb ggaccagbbg gtgatbbbga acbbbtgcbt 5520 

bgccacggaa cggbctgcgb bgtcgggaag atgcgbgabc bgabccbbca actcagcaaa 5580 

agbbcgatbb abbcaacaaa gccgccgbcc cgbcaagbca gcgbaabgct ctgccagbgb 5640 

bacaaccaab baaccaabbc bgabtagaaa aacbcabcga gcatcaaabg aaactgcaab 5700 

btatbcabab caggabtabc aabaccatat ttbbgaaaaa gccgtbbcbg taabgaagga 5760 

gaaaactcac cgaggcagbb ccataggabg gcaagabccb ggtabcggbc tgcgabbccg 5820 

actcgtccaa cabcaataca accbattaab bbccccbcgt caaaaabaag gttatcaagt 5880 

gagaaabcac cabgagbgac gacbgaabcc ggtgagaatg gcaaaagctb atgcabbbcb 5940 

tbccagacbb gbbcaacagg ccagccabta cgcbcgbcat caaaabcacb cgcatcaacc 6000 

aaaccgbtab tcabbcgbga bbgcgccbga gcgagacgaa abacgcgatc gcbgttaaaa 6060 

ggacaabtac aaacaggaat cgaatgcaac cggcgcagga acactgccag cgcatcaaca 6120 

ababbbtcac cbgaatcagg atabtcbbcb aabacctgga abgctgtbbb cccggggabc 6180 

gcagbggtga gtaaccabgc atcabcagga gbacggabaa aabgctbgab ggbcggaaga 6240 

ggcabaaatb ccgbcagcca gbbtagbcbg accabctcab cbgbaacabc abbggcaacg 63 00 

cbaccbbbgc cabgtttcag aaacaacbcb ggcgcabcgg gcbbcccaba caabcgabag 6360 

atbgbcgcac ctgabtgccc gacabbabcg cgagcccabb babacccaba taaabcagca 6420 

bccabgbbgg aabbtaatcg cggcctcgag caagacgbbb cccgttgaab atggctcata 6480 

acaccccbbg batbactgbt babgbaagca gacagtbbba bbgbbcabga bgabatabtb 6540 

btatcbbgtg caabgtaaca bcagagabbt bgagacacaa cgbggcbbbc cccccccccc 6600 

catbabbgaa gcabbtafcca gggbbabtgb cbcatgagcg gabacababb bgaabgbatb 6660 

bagaaaaaba aacaaabagg ggbbccgcgc acabbbcccc gaaaagbgcc accbgacgtc 6720 
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taagaaacca ttattatcat gacattaacc tataaaaata ggcgbatcac gaggcccttt 6780 
cgtc 6784 



<210> 58 
<211> 6787 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plaBmid pVR1012x/s containing HIV genes 
<400> 58 

bcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatafcgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata btggcbcatg 300 
bccaacabba ccgccatgtt gacattgatb attgactagt tabtaabagb aatcaattac 360 
ggggtcatta gttcatagcc catabatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccabbgacg tcaataatga cgtatgttcc 480 
catagbaacg ccaahaggga ctbtccattg acgtcaatgg gtggagtatt tacggtaaac 540 
bgcccacttg gcagtacabc aagtgtatca batgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tabtaccabg gtgatgcggt bttggcagba 720 
catcaatggg cgtggabagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtbbt ggcaccaaaa tcaacgggac tbtccaaaab gtcgtaacaa 840 
ctccgcccca btgacgcaaa tgggcggbag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgtbta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcatbg gaacgcggat 1020 
bccccgbgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
bcbtatgcat gctatactgt ttttggcttg gggcctabac acccccgctt ccctatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttthaca 1320 
ggatggggtc ccatttatfca tttacaaatfc cacatataca acaacgccgt cccccgtgcc 1380 
cgcagthhtt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggbcgctcgg cagctccttg ctcctaacag tggaggccag achtaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaafcgagc gtggagactg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacacaatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga atgcgcgtga 1920 
aggagaagta ccagcacctg tggcgccggg gctggcgctg gggcaccatg ctgctgggca 1980 
tgctgatgat ctgcagcgcc accgagaagc tgtgggcgac cgtgtactac ggcgtgcccg 2040 
tgtggaagga ggccaccacc accctgctct ■ gcgccagcga cgccaaggcc hacgacaccg 2100 
aggtgcacaa cgtgtgggcc acccacgcct, gcgtgcccac cgaccccaac ccccaggagg 2160 
tggtgctggt gaacgtgacc gagaacttcg acatgtggaa gaacgacafcg gtggagcaga 2220 
tgcacgagga cabcatcagc ctgtgggacc agagcctgaa gccctgcgtg aagctgaccc 2280 
ccctgtgcgt gagcctgaag tgcaccgacc tgaagaacga caccaacacc aacagcagca 2340 
gcggccgcat gabcacggag aagggcgaga tcaagaactg cagcbbcaac abcagcacca 24 00 
gcatccgcgg caaggbgcag aaggagtacg ccbbcbtcta caagctggac atcatcccca 2460 
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tcgacaacga caccaccagc tacagcctga ccagcbgcaa caccagcgbg atcacccagg 2520 
cctgccccaa ggtgagcttc gagcccatcc ccaaccacta ctgcgccccc gccggcttcg 2580 
ccabccbgaa gtgcaaggac aagaagthca acggcaaggg cccctgcacc aacgtgagca 2640 
ccgtgcagtg cacccacggc atccgccccg tggtgagcac ccagctgctg ctgaacggta 2700 
gcctggccga ggaggaggtg gtgatccgca gcgctaactb cgccgacaac gccaaggtga 2760 
tcatcgtgca gctgaacgag agcgtggaga tcaactgcac ccgccccaac aacaacaccc 2820 
gcaagagcat ccacatcggc cccggccgcg ccttctacac caccggcgag atcatcggcg 2880 
acatccgcca ggcccactgc aacctgagcc gcgccaagtg gaacgacacc ctgaacaaga 2940 
tcgtgatcaa gctgcgcgag cagttcggca acaagaccab cgtgttcaag cacagcagcg .3000 
gcggcgaccc cgagabcgbg acccacagct tcaactgcgg cggcgagttc tbcbacbgca 3060 
acagcgctag ccbgcccbgc cgcatcaagc agatcatcaa cabgbggcag aaggtgggca 3120 
aggccabgba cgcccccccc atcagcggcc agatccgctg cagcagcaac atcaccggcc 3180 
tgctgctgac ccgcgacggc ggcaacagca acaacgagag cgagatcttc cgtccgggcg 3240 
gcggcgacat gcgcgacaac bggcgcagcg agctgtacaa gtacaaggtg gtgaagatcg 3300 
agcccctggg cgtggccccc accaaggcca agcbtaccgb ccaggcccgc cagctgctga 3360 
gcggcabcgb gcagcagcag aacaaccbgc tgcgcgccat cgaggcccag cagcacctgc 34 20 
tgcagctgac cgtgtggggc atcaagcagc tgcaggcccg caccctggcc gtggagcgct 34 80 
acctgaagga ccagcagcag ctcgagcaga tctggaacca caccacctgg atggagtggg 3540 
accgcgagat caacaacbac accagcctga bccacagccb gatcgaggag agccagaacc 3 600 
agcaggagaa gaacgagcag gagctgctgg agctggacaa gtgggccagc ctgtggaact 3660 
ggttcaacat caccaactgg ctgtggtaca tcaagcbgtb catcatgatc gtgggcggcc 3720 
bggbgggccb gcgcatcgtg ttcgccgbgc tgagcabcbg aggatccaga tctgctgtgc 37 BO 
cttctagttg ccagccabcb gbbgtbbgcc cctcccccgt gccttccttg accctggaag 3 840 
gtgccactcc cactgtcctt tcctaataaa atgaggaaat bgcabcgcab bgbcbgagba 3900 
ggtgtcattc tattctgggg ggbggggbgg ggcaggacag caagggggag gabbgggaag 3960 
acaatagcag gcabgctggg gatgcggtgg gcbcbabggg tacccaggtg ctgaagaatt 4020 
gacccggttc cbccbgggcc agaaagaagc aggcacatcc cctbcbcbgb gacacaccct 4080 
gtccacgccc cbggbbcbta gttccagccc cactcatagg acactcatag ctcaggaggg 4140 
ctccgccttc aatcccaccc gctaaagtac ttggagcggt cbcbcccbcc ctcatcagcc 4200 
caccaaacca aacctagcct ccaagagtgg gaagaaatta aagcaagata ggctattaag 4260 
tg ca g a 999 a gagaaaafcgc ctccaacatg tgaggaagta atgagagaaa bcabagaatb 4320 
btaaggccat gatttaaggc catcatggcc ttaatcttcc gcttcctcgc tcactgactc 4380 
gcbgcgcbcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggbaabacg 4440 
gttatccaca gaatcagggg ataacgcagg aaagaacatg bgagcaaaag gccagcaaaa 4500 
ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga 4560 
cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag 4 620 
ataccaggcg ttbcccccbg gaagcbcccb cgbgcgcbcb ccbgbbccga cccbgccgcb 4680 
baccggatac cbgbccgccb bbcbcccbbc gggaagcgbg gcgcbbbcbc abagctcacg 4740 
cbgbaggbab cbcagbbcgg bgbaggbcgb bcgctccaag cbgggcbgbg bgcacgaacc 4 800 
ccccgbbcag cccgaccgcb gcgccbbabc cggbaacbab cgbcbbgagb ccaacccggb 4860 
aagacacgac bbabcgccac bggcagcagc cactggtaac aggabbagca gagcgaggba 4920 
tgbaggcggb gcbacagagt bcbbgaagbg gtggcctaac bacggcbaca cbagaagaac 4980 
agbabbtggb abcbgcgcbc bgcbgaagcc agbtaccbbc ggaaaaagag bbggbagcbc 5040 
tbgabccggc aaacaaacca ccgcbggbag cggtggbbbb bbbgbbbgca agcagcagab 5100 
bacgcgcaga aaaaaaggab cbcaagaaga bccttbgabc bbbbcbacgg ggbcbgacgc 5160 
bcagbggaac gaaaacbcac gbbaagggab btbggbcabg agabbabcaa aaaggabcbb 5220 
caccbagabc cbbbtaaabb aaaaabgaag btbbaaabca abcbaaagba bababgagba 5280 
aacbbggbcb gacagbbacc aatgctbaab cagbgaggca ccbabctcag cgabcbgbcb 5340 
abbbcgbbca bccatagbtg ccbgacbcgg gggggggggg cgcbgaggbc bgccbcgbga 5400 
agaaggbgbb gcbgacbcab accaggccbg aabcgcccca tcabccagcc agaaagbgag 5460 
ggagccacgg bbgabgagag cbtbgbbgba ggbggaccag tbggbgabbb bgaacbbbbg 5520 
cbbbgccacg gaacggbcbg cgtbgbcggg aagabgcgbg abcbgabccb bcaacbcagc 5580 
aaaagtbcga btbabbcaac aaagccgccg bcccgbcaag tcagcgbaab gcbcbgccag 5640 
bgbbacaacc aabbaaccaa bbcbgabbag aaaaacbcab cgagcabcaa abgaaacbgc 5700 
aabbbabbca babcaggabb abcaabacca babbbbbgaa aaagccgbbb cbgtaabgaa 5760 
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ggagaaaact caccgaggca gttccatagg atggcaagat cctggtatcg gtctgcgatt 5820 
ccgactcgtc caacatcaat acaacctatt aabbbccccb cgbcaaaaab aaggttatca 5880 
agtgagaaat caccatgagt gacgactgaa tccggtgaga atggcaaaag ctbabgcabb 5940 
tctttccaga cttgttcaac aggccagcca ttacgctcgb catcaaaatc actcgcatca 6000 
accaaaccgt tattcattcg bgabbgcgcc tgagcgagac gaaafcacgcg atcgctgtta 6060 
aaaggacaat tacaaacagg aabcgaabgc aaccggcgca ggaacactgc cagcgcatca 6120 
acaatatttt cacctgaatc aggatattct tctaatacct ggaatgctgt bbbcccgggg 6180 
atcgcagbgg tgagtaacca tgcahcatca ggagtacgga taaaatgctt gatggbcgga 6240 
agaggcataa attccgtcag ccagtttagt ctgaccatct catctgtaac atcattggca 6300 
acgctacctt bgccabgbtb cagaaacaac tctggcgcat cgggcbtccc atacaatcga 6360 
tagattgtcg cacctgattg cccgacatta tcgcgagccc atttataccc atabaaabca 6420 
gcatccatgt tggaatbtaa bcgcggcctc gagcaagacg btbcccgbtg aababggcbc 6480 
ataacacccc bbgbabbacb gtbbabgtaa gcagacagbb tbabtgbbca bgatgababa 6540 
ttttbatcbb gbgcaabgta acabcagaga bttbgagaca caacgbggct btcccccccc 6600 
ccccabtabb gaagcatbta bcagggttab tgtcbcabga gcggabacab abbbgaabgb 6660 
attfcagaaaa ataaacaaat aggggbtccg cgcacabbbc cccgaaaagb gccaccbgac 6720 
gtctaagaaa ccabbabtab catgacatta accbabaaaa abaggcgbab cacgaggccc 6780 
bbbcgtc 6787 



<210> 59 
<211> 6691 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 

* i 

<400> 59 

bcgcgcgbtb cggbgabgac ggbgaaaacc tcbgacacat gcagcbcccg gagacggtca 60 
cagctbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg tcagcgggbg 120 
bbggcgggbg tcggggcbgg cbbaactatg cggcatcaga gcagabbgba cbgagagbgc 180 
accatatgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaataccgc abcagabbgg 24 0 
cbattggcca bbgcabacgt bgtabccata bcabaababg bacatbbaba tbggcbcabg 300 
bccaacabba ccgccabgbb gacabbgatt abbgacbagb babbaabagb aabcaabbac 360 
ggggtcatba gtbcatagcc catabatgga gbbccgcgbb acabaacbba cggtaaabgg 420 
cccgcctggc tgaccgccca acgacccccg cccabbgacg bcaabaatga cgtatgbbcc 480 
cabagbaacg ccaabaggga ctbbccabtg acgbcaatgg gtggagbabb tacggbaaac 540 
bgcccacbbg gcagbacabc aagbgbabca babgccaagb acgcccccba bbgacgbcaa 600 
bgacggbaaa tggcccgcct ggcabbabgc ccagbacabg accbbatggg actttccbac 660 
bbggcagbac abcbacgbat bagbcabcgc babbaccatg gtgabgcggb tbtggcagba 720 
cabcaatggg cgtggatagc ggbbbgactc acggggattb ccaagtctcc accccabbga 780 
cgbcaabggg agtbbgbbbb ggcaccaaaa bcaacgggac ttbccaaaab gbcgbaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb abataagcag 900 
agctcgbbba gtgaaccgbc agabcgcctg gagacgccab ccacgctgbb tbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggat 1020 
bccccgbgcc aagagbgacg baagbaccgc ctabagacbc babaggcaca ccccbtbggc 1080 
bcbtabgcab gcbabacbgb bbtbggcbbg gggccbabac acccccgcbb ccttabgcba 1140 
baggbgabgg babagcbbag ccbabaggbg tgggbtabtg accabbabtg accacbcccc 12 00 
babbggbgac gabacbbbcc abbacbaabc cabaacabgg cbcbbtgcca caacbabcbc 1260 
babbggcbab abgccaabac bcbgbccbbc agagacbgac acggactctg battbbbaca 1320 
ggabggggtc ccabbbabba bbtacaaabb cacabataca acaacgccgt cccccgbgcc 13 80 
cgcagbtbtb abbaaacaba gcgbgggabc bccacgcgaa bcbcgggtac gbgbtccgga 1440 
cabgggcbcb bcbccggtag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
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agcggcbcab ggtcgctcgg cagcbccbtg ctcctaacag tggaggccag achtaggcac X560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gbabgbgbct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagacfc taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gbabtcbgab aagagtcaga ggbaacbccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt bcctttccab gggbcbbbbc 1860 
tgcagtcacc gtcgtcgaca cgtgtgahca gatatcgcgg ccgctctaga abgcgcgbga 1920 
aggagaagta ccagcacctg bggcgcbggg gctggcgcbg gggcaccabg ctgctgggca 1980 
bgctgatgat cbgcagcgcc accgagaagc bgtgggbgac cgbgbacbac ggcgbgcccg 2040 
tSJtgf? 33 ^ 3 ggccaccacc acccbgcbcb gcgccagcga cgccaaggcc bacgacaccg 2100 
aggbgcacaa cgtgbgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
tggtgctggb gaacgbgacc gagaacbtcg acabgbggaa gaacgacabg gbggagcaga 2220 
bgcacgagga catcabcagc cbgfcgggacc agagccbgaa gccctgcgbg aagctgaccc 2280 
ccctgbgcgb gagccbgaag tgcaccgacg cbagcaccag cbgcaacacc agcgtgabca 2340 
cccaggccbg ccccaaggbg agcbbcgagc ccatccccaa ccacbacbgc gcccccgccg 2400 
gcttcgccab cctgaagbgc aaggacaaga agtbcaacgg caagggcccc tgcaccaacg 2460 
bgagcaccgt gcagtgcacc cacggcatcc gccccgtggt gagcacccag ctgctgcbga 2520 
acggtagccb ggccgaggag gaggbggbga bccgcagcgc baacbbcgcc gacaacgcca 2580 
aggbgatcab cgtgcagcbg aacgagagcg tggagabcaa cbgcacccgc cccaacaaca 264 0 
acacccgcaa gagcatccac abcggccccg gccgcgcctt cbacaccacc ggcgagabca 2700 
bcggcgacab ccgccaggcc cacbgcaacc bgagccgcgc caagtggaac gacacccbga 2760 
acaagabcgt gatcaagctg cgcgagcagt bcggcaacaa gaccatcgtg btcaagcaca 2 82 0 
gcagcggcgg cgaccccgag atcgbgaccc acagcbbcaa cbgcggcggc gagtbcttct 2880 
actgcaacag cacccagcbg btcaacagca ccbggbtcaa cagcaccbgg agcaccgagg 2940 
gcagcaacaa caccgagggc agcgacacca bcacccbgcc ctgccgcatc aagcagatca 3 000 
bcaacatgbg gcagaaggtg ggcaaggcca bgtacgcccc ccccabcagc ggccagatcc 3060 
gctgcagcag caacabcacc ggccbgcbgc tgacccgcga cggcggcaac agcaacaacg 3120 
agagcgagat ctbccgbccg ggcggcggcg acatgcgcga caactggcgc agcgagctgb 3180 
acaagtacaa ggbggbgaag abcgagcccc bgggcgbggc ccccaccaag gccaagctta 3240 
ccgbccaggc ccgccagcbg cbgagcggca bcgbgcagca gcagaacaac cbgcbgcgcg 3300 
ccabcgaggc ccagcagcac cbgcbgcagc tgaccgbgbg gggcabcaag cagcbgcagg 3360 
cccgcaccct ggccgbggag cgcbaccbga aggaccagca gcagcbcgag cagatcbgga 3420 
accacaccac cbggabggag tgggaccgcg agabcaacaa cbacaccagc cbgatccaca 34 80 
gcctgabcga ggagagccag aaccagcagg agaagaacga gcaggagctg cbggagcbgg 354 0 
acaagbgggc cagccbgtgg aacbggbbca acabcaccaa cbggcbgbgg tacatcaagc 3600 
bgbbcabcat gabcgbgggc ggccbggbgg gccbgcgcab cgbgbbcgcc gbgcbgagca 3660 
bcbgaggabc cagabcbgct gbgccbtcba gtbgccagcc atcbgbbgtb bgccccbccc 3720 
ccgtgccbbc cbtgaccctg gaaggtgcca cbcccactgb ccbbbcctaa baaaabgagg 3780 
aaattgcabc gcabbgbctg agtaggbgbc abbcbatbcb ggggggtggg gbggggcagg 3840 
acagcaaggg ggaggabtgg gaagacaaba gcaggcabgc bggggatgcg gbgggcbcba 3900 
bgggbaccca ggbgcbgaag aatbgacccg gbbcctccbg ggccagaaag aagcaggcac 3960 
abccccbbcb cbgbgacaca cccbgbccac gcccctggbb cbbagbbcca gccccactca 4020 
baggacacbc abagcbcagg agggcbccgc ctbcaabccc acccgcbaaa gbacbbggag 4080 
cggtcbcbcc ctcccbcabc agcccaccaa accaaaccba gcctccaaga gbgggaagaa 4140 
atbaaagcaa gabaggcbab baagbgcaga gggagagaaa abgccbccaa cabgbgagga 4200 
agbaabgaga gaaabcabag aabbbtaagg ccatgatfcta aggccabcab ggccbbaabc 4260 
bbccgcbbcc bcgcbcacbg acbcgcbgcg cbcggtcgbb cggcbgcggc gagcggbabc 4320 
agcbcacbca aaggcggbaa bacggbbabc cacagaabca ggggabaacg caggaaagaa 4380 
cabgbgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgb tgcbggcgbb 4440 
tbtccabagg cbccgccccc cbgacgagca tcacaaaaab cgacgcbcaa gbcagaggbg 4 500 
gcgaaacccg acaggactab aaagabacca ggcgtbtccc ccbggaagcb cccbcgtgcg 4560 
cbcbccbgbb ccgacccbgc cgcbbaccgg ataccbgbcc gccbbbcbcc cbbcgggaag 4620 
cgbggcgcbb bcbcabagcb cacgcbgbag gtatcbcagb bcggbgbagg tcgtbcgcbc 4680 
caagcbgggc bgbgbgcacg aaccccccgt bcagcccgac cgcbgcgccb tabccggbaa 4740 
cbabcgtcbb gagbccaacc cggbaagaca cgacbbabcg ccacbggcag cagccacbgg 4800 
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taacaggatt agcagagcga ggtatgtagg cggtgctaca gagbbcbbga agbggtggcc 4860 
taactacggc tacactagaa gaacagbatb tggtatctgc gcbcbgcbga agccagttac 4 920 
cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg 4 980 
tbtbbbbgbt tgcaagcagc agattacgcg cagaaaaaaa ggatcbcaag aagabccbbb 5040 
gabcbbbbct acggggtctg acgctcagtg gaacgaaaac tcacgtbaag ggattttggt 5100 
catgagatta tcaaaaagga bcbbcaccta gabcctbtba aattaaaaat gaagttttaa 5160 
atcaatctaa agtatatatg agbaaacbtg gtctgacagt taccaabgct taatcagtga 5220 
ggcaccbabc bcagcgabcb gbcbabbbcg bbcabccaba gttgccbgac bcgggggggg 5280 
9999 c 9 ct 9 a ggtctgccbc gfcgaagaagg tgtbgctgac tcataccagg ccbgaabcgc 5340 
cccatcabcc agccagaaag bgagggagcc acggbtgatg agagctbtgt bgtaggtgga 5400 
ccagtbggbg abbttgaact bbbgcbbtgc cacggaacgg bcbgcgbbgb cgggaagabg 5460 
cgbgabcbga bcctbcaacb cagcaaaagb bcgabtbabb caacaaagcc gccgbcccgb 5520 
caagtcagcg baatgcfccbg ccagbgbtac aaccaabbaa ccaabbcbga tbagaaaaac 5580 
bcabcgagca bcaaabgaaa cbgcaattta bbcababcag gabtatcaab accabafcbbt 5640 
bgaaaaagcc gbbtcbgtaa bgaaggagaa aacbcaccga ggcagbbcca baggabggca 5700 
agabccbggt abcggbctgc gabbccgacb cgbccaacab caatacaacc tabbaatbbc 5760 
cccbcgbcaa aaataaggbb abcaagbgag aaabcaccab gagtgacgac bgaabccggb 5820 
gagaabggca aaagctbabg cabbbcbbbc cagactbgbb caacaggcca gccabbacgc 5880 
bcgbcabcaa aabcacbcgc abcaaccaaa ccgbbabtca bbcgbgabbg cgccbgagcg 5940 
agacgaaaba cgcgatcgcb gbbaaaagga caabbacaaa caggaabcga abgcaaccgg 6000 
cgcaggaaca ctgccagcgc. abcaacaata bbbbcaccbg aabcaggata bbcbbcbaat 6060 
accbggaatg cbgtbtbccc ggggabcgca gbggbgagba accabgcatc abcaggagba 6120 
cggabaaaab gcttgabggb cggaagaggc abaaatbccg bcagccagtb bagbcbgacc 6180 
abcbcabcbg taacatcabb ggcaacgcba ccbtbgccab gtbtcagaaa caactcbggc 6240 
gcabcgggcb bcccatacaa bcgatagatb gbcgcaccbg abbgcccgac abbatcgcga 63 00 
gcccabbbat acccatataa abcagcatcc abgbbggaab btaatcgcgg cctcgagcaa 6360 
gacgtbbccc gbbgaatabg gcbcabaaca ccccbbgbab bacbgbtbab gbaagcagac 6420 
agbbttabbg bbcatgabga bababbbbba bcbbgbgcaa bgtaacabca gagatbbbga 64 80 
gacacaacgt ggcbtbcccc ccccccccab babbgaagca bbbabcaggg bbabbgbcbc 6540 
atgagcggab acatabbbga abgtabbbag aaaaabaaac aaabaggggb bccgcgcaca 6600 
btbccccgaa aagbgccacc bgacgbcbaa gaaaccabba bbabcabgac abbaaccbab 6660 
aaaaabaggc gbabcacgag gcccbbbcgb c 6691 



<210> 60 
<211> 6727 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVRl012x/s containing HIV genes 
<400> 60 

bcgcgcgtbb cggbgabgac ggbgaaaacc tcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 12 0 
bbggcgggbg bcggggctgg cbbaactabg cggcatcaga gcagabtgba cbgagagtgc 180 
accatabgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagattgg 240 
cbabbggcca bbgcabacgb bgbabccaba bcabaababg tacabbtaba tbggcbcabg 300 
bccaacabba ccgccabgbb gacabbgabb abtgacbagb babbaatagb aabcaabtac 360 
ggggbcabba gbbcatagcc cababatgga gbbccgcgbt acabaacbba cggbaaatgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbtcc 480 
catagbaacg ccaabaggga cbbtccabbg acgbcaabgg gbggagbabb tacggbaaac 540 
bgcccacbtg gcagbacabc aagbgbabca tatgccaagb acgcccccba tbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbbatggg acbbtcctac 660 
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ttggcagtac atctacgtat tagtcatcgc 
catcaatggg cgtggatagc ggtttgactc 
cgtcaabggg agtttgtttt ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agcbcgbbba gbgaaccgbc agatcgcctg 
tagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagtgacg taagtaccgc 
bctbabgcab gctatactgt bbbbggcbbg 
taggtgatgg babagctbag cctataggtg 
tattggtgac gatactttcc attactaatc 
tattggctat atgccaatac tctgtccttc 
ggatggggtc ccatttatta bbbacaaabb 
cgcagttttt attaaacata gcgtgggatc 
cabgggctcb tctccggtag cggcggagct 
agcggcbcab ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gbggagabbg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gtcgtcgaca cghgtgatca 
aggagaagba ccagcacctg tggcgctggg 
bgcbgabgab ctgcagcgcc accgagaagc 
tgtggaagga ggccaccacc acccbgcbcb 
aggtgcacaa cgtgtgggcc acccacgcct 
bggbgctggt gaacgtgacc gagaacttcg 
tgcacgagga cafccatcagc cbgbgggacc 
cccbgtgcgb gagcctgaag hgcaccgacg 
ccagcatccg cggcaaggtg cagaaggagt 
ccatcgacaa cgacaccacc agctacagcc 
aggccbgccc caaggtgagc ttcgagccca 
tcgccatcct gaagtgcaag gacaagaagt 
gcaccgtgca gtgcacccac ggcatccgcc 
gtagcctggc cgaggaggag gtggtgatcc 
tgatcatcgt gcagcbgaac gagagcgtgg 
gcaacctgag ccgcgccaag tggaacgaca 
agcagttcgg caacaagacc atcgtgttca 
tgacccacag cttcaactgc ggcggcgagt 
acagcaccbg gttcaacagc acctggagca 
acaccabcac cctgccctgc cgcabcaagc 
aggccatgta cgcccccccc atcagcggcc 
tgctgctgac ccgcgacggc ggcaacagca 
gcggcgacat gcgcgacaac tggcgcagcg 
agccccbggg cgtggccccc accaaggcca 
gcggcabcgb gcagcagcag aacaacctgc 
tgcagctgac cgtgtggggc abcaagcagc 
acctgaagga ccagcagcag ctcgagcaga 
accgcgagat caacaactac accagcctga 
agcaggagaa gaacgagcag gagctgctgg 
ggbbcaacab caccaacbgg ctgtggtaca 
bggbgggccb gcgcatcgtg bbcgccgbgc 
cttctagttg ccagccatct gttgtttgcc 
gtgccactcc cacbgbccbb bccbaabaaa 
ggbgtcabbc tattctgggg ggtggggtgg 
acaatagcag gcatgctggg gatgcggtgg 



babbaccabg gbgabgcggb bbbggcagba 720 
acggggabbb ccaagbcbcc accccabbga 780 
bcaacgggac bbbccaaaab gbcgbaacaa 840 
gcgtgbacgg tgggaggbcb ababaagcag 900 
gagacgccab ccacgcbgtb bbgacctcca 960 
cggccgggaa cggbgcabtg gaacgcggat 1020 
cbatagactc babaggcaca cccctbbggc 1080 
gggccbabac acccccgcbb ccttabgcba 1140 
tgggbbabtg accatbabtg accacbcccc 1200 
cabaacabgg cbcbtbgcca caacbabcbc 1260 
agagacbgac acggactctg battbbbaca 1320 
cacababaca acaacgccgb cccccgbgcc 13 80 
bccacgcgaa bctcgggbac gbgbbccgga 1440 
bccacabccg agcccbggbc ccatgccbcc 1500 
cbccbaacag tggaggccag acbbaggcac 1560 
cacaaggccg bggcggtagg gbatgbgtcb 1620 
gcbgacgcag abggaagacb taaggcagcg 1680 
gbabbcbgab aagagbcaga ggbaacbccc 1740 
gbagbcbgag cagbactcgb bgcbgccgcg 1800 
aacagacbgb tccbttccab gggtcbbtbc 1860 
gababcgcgg ccgcbcbaga abgcgcgtga 192 0 
gcbggcgcbg gggcaccabg cbgcbgggca 1980 
tgbgggtgac cgtgbacbac ggcgbgcccg 2040 
gcgccagcga cgccaaggcc bacgacaccg 2100 
gcgbgcccac cgaccccaac ccccaggagg 2160 
acabgbggaa gaacgacabg gbggagcaga 2220 
agagcctgaa gccctgcgbg aagcbgaccc 2280 
cbagcaagaa cbgcagcbbc aacabcagca 2340 
acgccbbcbb cbacaagcbg gacatcabcc 2400 
bgaccagcbg caacaccagc gbgabcaccc 2460 
bccccaacca cbacbgcgcc cccgccggcb 2520 
bcaacggcaa gggccccbgc accaacgbga 2580 
ccgtggtgag cacccagcbg cbgcbgaacg 2640 
gcagcgcbaa cbbcgccgac aacgccaagg 2700 
agabcaacbg cacccgcgcb agcgcccacb 2760 
cccbgaacaa gabcgbgabc aagcbgcgcg 2820 
agcacagcag cggcggcgac cccgagabcg 2880 
bcbbctacbg caacagcacc cagcbgbbca 2940 
ccgagggcag caacaacacc gagggcagcg 3000 
agabcabcaa cabgbggcag aaggbgggca 3060 
agabccgcbg cagcagcaac abcaccggcc 3120 
acaacgagag cgagabctbc cgbccgggcg 3180 
agcbgbacaa gbacaaggbg gbgaagabcg 3240 
agcbbaccgb ccaggcccgc cagctgctga 3300 
tgcgcgccab cgaggcccag cagcaccbgc 33 60 
bgcaggcccg caccctggcc gtggagcgcb 3420 
bcbggaacca caccaccbgg abggagbggg 3480 
bccacagccb gabcgaggag agccagaacc 3540 
a g ct gga ca a gtgggccagc ctgbggaacb 3600 
bcaagcbgbb cabcatgabc gtgggcggcc 3660 
bgagcabcbg aggatccaga bctgcbgbgc 372 0 
ccbcccccgb gccbbccbbg acccbggaag 3780 
abgaggaaab bgcabcgcab bgbcbgagba 3840 
ggcaggacag caagggggag gabbgggaag 3900 
gcbcbabggg bacccaggbg cbgaagaabb 3960 
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gacccggbbc ctcctgggcc agaaagaagc 
gtccacgccc cbggbbcbba gtbccagccc 
ctccgccttc aatcccaccc gctaaagtac 
caccaaacca aacctagcct ccaagagtgg 
tgcagaggga gagaaaatgc ctccaacatg 
ttaaggccat gabtbaaggc cahcatggcc 
gctgcgctcg gtcgttcggc bgcggcgagc 
gbbatccaca gaatcagggg ataacgcagg 
ggccaggaac cgbaaaaagg ccgcgbbgcb 
cgagcabcac aaaaatcgac gctcaagtca 
ataccaggcg tttccccctg gaagctccct 
baccggabac ctgbccgccb ttctcccttc 
cbgbaggbab ctcagttcgg tgtaggtcgt 
ccccgttcag cccgaccgct gcgccttatc 
aagacacgac ttatcgccac tggcagcagc 
tgtaggcggt gcbacagagb bcbbgaagbg 
agtabbbggb abcbgcgcbc tgctgaagcc 
tbgabccggc aaacaaacca ccgctggtag 
bacgcgcaga aaaaaaggat ctcaagaaga 
tcagtggaac gaaaactcac gbbaagggab 
cacctagatc cttttaaatt aaaaatgaag 
aacbbggtcb gacagbbacc aabgctbaat 
atbtcgttca tccatagtbg cctgacbcgg 
agaaggbgbb gcbgacbcab accaggccbg 
ggagccacgg bbgabgagag cbttgbbgba 
cbbbgccacg gaacggbcbg cgtbgbcggg 
aaaagbbcga bbbattcaac aaagccgccg 
bgtbacaacc aabtaaccaa tbctgabbag 
aattbabtca babcaggatt abcaabacca 
ggagaaaacb caccgaggca gbtccabagg 
ccgacbcgbc caacabcaab acaaccbabt 
agbgagaaab caccabgagb gacgacbgaa 
bctbbccaga cbfcgtbcaac aggccagcca 
accaaaccgb babtcatbcg tgabbgcgcc 
aaaggacaab bacaaacagg aabcgaabgc 
acaababbbb caccbgaabc aggababbcb 
abcgcagtgg tgagtaacca bgcabcabca 
agaggcabaa abbccgbcag ccagbbbagb 
acgcbaccbb bgccabgbbt cagaaacaac 
tagabbgbcg caccbgabbg cccgacabba 
gcabccabgb bggaabbbaa tcgcggccbc 
abaacacccc bbgbabbacb gbbbabgbaa 
tbbbbabcbb gbgcaabgba acabcagaga 
ccccabbabb gaagcabbba bcagggbbab 
atbbagaaaa abaaacaaab aggggtbccg 
gbcbaagaaa ccatbabtab cabgacatba 
bbbcgtc 



aggcacabcc ccbbcbctgb gacacacccb 4020 
cacbcabagg acactcabag cbcaggaggg 4 080 
bbggagcggb cbcbcccbcc cbcabcagcc 4140 
gaagaaabba aagcaagaba ggcbabbaag 4200 
bgaggaagba abgagagaaa bcabagaabt 4260 
bbaabcbbcc gctbccbcgc bcacbgacbc 4320 
ggbatcagcb cacbcaaagg cggbaabacg 4380 
aaagaacabg bgagcaaaag gccagcaaaa 4440 
ggcgbbbbtc cabaggcbcc gccccccbga 4500 
gaggbggcga aacccgacag gacbabaaag 4560 
cgbgcgcbcb ccbgttccga cccbgccgcb 4620 
gggaagcgtg gcgctbbcbc abagcbcacg 4680 
bcgcbccaag cbgggctgbg bgcacgaacc 4740 
cggbaacbab cgbcbbgagt ccaacccggb 4800 
cacbggbaac aggabbagca gagcgaggba 4860 
gbggccbaac bacggcbaca cbagaagaac 4 920 
agbbaccbtc ggaaaaagag tbggbagcbc 4980 
cggbggbbtt bbtgtbbgca agcagcagab 5040 
bccbbbgabc bbtbcbacgg ggbcbgacgc 5100 
bbbggbcabg agabbatcaa aaaggatcbb 5160 
bbbbaaabca abcbaaagba batabgagba 5220 
cagbgaggca ccbabctcag cgabcbgbcb 5280 
9999999999 cgcbgaggbc tgccbcgbga 5340 
aabcgcccca bcabccagcc agaaagbgag 5400 
ggbggaccag tbggbgabbb bgaactbbbg 5460 
aagabgcgbg abcbgabccb bcaactcagc 5520 
bcccgbcaag bcagcgbaab gcbcbgccag 5580 
aaaaacbcab cgagcabcaa abgaaacbgc 5640 
babtbbtgaa aaagccgbbb cbgbaabgaa 5700 
abggcaagab cctggbabcg gbcbgcgabb 5760 
aabtbcccct cgbcaaaaab aaggbbabca 5820 
bccggbgaga abggcaaaag cbbabgcabb 5880 
bbacgcbcgb cabcaaaabc acbcgcabca 5940 
tgagcgagac gaaabacgcg atcgcbgbta 6000 
aaccggcgca ggaacactgc cagcgcatca 6060 
bcbaabaccb ggaabgctgb bbbcccgggg 6120 
ggagbacgga taaaabgcbb gabggbcgga 6180 
cbgaccabcb cabcbgtaac abcabbggca 6240 
bcbggcgcab cgggcbbccc abacaabcga 6300 
bcgcgagccc abbbataccc abataaabca 6360 
gagcaagacg bbbcccgbtg aababggcbc 6420 
gcagacagbt bbabbgbbca bgatgababa 6480 
btbbgagaca caacgtggcb bbcccccccc 6540 
bgbcbcabga gcggabacab abbbgaabgb 6600 
cgcacabbbc cccgaaaagb gccaccbgac 6660 
accbabaaaa abaggcgbab cacgaggccc 6720 

6727 



<210> 61 
<211> 6730 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> plasmid pVR1012x/s contain: 
<400> 61 

bcgcgcgbtb cggtgatgac ggtgaaaacc 
cagcttgtct gtaagcggat gccgggagca 
btggcgggbg tcggggcbgg cbtaacbatg 
accatatgcg gtgtgaaata ccgcacagat 
ctattggcca ttgcatacgt tgtatccata 
tccaacatta ccgccatgtt gacabtgatt 
ggggtcatta gtbcabagcc catatatgga 
cccgcctggc tgaccgccca acgacccccg 
catagtaacg ccaataggga ctttccattg 
hgcccacttg gcagtacabc aagtgtatca 
tgacggtaaa tggcccgcct ggcattatgc 
ttggcagtac abctacgtat tagtcatcgc 
catcaatggg cgtggatagc ggtttgacbc 
cgtcaatggg agbtbgbtbb ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agctcgttta gtgaaccgtc agatcgcctg 
bagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagtgacg taagtaccgc 
bcttatgcab gctatactgt tbbtggcttg 
haggtgatgg tatagcttag cctafcaggtg 
tattggtgac gabacbttcc abtacbaatc 
tabtggcbat atgccaatac tctgtccttc 
ggatggggtc ccattbatba tttacaaatt 
cgcagttttt attaaacaba gcgtgggatc 
catgggctct tctccggtag cggcggagct 
agcggctcat ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagabtg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gbcgbcgaca cgtgtgatca 
aggagaagta ccagcaccbg tggcgctggg 
tgctgatgat ctgcagcgcc accgagaagc 
tgbggaagga ggccaccacc accctgctct 
aggtgcacaa cgtgtgggcc acccacgcct 
tggtgctggt gaacgtgacc gagaacbtcg 
tgcacgagga catcatcagc cbgtgggacc 
ccctgtgcgt gagcctgaag tgcaccgacg 
ccagcatccg cggcaaggtg cagaaggagt 
ccatcgacaa cgacaccacc agctacagcc 
aggcctgccc caaggtgagc ttcgagccca 
tcgccatcct gaagtgcaag gacaagaagt 
gcaccgtgca gtgcacccac ggcatccgcc 
gtagccbggc cgaggaggag gtggbgatcc 
tgatcatcgt gcagcbgaac gagagcgtgg 
cccgcaagag catccacatc ggccccggcc 
gcgacatccg ccaggcccac tgcaacctga 
agatcgbgat caagctgcgc gagcagbbcg 
gcggcggcga ccccgagatc gtgacccaca 
gcaacagcgc bagccbgccc bgccgcatca 



ng HIV genes 

tcbgacacat gcagcbcccg gagacggtca 60 
gacaagcccg tcagggcgcg tcagcgggtg 120 
cggcatcaga gcagattgta ctgagagtgc 180 
gcgtaaggag aaaataccgc atcagafctgg 240 
tcabaatatg tacatttaba ttggcbcatg 300 
attgactagt tatbaatagt aatcaabtac 360 
gttccgcgtt acabaactta cggtaaatgg 420 
cccattgacg tcaataatga cgtabgtbcc 480 
acgtcaatgg gbggagbabb tacggbaaac 54 0 
tatgccaagb acgcccccba ttgacgtcaa 600 
ccagtacatg accttatggg acttbcctac 660 
tattaccabg gtgabgcggb tttggcagta 720 
acggggabbb ccaagtcbcc accccattga 780 
tcaacgggac bttccaaaat gtcgtaacaa 84 0 
gcgtgtacgg bgggaggtct atataagcag 900 
gagacgccat ccacgcbgtt ttgacctcca 960 
cggccgggaa cggtgcattg gaacgcggab 102 0 
ctatagactc tataggcaca cccctttggc 1080 
gggccbabac acccccgctt ccttatgcta 1140 
bgggtbabtg accatbatbg accactcccc 1200 
cabaacatgg cbcbbbgcca caacbatcbc 1260 
agagactgac acggacbcbg tattbttaca 1320 
cacatataca acaacgccgt cccccgtgcc 1380 
tccacgcgaa tcbcgggtac gbgtbccgga 1440 
tccacabccg agccctggtc ccabgccbcc 1500 
ctcctaacag tggaggccag acttaggcac 1560 
cacaaggccg tggcggtagg gtabgtgtct 1620 
gctgacgcag atggaagact baaggcagcg 1680 
gtatbcbgat aagagbcaga ggbaactccc 1740 
gtagbctgag cagbactcgt tgctgccgcg 1800 
aacagactgb tcctbtccat gggbcbttbc 1860 
gatatcgcgg ccgcbcbaga abgcgcgtga 192 0 
gctggcgcbg gggcaccatg ctgctgggca 1980 
tgtgggtgac cgtgtactac ggcgtgcccg 2040 
gcgccagcga cgccaaggcc tacgacaccg 2100 
gcgtgcccac cgaccccaac ccccaggagg 2160 
acatgbggaa gaacgacatg gbggagcaga 2220 
agagcctgaa gcccbgcgtg aagctgaccc 2280 
cbagcaagaa ctgcagcbbc aacatcagca 2340 
acgccttctb ctacaagctg gacatcatcc 2400 
bgaccagcbg caacaccagc gbgatcaccc 2460 
bccccaacca cbacbgcgcc cccgccggcb 2520 
bcaacggcaa gggcccctgc accaacgbga 2580 
ccgbggtgag cacccagcbg ctgcbgaacg 2640 
gcagcgcbaa cbtcgccgac aacgccaagg 2700 
agatcaacbg cacccgcccc aacaacaaca 2760 
gcgccttcba caccaccggc gagatcabcg 2820 
gccgcgccaa gbggaacgac accctgaaca 2880 
gcaacaagac cabcgbgbbc aagcacagca 2940 
gcbbcaacbg cggcggcgag ttcttctact 3000 
agcagabcat caacatgbgg cagaaggtgg 3060 
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gcaaggccab gbacgccccc cccatcagcg gccagahccg ctgcagcagc aacatcaccg 3120 
gcctgctgct gacccgcgac ggcggcaaca gcaacaacga gagcgagatc tbccgbccgg 3180 
gcggcggcga catgcgcgac aactggcgca gcgagctgta caagtacaag gtggtgaaga 3240 
tcgagcccct gggcgtggcc cccaccaagg ccaagcttac cgtccaggcc cgccagctgc 3300 
bgagcggcat cgtgcagcag cagaacaacc bgcbgcgcgc cabcgaggcc cagcagcacc 3360 
tgctgcagct gaccgtgtgg ggcabcaagc agcbgcaggc ccgcaccctg gccgtggagc 3420 
gctacctgaa ggaccagcag cagctcgagc agabcbggaa ccacaccacc tggatggagt 34 80 
gggaccgcga gatcaacaac bacaccagcc tgatccacag cctgatcgag gagagccaga 3540 
accagcagga gaagaacgag caggagctgc tggagctgga caagtgggcc agcctgtgga 3 600 
actggttcaa catcaccaac tggctgtggt acatcaagct gttcatcatg atcgtgggcg 3660 
gcctggtggg cctgcgcatc gtgttcgccg tgctgagcat ctgaggatcc agatctgctg 3720 
tgccttctag ttgccagcca tcbgbtgbbb gcccctcccc cgbgccbbcc bbgacccbgg 3780 
aaggtgccac bcccacbgbc cbbbccbaat aaaatgagga aattgcatcg cabbgbcbga 3840 
gtaggtgtca bbctabbcbg gggggtgggg tggggcagga cagcaagggg gaggattggg 3900 
aagacaatag caggcatgct ggggatgcgg bgggctctat gggtacccag gtgctgaaga 3 960 
attgacccgg btcctcctgg gccagaaaga agcaggcaca tccccttctc tgtgacacac 4020 
cctgtccacg cccctggttc ttagttccag ccccactcat aggacactca bagcbcagga 4080 
gggctccgcc ttcaatccca cccgctaaag bacbbggagc ggtctctccc tccctcatca 4140 
gcccaccaaa ccaaacctag cctccaagag bgggaagaaa ttaaagcaag abaggctabb 4200 
aagtgcagag ggagagaaaa tgcctccaac atgtgaggaa gtaatgagag aaatcataga 4260 
attttaaggc catgatttaa ggccatcatg gccttaatct tccgcttcct cgctcactga 4320 
cbcgcbgcgc tcggtcgttc ggctgcggcg agcggtatca gcbcacbcaa aggcggtaab 4380 
acggbbabcc acagaabcag gggabaacgc aggaaagaac abgbgagcaa aaggccagca 444 0 
aaaggccagg aaccgbaaaa aggccgcgbb gcbggcgbbt tbccataggc bccgcccccc 4500 
bgacgagcat cacaaaaabc gacgctcaag bcagaggbgg cgaaacccga caggactaba 4560 
aagabaccag gcgbbtcccc cbggaagcbc ccbcgbgcgc bcbccbgbbc cgacccbgcc 4620 
gcbbaccgga baccbgbccg cctbbcbccc bbcgggaagc gbggcgcbbb cbcabagcbc 4680 
acgcbgbagg babcbcagbb cggbgbaggb cgbbcgcbcc aagcbgggcb gbgbgcacga 4740 
accccccgbb cagcccgacc gcbgcgccbb abccggbaac tabcgbcbtg agbccaaccc 4800 
ggbaagacac gacbtabcgc cacbggcagc agccacbggb aacaggabba gcagagcgag 4 860 
gbatgbaggc ggbgcbacag agbbctbgaa gbggbggccb aacbacggcb acacbagaag 4920 
aacagbabbb ggbatctgcg cbcbgcbgaa gccagbbacc ttcggaaaaa gagbbggbag 4980 
cbcbbgabcc ggcaaacaaa ccaccgcbgg bagcggtggb bbbbbtgtbb gcaagcagca 5040 
gabbacgcgc agaaaaaaag gabcbcaaga agabccbbbg abctbbbcba cggggbcbga 5100 
cgcbcagbgg aacgaaaacb cacgtbaagg gabbbbggbc abgagabtab caaaaaggab 5160 
cbtcaccbag abccbtbbaa abbaaaaabg aagbtbbaaa bcaabcbaaa gbabababga 5220 
gbaaacbtgg bcbgacagbb accaatgcbb aabcagbgag gcaccbabcb cagcgabcbg 5280 
bcbatbbcgb bcabccabag bbgccbgacb cggggggggg gggcgctgag gbctgccbcg 534 0 
bgaagaaggb gbbgcbgacb cabaccaggc cbgaabcgcc ccabcabcca gccagaaagb 54 00 
gagggagcca cggbbgabga gagcbbtgbb gtaggbggac cagbbggbga bbbbgaactb 5460 
bbgcbbbgcc acggaacggb cbgcgbbgbc gggaagabgc gbgabcbgat ccbbcaactc 5520 
agcaaaagbb cgabbbabbc aacaaagccg ccgbcccgbc aagbcagcgt aabgcbcbgc 5580 
cagbgbbaca accaatbaac caabbcbgab bagaaaaacb cabcgagcab caaabgaaac 5640 
bgcaabbbab bcababcagg abbabcaaba ccababbbbb gaaaaagccg bbbcbgbaab 5700 
gaaggagaaa acbcaccgag gcagbbccab aggabggcaa gabccbggba bcggbcbgcg 5760 
abbccgacbc gbccaacabc aabacaaccb abbaabbbcc ccbcgbcaaa aabaaggbba 5820 
bcaagbgaga aabcaccatg agbgacgact gaabccggbg agaatggcaa aagcbbabgc 58 80 
atbbcbbbcc agacbbgbbc aacaggccag ccabbacgcb cgbcabcaaa abcacbcgca 5940 
bcaaccaaac cgbbabbcat bcgbgatbgc gccbgagcga gacgaaabac gcgabcgcbg 6000 
bbaaaaggac aabbacaaac aggaabcgaa bgcaaccggc gcaggaacac tgccagcgca 6060 
bcaacaabat bbbcaccbga abcaggabab bcbtctaaba ccbggaabgc tgbbbbcccg 6120 
gggabcgcag tggbgagbaa ccabgcabca bcaggagbac ggabaaaabg cbbgabggbc 6180 
ggaagaggca baaabbccgb cagccagbbb agbcbgacca bcbcatcbgb aacabcabbg 6240 
gcaacgcbac cbbbgccabg bbbcagaaac aactcbggcg cabcgggcbb cccabacaab 6300 
cgabagabbg bcgcaccbga bbgcccgaca bbabcgcgag cccabbtaba cccababaaa 6360 
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tcagcatcca tgttggaatt taatcgcggc ctcgagcaag acgtttcccg tbgaatatgg 6420 

ctcataacac cccttgtatt actgtttatg taagcagaca gbtttabbgt tcatgatgat 6480 

atabbbttat cbtgbgcaat gtaacatcag agattttgag acacaacgtg gctthccccc 6540 

cccccccatt attgaagcat fctatcagggt tatbgtcbca tgagcggata catatbtgaa 6600 

bgbatttaga aaaataaaca aataggggtt ccgcgcacat tbccccgaaa agtgccacct 6660 

gacgtctaag aaaccattat tatcatgaca tbaaccbaba aaaataggcg tatcacgagg 6720 

cccbttcgtc 6730 



<210> 62 
<211> 6688 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 62 

bcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgbabccata tcataatatg tacatbtata ttggctcatg 300 
bccaacabta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgbt acataactta cggfcaaatgg 420 
cccgcctggc bgaccgccca acgacccccg cccattgacg bcaabaatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatb tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcatbabgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc batbaccabg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgbcaabggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
cbccgcccca bbgacgcaaa tgggcggtag gcghgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgccbg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgab ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc cbatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tatbggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagtttbt attaaacata gcgtgggatc tccacgcgaa tctcgggfcac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcafc ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gfcggagatbg ggctcgcacg gcbgacgcag atggaagact baaggcagcg 1600 
gcagaagaag abgcaggcag ctgagbtgtb gbabtcbgat aagagtcaga ggbaactccc 1740 
gttgcggbgc tgtbaacggb ggagggcagt gtagtctgag cagtacbcgb tgctgccgcg 1800 
cgcgccacca gacataatag cbgacagacb aacagacbgt bccbtbccat gggbcttttc 1860 
tgcagtcacc gbcgtcgaca cgtgtgabca gatatcgcgg ccgctcbaga abgcgcgtga 1920 
aggagaagta ccagcaccbg bggcgcbggg gcbggcgcbg gggcaccatg cbgcbgggca 1980 
tgctgatgat ctgcagcgcc accgagaagc tgbgggtgac cgbgtacbac ggcgtgcccg 2040 
tgtggaagga ggccaccacc acccbgcbct gcgccagcga cgccaaggcc tacgacaccg 2100 
aggbgcacaa cgtgtgggcc acccacgcct gcgbgcccac cgaccccaac ccccaggagg 2160 
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tggtgctggt gaacgtgacc gagaacttcg acatgtggaa gaacgacatg gtggagcaga 2220 
tgcacgagga catcatcagc ctgtgggacc agagcctgaa gccctgcgtg aagctgaccc 2280 
ccctgtgcgt: gagcctgaag tgcaccgacc tgaagaacga caccaacacc aacagcagca 2340 
gcggccgcat gatcatggag aagggcgaga tcaagaactg cagcttcgct agcaccagct 2400 
gcaacaccag cgtgatcacc caggcctgcc ccaaggtgag cttcgagccc atccccaacc 2460 
actactgcgc ccccgccggc ttcgccatcc tgaagtgcaa ggacaagaag ttcaacggca 2520 
agggcccctg caccaacgtg agcaccgtgc agfcgcaccca cggcatccgc cccgtggtga 2580 
gcacccagct gctgctgaac ggtagcctgg ccgaggagga ggfcggtgatc cgcagcgcba 264 0 
acttcgccga caacgccaag gtgatcatcg tgcagctgaa cgagagcgtg gagatcaact 2700 
gcacccgcgc tagcgcccac tgcaacctga gccgcgccaa gtggaacgac accctgaaca 2760 
agatcgtgat caagctgcgc gagcagttcg gcaacaagac catcgtgttc aagcacagca 2820 
gcggcggcga ccccgagatc gtgacccaca gcttcaactg cggcggcgag ttcttctact 2880 
gcaacagcac ccagctgttc aacagcacct ggttcaacag cacctggagc accgagggca 2940 
gcaacaacac cgagggcagc gacaccatca ccctgccctg ccgcatcaag cagatcatca 3000 
acatgtggca gaaggtgggc aaggccatgt acgccccccc catcagcggc cagatccgct 3060 
gcagcagcaa catcaccggc ctgctgctga cccgcgacgg cggcaacagc aacaacgaga 3120 
gcgagatctt ccgtccgggc ggcggcgaca tgcgcgacaa ctggcgcagc gagctgtaca 3180 
agtacaaggt ggtgaagatc gagcccctgg gcgfcggcccc caccaaggcc aagcttaccg 324 0 
tccaggcccg ccagctgctg agcggcatcg tgcagcagca gaacaacctg ctgcgcgcca 3300 
tcgaggccca gcagcacctg ctgcagctga ccgtgtgggg catcaagcag ctgcaggccc 3360 
gcaccctggc cgtggagcgc tacctgaagg accagcagca gctcgagcag atctggaacc 3420 
acaccacctg gatggagtgg gaccgcgaga tcaacaacta caccagccfcg atccacagcc 34 80 
tgatcgagga gagccagaac cagcaggaga agaacgagca ggagctgctg gagctggaca 354 0 
agtgggccag ccfcgtggaac tggttcaaca tcaccaactg gctgtggtac atcaagctgt 3600 
tcatcatgat cgtgggcggc ctggtgggcc tgcgcatcgt gttcgccgtg ctgagcatct 3660 
gaggatccag atctgctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg 3720 
tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa 3780 
ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcaggaca 3840 
gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg 3900 
gtacccaggt gctgaagaat fcgacccggtt cctcctgggc cagaaagaag caggcacatc 3960 
cccttctctg tgacacaccc tgtccacgcc cchggttctt agttccagcc ccactcatag 4020 
gacactcata gctcaggagg gctccgcctt caatcccacc cgctaaagta cttggagcgg 4080 
tctctccctc cctcatcagc ccaccaaacc aaacctagcc tccaagagtg ggaagaaatt 4140 
aaagcaagat aggctatfcaa gtgcagaggg agagaaaatg cctccaacat gfcgaggaagt 42 00 
aatgagagaa atcafcagaat ttfcaaggcca tgatttaagg ccatcatggc cthaatcttc 4260 
cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 4320 
tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 4380 
gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 444 0 
ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 4500 
aaacccgaca ggachataaa gataccaggc gtthccccct ggaagctccc tcgtgcgctc 4560 
tcctgttccg acccfcgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 4620 
ggcgctttct catagctcac gctgtaggha tctcagttcg gtgbaggtcg ttcgctccaa 4680 
gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 4740 
tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 4 800 
caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 4 860 
cfcacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagthacctt 4920 
cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 4 980 
ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 5040 
cttttctacg gggtctgacg cbcagtggaa cgaaaactca cgttaaggga ttttggtcat 5100 
gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc 5160 
aatctaaagt atatatgagt aaacttggfcc tgacagttac caatgcttaa tcagtgaggc 5220 
acctatcfcca gcgatctgtc tatttcgttc atccatagtt gcctgactcg gggggggggg 52 80 
gcgcfcgaggt ctgcctcgtg aagaaggtgt tgctgactca taccaggcct gaatcgcccc 5340 
afccatccagc cagaaagtga gggagccacg gttgatgaga gcbfctgttgt aggtggacca 54 00 
gfctggtgatt ttgaactttt gctttgccac ggaacggtct gcgttgtcgg gaagatgcgt 5460 



- 147 - 



WO 02/32943 



PCT/US01/25721 



gatctgatcc ttcaactcag caaaagttcg atttattcaa caaagccgcc gtcccgtcaa 5520 
gtcagcgtaa tgcbctgcca gtgttacaac caattaacca attctgatta gaaaaactca 5580 
tcgagcatca aatgaaactg caatttattc atatcaggat tatcaatacc atatttttga 5640 
aaaagccgtt tctgtaatga aggagaaaac tcaccgaggc agttccatag gatggcaaga 5700 
tcctggtatc ggtctgcgat tccgactcgt ccaacatcaa tacaacctat taatttcccc 5760 
tcgtcaaaaa taaggttatc aagtgagaaa tcaccatgag tgacgactga atccggtgag 5820 
aatggcaaaa gcttatgcat ttctttccag acttgttcaa caggccagcc atfcacgctcg 5880 
tcatcaaaat cactcgcatc aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga 594 0 
cgaaatacgc gatcgctgtt aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc 6000 
aggaacactg ccagcgcatc aacaatattt tcacctgaat caggatattc ttctaahacc 6060 
tggaatgctg ttttcccggg gatcgcagtg gtgagtaacc abgcatcatc aggagtacgg 6120 
ataaaatgct tgatggtcgg aagaggcata aattccgtca gccagtttag tctgaccatc 6180 
tcatctgtaa catcattggc aacgctacct ttgccatgtt tcagaaacaa ctctggcgca 6240 
ccgggcttcc catacaatcg atagattgtc gcacctgatt gcccgacatt atcgcgagcc 6300 
catttatacc catataaatc agcatccatg ttggaattta atcgcggcct cgagcaagac 6360 
gtttcccgtt gaatatggct cataacaccc cttgtattac tgtttatgta agcagacagt 6420 
tttattgttc atgatgatat atttttatct tgtgcaatgt aacatcagag attttgagac 6480 
acaacgtggc tttccccccc cccccattat tgaagcattt atcagggtta ttgtctcatg 6540 
agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt 6600 
ccccgaaaag tgccacctga cgtctaagaa accattatta tcatgacatt aacctataaa 6660 
aataggcgta tcacgaggcc ctttcgtc 6688 



<210> 63 
<211> 6691 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 63 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
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ggabggggbc ccatttatta tttacaaabt cacatataca acaacgccgt cccccgtgcc 1380 . 

cgcagttttt abbaaacaba gcgtgggabc tccacgcgaa tctcgggtac gtgttccgga 1440 

cabgggcbcb bcbccggbag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag abgcaggcag ctgagttgtt gbatbcbgab aagagtcaga ggtaactccc 1740 

gttgcggtgc bgbbaacggb ggagggcagb gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacabaabag ctgacagact aacagactgt bccbbtccat gggtcttttc 1860 

bgcagbcacc gtcgbcgaca cgtgtgatca gababcgcgg ccgctctaga atgcgcgtga 1920 

aggagaagta ccagcacctg tggcgctggg gctggcgctg gggcaccatg ctgctgggca 1980 

tgctgatgat ctgcagcgcc accgagaagc tgtgggtgac cgtgtactac ggcgtgcccg 2040 

bgbggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc tacgacaccg 2100 

aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 

bggbgcbggb gaacgtgacc gagaacttcg acatgtggaa gaacgacabg gtggagcaga 2220 

tgcacgagga cabcabcagc cbgbgggacc agagccbgaa gccctgcgtg aagctgaccc 22 80 

cccbgbgcgb gagcctgaag tgcaccgacc tgaagaacga caccaacacc aacagcagca 2340 

gcggccgcat gabcabggag aagggcgaga tcaagaactg cagcttcgct agcaccagct 24 00 

gcaacaccag cgtgatcacc caggcctgcc ccaaggtgag cttcgagccc atccccaacc 24 60 

actactgcgc ccccgccggc ttcgccatcc tgaagtgcaa ggacaagaag bbcaacggca 2520 

agggcccctg caccaacgtg agcaccgtgc agtgcaccca cggcatccgc cccgtggtga 2580 

gcacccagct gctgctgaac ggtagcctgg ccgaggagga ggbggbgabc cgcagcgcta 264 0 

acttcgccga caacgccaag gtgatcatcg tgcagctgaa cgagagcgtg gagatcaact 2700 

gcacccgccc caacaacaac acccgcaaga gcatccacat cggccccggc cgcgccttct 2760 

acaccaccgg cgagatcatc ggcgacatcc gccaggccca ctgcaacctg agccgcgcca 2820 

agtggaacga caccctgaac aagabcgbga tcaagctgcg cgagcagttc ggcaacaaga 2880 

ccabcgbgbb caagcacagc agcggcggcg accccgagab cgbgacccac agcttcaact 2940 

gcggcggcga gttcttctac tgcaacagcg ctagcctgcc ctgccgcatc aagcagatca 3000 

tcaacatgtg gcagaaggtg ggcaaggcca tgtacgcccc ccccabcagc ggccagatcc 3060 

gctgcagcag caacatcacc ggcctgcbgc bgacccgcga cggcggcaac agcaacaacg 3120 

agagcgagat cbbccgbccg ggcggcggcg acatgcgcga caactggcgc agcgagctgt 3180 

acaagtacaa ggbggbgaag atcgagcccc bgggcgtggc ccccaccaag gccaagctta 3240 

ccgtccaggc ccgccagctg ctgagcggca bcgbgcagca gcagaacaac ctgctgcgcg 33 00 

ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag cagctgcagg 3360 

cccgcaccct ggccgbggag cgctacctga aggaccagca gcagctcgag cagatctgga 3420 

accacaccac cbggabggag bgggaccgcg agatcaacaa ctacaccagc cbgatccaca 34 80 

gcctgatcga ggagagccag aaccagcagg agaagaacga gcaggagctg ctggagctgg 3540 

acaagtgggc cagcctgtgg aactggttca acabcaccaa cbggcbgtgg bacatcaagc 3600 

tgbbcabcab gabcgbgggc ggcctggbgg gccbgcgcab cgbgbbcgcc gbgcbgagca 3660 

bcbgaggabc cagabctgcb gbgccbtcba gbtgccagcc atcbgbbgbb tgccccbccc 3720 

ccgbgccbbc cbbgacccbg gaaggtgcca ctcccactgb ccbbtccbaa taaaabgagg 3780 

aaatbgcabc gcabtgtcbg agbaggtgtc attcbattcb ggggggbggg gbggggcagg 3840 

acagcaaggg ggaggabtgg gaagacaata gcaggcabgc bggggatgcg gtgggcbcba 3900 

bgggbaccca ggbgctgaag aatbgacccg gbtcctccbg ggccagaaag aagcaggcac 3960 

atcccctbcb cbgbgacaca cccbgbccac gccccbggbb cttagbtcca gccccacbca 4020 

baggacactc abagctcagg agggcbccgc cbtcaabccc acccgctaaa gbacbtggag 4 080 

cggbcbcbcc cbcccbcabc agcccaccaa accaaaccba gccbccaaga gbgggaagaa 4140 

attaaagcaa gabaggcbab taagbgcaga gggagagaaa atgccbccaa cabgbgagga 4200 

agbaabgaga gaaabcabag aabbbtaagg ccabgatbba aggccabcab ggcctbaabc 4260 

ttccgcbbcc bcgctcacbg acbcgctgcg cbcggbcgbb cggcbgcggc gagcggbabc 4320 

agcbcacbca aaggcggbaa bacggbbatc cacagaabca ggggabaacg caggaaagaa 4380 

cabgbgagca aaaggccagc aaaaggccag gaaccgbaaa aaggccgcgt tgcbggcgbt 4440 

bbbccabagg cbccgccccc ctgacgagca bcacaaaaat cgacgctcaa gbcagaggbg 4500 

gcgaaacccg acaggacbab aaagabacca ggcgbbbccc ccbggaagcb cccbcgbgcg 4560 

cbctccbgbb ccgacccbgc cgcbbaccgg ataccbgbcc gccbbbcbcc cbbcgggaag 4520 
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cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg bcgtbcgcbc 4680 
caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 4740 
cbabcgbctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 4800 
taacaggatt agcagagcga ggbatgbagg cggbgctaca gagbtcbbga agtggtggcc 4860 
taactacggc tacactagaa gaacagtatt bggbabcbgc gctctgctga agccagttac 4920 
cttcggaaaa agagbbggba gcbcbbgabc cggcaaacaa accaccgctg gtagcggtgg 4980 
tbbtbbbgbt tgcaagcagc agabbacgcg cagaaaaaaa ggatcbcaag aagatccttt 5040 
gatcbbbtct acggggtctg acgctcagtg gaacgaaaac bcacgfcbaag ggathttggfc 5100 
catgagatta tcaaaaagga tcttcaccta gabccbttba aabtaaaaat gaagbbbbaa 5160 
atcaatctaa agbabababg agbaaacbbg gtctgacagt taccaatgct taatcagtga 5220 
ggcacctatc tcagcgatct gtctatttcg ttcatccata gttgcctgac tcgggggggg 5280 
9999C9Ctga ggtctgcctc gtgaagaagg tgttgctgac tcataccagg ccbgaabcgc 5340 
cccatcatcc agccagaaag tgagggagcc acggttgatg agagctbbgt tgtaggbgga 54 00 
ccagttggtg attttgaact ttbgctttgc cacggaacgg bcbgcgbbgt cgggaagatg 5460 
cgtgatctga tccttcaact cagcaaaagt tcgabttabt caacaaagcc gccgbcccgb 5520 
caagtcagcg taatgctctg ccagtgttac aaccaattaa ccaattctga ttagaaaaac 5580 
tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat accatatttt 5640 
tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca taggatggca 5700 
agabccbggb atcggtctgc gabtccgacb cgtccaacat caatacaacc babbaatbbc 5760 
ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac tgaatccggt 5820 . 
gagaabggca aaagcbtatg cabttctttc cagacttgbt caacaggcca gccattacgc 5880 
bcgbcatcaa aatcactcgc atcaaccaaa ccgbtattca tbcgtgatbg cgcctgagcg 5940 
agacgaaata cgcgabcgct gbbaaaagga caabtacaaa caggaatcga atgcaaccgg 6000 
cgcaggaaca ctgccagcgc atcaacaata tbbbcaccbg aabcaggata bbcttcbaab 6060 
accbggaatg ctgttbbccc ggggatcgca gbggtgagba accatgcabc atcaggagta 6120 
cggabaaaab gcbtgabggt cggaagaggc abaaabtccg bcagccagbb bagbcbgacc 6180 
abcbcabctg baacabcabb ggcaacgcba ccbbtgccab gttbcagaaa caacbcbggc 6240 
gcabcgggcb bcccabacaa bcgabagabt gbcgcaccbg abtgcccgac abbabcgcga 6300 
gcccabbtab acccababaa abcagcatcc abgbbggaab bbaabcgcgg ccbcgagcaa 6360 
gacgbbbccc gttgaabatg gctcabaaca cccctbgtab bactgbbbab gbaagcagac 6420 
agbbbbatbg ttcabgatga batabbttba tcbbgbgcaa tgtaacatca gagabttbga 6480 
gacacaacgb ggcttbcccc ccccccccab tabtgaagca bbtatcaggg bbabtgtcbc 654 0 
abgagcggab acatabbbga abgtabbtag aaaaataaac aaabaggggb bccgcgcaca 6600 
tbbccccgaa aagtgccacc tgacgbctaa gaaaccatba bbabcabgac abtaaccbab 6660 
aaaaataggc gtatcacgag gcccbtbcgb c 6691 



<210> 64 
<211> 6706 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 64 

bcgcgcgbbb cggbgatgac ggtgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcttgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbtaacbabg cggcabcaga gcagabbgba ctgagagbgc 180 
accababgcg gtgtgaaaba ccgcacagab gcgbaaggag aaaabaccgc atcagatbgg 240 
ctattggcca ttgcabacgb tgbabccaba bcabaatabg bacabbtaba btggcbcabg 300 
bccaacabba ccgccatgbt gacabbgabb abbgactagt babbaabagb aatcaabbac 360 
ggggbcabba gbtcabagcc catababgga gbbccgcgbb acabaactba cggbaaabgg 420 
cccgccbggc tgaccgccca acgacccccg cccabbgacg bcaabaatga cgbabgtbcc 4 80 
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catagtaacg ccaataggga cbbbccattg acgtcaatgg gbggagbabb bacggbaaac 540 
tgcccacttg gcagtacatc aagbgbatca tatgccaagt acgcccccha ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
tbggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt bbbggcagba 720 
catcaatggg cgbggabagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgbcaabggg agbbtgbbbb ggcaccaaaa tcaacgggac tttccaaaah gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt tbgaccbcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggbgcabbg gaacgcggab 1020 
bccccgbgcc aagagtgacg taagtaccgc cbatagacbc babaggcaca cccctttggc 1080 
tcttatgcat gcbabactgb ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg bgggbbabtg accabbabtg accactcccc 1200 
tattggtgac gatactttcc abbactaatc cabaacabgg cbcbbtgcca caactatctc 1260 
tabtggctat atgccaahac tctgtccttc agagacbgac acggactctg tatttbtaca 1320 
ggabggggtc ccabttabta bbbacaaabb cacatabaca acaacgccgt cccccgbgcc 1380 
cgcagbtbbt attaaacaba gcgtgggabc tccacgcgaa tctcgggbac gtgbbccgga 1440 
cabgggctct bcbccggtag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
agcggcbcab ggtcgcbcgg cagctccbbg cbccbaacag tggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg tggcggbagg gtatgbgbcb 1620 
gaaaabgagc gbggagabbg ggctcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgtb gtabbcbgat aagagbcaga ggbaacbccc 1740 
gbbgcggbgc tgbbaacggb ggagggcagb gbagbcbgag cagbacbcgb bgcbgccgcg 1800 
cgcgccacca gacabaatag cbgacagacb aacagacbgt bccbbbccat gggbcbbtbc 1860 
bgcagbcacc gbcgbcgaca cgtgbgabca gababcgcgg ccgcbctaga atgcgcgbga 1920 
aggagaagba ccagcacctg bggcgcbggg gcbggcgcbg gggcaccabg cbgcbgggca 1980 
tgcbgatgab cbgcagcgcc accgagaagc tgbgggbgac cgbgbacbac ggcgbgcccg 2040 
tgbggaagga ggccaccacc acccbgcbcb gcgccagcga cgccaaggcc tacgacaccg 2100 
aggbgcacaa cgbgbgggcc acccacgccb gcgbgcccac cgaccccaac ccccaggagg 2160 
tggbgctggb gaacgtgacc gagaactbcg acabgbggaa gaacgacabg gbggagcaga 2220 
tgcacgagga cabcabcagc cbgbgggacc agagccbgaa gcccbgcgbg aagcbgaccc 22 80 
cccbgbgcgb gagccbgaag bgcaccgacc tgaagaacga caccaacacc aacagcagca 2340 
gcggccgcab gatcabggag aagggcgaga bcaagaacbg cagcbbcaac abcagcacca 24 00 
gcabccgcgg caaggbgcag aaggagbacg ccbbctbcba caagcbggac abcabcccca 2460 
tcgacaacga caccaccagc bacagccbga ccagcbgcaa caccagcgbg atcacccagg 2520 
ccbgccccaa ggbgagcttc gagcccabcc ccaaccacba cbgcgccccc gccggcbtcg 2580 
ccabccbgaa gtgcaaggac aagaagbtca acggcaaggg cccctgcacc aacgtgagca 2640 
ccgbgcagbg cacccacggc abccgccccg bggbgagcac ccagcbgcbg ctgaacggba 2700 
gcctggccga ggaggaggbg gbgabccgca gcgctaacbb cgccgacaac gccaaggfcga 2760 
bcabcgbgca gcbgaacgag agcgbggaga bcaacbgcac ccgcgcbagc gcccactgca 2820 
accbgagccg cgccaagbgg aacgacaccc bgaacaagab cgbgabcaag cbgcgcgagc 2880 
agbbcggcaa caagaccabc gtgbbcaagc acagcagcgg cggcgacccc gagatcgbga 2940 
cccacagcbb caactgcggc ggcgagbbcb bcbacbgcaa cagcgcbagc cbgcccbgcc 3000 
gcabcaagca gabcatcaac atgtggcaga aggbgggcaa ggccabgbac gcccccccca 3060 
tcagcggcca gabccgctgc agcagcaaca bcaccggccb gcbgcbgacc cgcgacggcg 3120 
gcaacagcaa caacgagagc gagabcbbcc gbccgggcgg cggcgacabg cgcgacaacb 3180 
ggcgcagcga gcbgbacaag bacaaggbgg bgaagabcga gccccbgggc gbggccccca 3240 
ccaaggccaa gcbbaccgbc caggcccgcc agcbgctgag cggcabcgbg cagcagcaga 33 00 
acaaccbgcb gcgcgccatc gaggcccagc agcaccbgcb gcagcbgacc gbgbggggca 3360 
bcaagcagcb gcaggcccgc accctggccg bggagcgcba ccbgaaggac cagcagcagc 3420 
tcgagcagab cbggaaccac accaccbgga bggagbggga ccgcgagatc aacaacbaca 34 80 
ccagccbgab ccacagccbg abcgaggaga gccagaacca gcaggagaag aacgagcagg 3540 
agcbgcbgga gcbggacaag bgggccagcc bgtggaacbg gtbcaacabc accaacbggc 3 600 
bgbggbacab caagcbgbbc atcatgatcg tgggcggccb ggbgggccbg cgcabcgbgb 3660 
tcgccgbgcb gagcatcbga ggatccagab cbgcbgbgcc bbcbagbbgc cagccabcbg 3720 
btgtbbgccc cbcccccgbg ccbbcctbga cccbggaagg bgccactccc acbgbccbbt 3780 
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ccbaataaaa tgaggaaatt gcatcgcabb gtctgagtag gbgbcabbct attctggggg 3840 
gbggggbggg gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg 3 900 
atgcggtggg ctctatgggt acccaggtgc tgaagaattg acccggttcc tcctgggcca 3960 
gaaagaagca ggcacatccc cttctctgtg acacaccctg tccacgcccc tggttcttag 4020 
bbccagcccc acbcabagga cacbcabagc tcaggagggc tccgccttca atcccacccg 4080 
ctaaagtact tggagcggtc tctccctccc tcatcagccc accaaaccaa acctagcctc 4140 
caagagtggg aagaaattaa agcaagabag gctattaagt gcagagggag agaaaatgcc 4200 
tccaacatgt gaggaagbaa tgagagaaat catagaattt taaggccatg atttaaggcc 4260 
atcatggcct taatcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 4320 
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aabcagggga 4380 
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 4440 
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcabcaca aaaatcgacg 4500 
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 4560 
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 4620 
tctcccttcg ggaagcgtgg cgcbbbcbca tagctcacgc tgtaggtatc tcagttcggt 4680 
gbaggbcgbb cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 4740 
cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgacb babcgccacb 4 800 
ggcagcagcc acbggbaaca ggattagcag agcgaggtab gtaggcggtg ctacagagtt 4 860 
cbbgaagbgg tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct 4 920 
gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 4 980 
cgctggtagc ggbggbbtbb ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 5 04 0 
bcaagaagab ccbbtgabcb tttctacggg gtctgacgct cagbggaacg aaaactcacg 5100 
bbaagggabb btggtcabga . gabbabcaaa aaggabcbbc accbagabcc bbbbaaatba 5160 
aaaatgaagt bbbaaabcaa tctaaagtat atatgagtaa acttggtctg acagttacca 5220 
abgcbbaabc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccabagtbgc 52 80 
ctgactcggg gggggggggc gctgaggtct gcctcgtgaa gaaggtgttg ctgactcata 5340 
ccaggcctga atcgccccat catccagcca gaaagtgagg gagccacggt tgatgagagc 5400 
bbbgbbgbag gbggaccagt tggtgabttt gaacttttgc tttgccacgg aacggbcbgc 5460 
gbtgbcggga agatgcgtga bcbgabccbb caactcagca aaagttcgat bbabtcaaca 5520 
aagccgccgt cccgtcaagt cagcgtaatg ctctgccagt gttacaacca attaaccaat 5580 
bcbgabbaga aaaactcatc gagcatcaaa tgaaactgca atttafctcat atcaggatta 5640 
tcaataccat atttttgaaa aagccgttfcc tgtaatgaag gagaaaacbc accgaggcag 5700 
ttccatagga tggcaagatc ctggtatcgg tctgcgattc cgactcgtcc aacatcaata 5760 
caacctatta atttcccctc gtcaaaaata aggttatcaa gtgagaaatc accatgagtg 5820 
acgactgaat ccggtgagaa tggcaaaagc ttatgcattt cfcbhccagac ttgttcaaca 5880 
ggccagccat tacgctcgtc atcaaaatca ctcgcatcaa ccaaaccgtt attcattcgt 5940 
gattgcgcct gagcgagacg aaabacgcga tcgctgttaa aaggacaatt acaaacagga 6000 
atcgaatgca accggcgcag gaacactgcc agcgcabcaa caababbbbc acctgaabca 6060 
ggabatbcbb cbaatacctg gaabgcbgbb ttcccgggga bcgcagbggb gagbaaccab 6120 
gcatcabcag gagbacggab aaaabgcbbg abggbcggaa gaggcabaaa bbccgbcagc 6180 
cagbbbagbc bgaccabcbc abcbgbaaca bcabtggcaa cgcbaccbtt gccabgbbbc 6240 
agaaacaacb cbggcgcabc gggcbbccca bacaatcgab agabtgbcgc acctgabbgc 6300 
ccgacabbab cgcgagccca bbbabaccca babaaabcag cabccabgbt ggaabbbaab 6360 
cgcggccbcg agcaagacgb bbcccgbbga ababggcbca baacaccccb bgbabtactg 6420 
bbbabgbaag cagacagbtb batbgbbcab gabgababab bbbbabcbbg bgcaatgbaa 6480 
catcagagat btbgagacac aacgbggcbt bccccccccc cccabbabbg aagcatbbab 6540 
cagggbbabb gbcbcabgag cggabacaba tbbgaabgba bbbagaaaaa baaacaaaba 6600 
ggggbbccgc gcacabbbcc ccgaaaagbg ccaccbgacg bcbaagaaac cabbabbabc 6660 
abgacabbaa ccbataaaaa baggcgbabc acgaggcccb bbcgbc 6706 



<210> 65 
<211> 6649 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 65 

tcgcgcgttb cggfcgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggbg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagabtgba ctgagagtgc 180 
accatatgcg gbgtgaaaba ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
bccaacabba ccgccatgtt gacabtgatb attgactagt tattaatagt aabcaabtac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgbt acataactta cggtaaabgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
cabagbaacg ccaataggga cbtbccabbg acgtcaatgg gtggagtabb bacggbaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccba bbgacgbcaa 600 
tgacggtaaa tggcccgcct ggcatbatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atcbacgbat tagtcatcgc tattaccatg gtgatgcggt tttggcagba 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccatbga 780 
cgbcaabggg agbbbgbbtb ggcaccaaaa tcaacgggac tthccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa bgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctabac acccccgctt ccttatgcta 1140 
taggtgatgg babagctbag cctataggtg tgggttattg accattattg accactcccc 1200 
tabbggbgac gabacbttcc abbacbaabc cataacatgg ctctttgcca caactatctc 1260 
tabtggctafc atgccaatac tctgtccttc agagacbgac acggacbcbg tattbtbaca 1320 
99 at 9999 tc ccatbtatta ttbacaaatt cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbtttt abtaaacata gcgbgggatc bccacgcgaa bctcgggbac gtgttccgga 1440 
catgggcbct tctccggtag cggcggagct tccacabccg agcccbggtc ccatgcctcc 1500 
agcggcbcab ggbcgctcgg cagcbccbbg cbccbaacag bggaggccag acttaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg tggcggbagg gbabgbgbcb 1620 
gaaaabgagc gbggagabtg ggcbcgcacg gctgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbtgtt gbattctgab aagagtcaga ggbaactccc 1740 
gbbgcggbgc bgtbaacggb ggagggcagfc gtagtctgag cagtactcgb bgctgccgcg 1800 
cgcgccacca gacabaatag cbgacagacb aacagacbgb bccbbbccab gggbcttbbc 1860 
tgcagtcacc gbcgbcgaca cgtgtgabca gabatcgcgg ccgcbcbaga abgcgcgbga 1920 
aggagaagba ccagcaccbg bggcgctggg gcbggcgcbg gggcaccabg cbgcbgggca 1980 
bgcbgabgab cbgcagcgcc accgagaagc bgbgggtgac cgtgtacbac ggcgtgcccg 2040 
tgtggaagga ggccaccacc accctgcbcb gcgccagcga cgccaaggcc tacgacaccg 2100 
aggbgcacaa cgbgtgggcc acccacgccb gcgbgcccac cgaccccaac ccccaggagg 2160 
bggbgcbggb gaacgbgacc gagaacbtcg acabgbggaa gaacgacabg gbggagcaga 2220 
bgcacgagga cabcatcagc cbgbgggacc agagcctgaa gcccbgcgbg aagcbgaccc 22 80 
cccbgtgcgb gagccbgaag bgcaccgacg ctagcaagaa ctgcagcttc aacatcagca 2340 
ccagcabccg cggcaaggtg cagaaggagt acgcctbcbb cbacaagctg gacabcabcc 2400 
ccatcgacaa cgacaccacc agctacagcc bgaccagctg caacaccagc gbgatcaccc 2460 
aggcctgccc caaggbgagc btcgagccca bccccaacca cbacbgcgcc cccgccggcb 2520 
bcgccatccb gaagbgcaag gacaagaagb bcaacggcaa gggcccctgc accaacgbga 2580 
gcaccgtgca gtgcacccac ggcabccgcc ccgbggbgag cacccagctg ctgctgaacg 2640 
gbagcctggc cgaggaggag gtggbgatcc gcagcgctaa cbbcgccgac aacgccaagg 2700 
tgabcatcgt gcagcbgaac gagagcgtgg agatcaactg cacccgcgct agcgcccacb 2760 
gcaaccbgag ccgcgccaag tggaacgaca ccctgaacaa gabcgbgatc aagcbgcgcg 2820 
agcagtbcgg caacaagacc atcgtgbtca agcacagcag cggcggcgac cccgagabcg 2880 
bgacccacag cbbcaacbgc ggcggcgagt bcbbcbacbg caacagcgcb agccbgccct 294 0 
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gccgcatcaa gcagabcabc aacatgtggc 
ccatcagcgg ccagatccgc tgcagcagca 
gcggcaacag caacaacgag agcgagatct 
acbggcgcag cgagctgbac aagtacaagg 
ccaccaaggc caagcttacc gtccaggccc 
agaacaacct gctgcgcgcc atcgaggccc 
gcabcaagca gcbgcaggcc cgcaccctgg 
agctcgagca gatctggaac cacaccacct 
acaccagcct gatccacagc ctgatcgagg 
aggagcfcgct ggagcbggac aagbgggcca 
ggctgtggta catcaagcfcg tbcabcabga 
bgbbcgccgt gcbgagcabc tgaggatcca 
ctgttgtttg cccctccccc gbgccttcct 
tttcctaata aaatgaggaa attgcatcgc 
ggggtggggt ggggcaggac agcaaggggg 
gggatgcggt gggctctatg ggtacccagg 
ccagaaagaa gcaggcacab ccccbtctcb 
tagttccagc cccactcata ggacactcat 
ccgctaaagt actbggagcg gtctctccct 
ctccaagagt gggaagaaat baaagcaaga 
gcctccaaca tgtgaggaag baabgagaga 
gccabcabgg ccttaatctt ccgcttcctc 
gctgcggcga gcggtatcag ctcactcaaa 
ggataacgca ggaaagaaca tgtgagcaaa 
ggccgcgttg cbggcgbbbb tccataggct 
acgctcaagt cagaggtggc gaaacccgac 
tggaagctcc ctcgtgcgct ctcctgttcc 
ctttctccct tcgggaagcg tggcgcbttc 
ggtgtaggtc gttcgctcca agctgggctg 
ctgcgcctta tccggtaact atcgbcbbga 
actggcagca gccactggta acaggattag 
gbbcbbgaag tggtggccta actacggcta 
bcbgctgaag ccagttacct tcggaaaaag 
caccgctggt agcggtggtt tttttgtttg 
abcbcaagaa gabccbbtga bcbbbbctac 
acgttaaggg abbbbggtca tgagattatc 
ttaaaaatga agbbbbaaab caatctaaag 
ccaabgctba atcagtgagg cacctatctc 
tgcctgactc gggggggggg ggcgctgagg 
ataccaggcc tgaatcgccc cabcabccag 
agcbbbgtbg taggtggacc agttggtgat 
bgcgbbgtcg ggaagabgcg bgabcbgatc 
acaaagccgc cgtcccgbca agbcagcgba 
aabbcbgabb agaaaaacbc abcgagcabc 
bbabcaabac cababbbbbg aaaaagccgt 
cagbbccaba ggabggcaag abccbggtab 
atacaaccba btaabbbccc cbcgbcaaaa 
gbgacgacbg aabccggbga gaabggcaaa 
acaggccagc cabbacgcbc gtcabcaaaa 
cgtgabbgcg ccbgagcgag acgaaabacg 
ggaabcgaab gcaaccggcg caggaacact 
bcaggababb cbbcbaabac cbggaatgct 
catgcabcat caggagbacg gabaaaabgc 
agccagtbba gbcbgaccab cbcabcbgba 
bbcagaaaca acbcbggcgc abcgggcbbc 



agaaggbggg caaggccabg bacgcccccc 3000 
acabcaccgg ccbgcbgcbg acccgcgacg 3060 
bccgtccggg cggcggcgac atgcgcgaca 3120 
bggbgaagab cgagcccctg ggcgtggccc 3180 
gccagctgcb gagcggcabc gbgcagcagc 3240 
agcagcacct gcbgcagcbg accgtgbggg 3300 
ccgbggagcg cbaccbgaag gaccagcagc 3360 
ggatggagbg ggaccgcgag abcaacaacb 3420 
agagccagaa ccagcaggag aagaacgagc 34 80 
gccbgbggaa cbggbtcaac abcaccaacb 3540 
bcgbgggcgg ccbggbgggc cbgcgcatcg 3600 
gabctgcbgt gccbtcbagt bgccagccab 3660 
tgacccbgga aggbgccacb cccacbgbcc 3720 
abbgbcbgag baggbgbcab bcbabbcbgg 3780 
aggabbggga agacaatagc aggcatgcbg 3840 
tgctgaagaa bbgacccggb tcctccbggg 3 900 
gbgacacacc cbgbccacgc ccctggbbcb 3960 
agcbcaggag ggcbccgccb bcaabcccac 4020 
cccbcabcag cccaccaaac caaaccbagc 4080 
baggcbabba agbgcagagg gagagaaaat 4140 
aatcabagaa bttbaaggcc abgatbbaag 4200 
gctcacbgac bcgcbgcgcb cggbcgbbcg 4260 
ggcggbaaba cggbtatcca cagaatcagg 4 320 
aggccagcaa aaggccagga accgbaaaaa 4380 
ccgccccccb gacgagcabc acaaaaabcg 4440 
aggacbataa agabaccagg cgtbbccccc 4500 
gaccctgccg cttaccggat acctgbccgc 4560 
bcabagctca cgcbgbaggb abcbcagbbc 4620 
tgtgcacgaa ccccccgttc agcccgaccg 4680 
gbccaacccg gbaagacacg acbbabcgcc 4740 
cagagcgagg babgbaggcg gbgcbacaga 4 800 
cactagaaga acagtahbbg gtabcbgcgc 4860 
agbbggbagc bcbbgabccg gcaaacaaac 4920 
caagcagcag abbacgcgca gaaaaaaagg 4980 
ggggtcbgac gcbcagbgga acgaaaacbc 504 0 
aaaaaggabc bbcaccbaga bccbtbbaaa 5100 
babatatgag baaacbbggt ctgacagbta 5160 
agcgabcbgt cbatttcgbb catccabagb 5220 
tctgccbcgt gaagaaggbg btgctgacbc 5280 
ccagaaagbg agggagccac ggbbgabgag 5340 
btbgaacbbb bgcbbtgcca cggaacggbc 5400 
ctbcaacbca gcaaaagbbc gatbbabbca 5460 
abgctcbgcc agbgbbacaa ccaabbaacc 5520 
aaabgaaacb gcaabbbatb cababcagga 5580 
bbcbgbaabg aaggagaaaa cbcaccgagg 5640 
cggbcbgcga Cbccgactcg bccaacabca 5700 
abaaggbbab caagbgagaa abcaccabga 5760 
agcbbabgca bbbcbbbcca gactbgtbca 5820 
bcactcgcat caaccaaacc gbtabtcabb 5880 
cgabcgcbgb baaaaggaca abbacaaaca 5940 
gccagcgcab caacaababb bbcaccbgaa 6000 
gbtbbcccgg ggabcgcagb ggbgagtaac 6060 ■ 
bbgabggbcg gaagaggcab aaabbccgbc 6120 
acatcabbgg caacgcbacc tbbgccabgb 6180 
ccatacaatc gatagabbgt cgcaccbgab 6240 
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tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat gttggaattt 6300 

aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc ccttgtatta 6360 

ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc ttgtgcaatg 6420 

taacatcaga gattttgaga cacaacgtgg ctfctcccccc ccccccatta ttgaagcatt 6480 

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 6540 

ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga aaccattatt 6600 

atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 6649 



<210> 66 
<211> 6610 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/B containing HIV genes 
<400> 66 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 1B0 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt. tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccactg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 78 0 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 144 0 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga atgcgcgtga 1920 
aggagaagta ccagcacctg tggcgctggg gctggcgctg gggcaccatg ctgctgggca 1980 
tgctgatgat ctgcagcgcc accgagaagc tgtgggtgac cgtgtactac ggcgtgcccg 2040 
tgtggaagga ggccaccacc accctgctct gcgccagcga cgccaaggcc tacgacaccg 2100 
aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac ccccaggagg 2160 
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bggbgcbggb gaacgtgacc gagaacbbcg acatgtggaa gaacgacatg gtggagcaga 2220 
tgcacgagga catcatcagc cbgtgggacc agagcctgaa gccctgcgbg aagctgaccc 2280 
ccctgtgcgt gagcctgaag tgcaccgacc tgaagaacga caccaacacc aacagcagca 2340 
gcggccgcat gatcatggag aagggcgaga tcaagaactg cagcbbcgcb agcaccagcb 2400 
gcaacaccag cgtgatcacc caggcctgcc ccaaggtgag cttcgagccc atccccaacc 24 60 
actactgcgc ccccgccggc tbcgccabcc tgaagtgcaa ggacaagaag ttcaacggca 2520 
agggcccctg caccaacgtg agcaccgtgc agtgcaccca cggcatccgc cccgtggtga 2580 
gcacccagct gctgctgaac ggtagccfcgg ccgaggagga ggtggbgatc cgcagcgcta 2640 
acttcgccga caacgccaag gtgatcatcg tgcagcfcgaa cgagagcgtg gagabcaact 2700 
gcacccgcgc tagcgcccac bgcaaccbga gccgcgccaa gtggaacgac acccbgaaca 2760 
agatcgtgat caagctgcgc gagcagttcg gcaacaagac catcgtgttc aagcacagca 2 820 
gcggcggcga ccccgagatc gtgacccaca gcttcaactg cggcggcgag ttctbctact 2880 
gcaacagcgc bagccbgccc tgccgcatca agcagatcat caacatgtgg cagaaggtgg 2940 
gcaaggccat gtacgccccc cccatcagcg gccagatccg ctgcagcagc aacatcaccg 3000 
gcctgcbgcb gacccgcgac ggcggcaaca gcaacaacga gagcgagatc ttccgtccgg 3060 
gcggcggcga catgcgcgac aactggcgca gcgagctgta caagtacaag gtggtgaaga 312 0 
tcgagcccch gggcgtggcc cccaccaagg ccaagcttac cgtccaggcc cgccagctgc 3180 
tgagcggcat cgtgcagcag cagaacaacc tgctgcgcgc catcgaggcc cagcagcacc 3240 
tgctgcagct gaccgtgtgg ggcatcaagc agcbgcaggc ccgcaccctg gccgbggagc 3300 
gctacctgaa ggaccagcag cagctcgagc agabcbggaa ccacaccacc bggatggagb 3 360 
gggaccgcga gatcaacaac tacaccagcc tgatccacag cctgatcgag gagagccaga 3420 
accagcagga gaagaacgag caggagcbgc tggagctgga caagbgggcc agcctgtgga 3480 
actggttcaa catcaccaac bggcbgbggb acatcaagct gttcatcatg atcgtgggcg 354 0 
gccbggbggg cctgcgcatc gtgttcgccg tgctgagcat cbgaggabcc agabcbgcbg 3600 
tgccttctag ttgccagcca bctgbbgbtb gcccctcccc cgtgccttcc bbgacccbgg 3660 
aaggtgccac bcccacbgbc ctttcctaat aaaatgagga aattgcatcg cattgtctga 3720 
gtaggtgtca ttctattctg gggggbgggg bggggcagga cagcaagggg gaggattggg 3780 
aagacaatag caggcabgcb ggggabgcgg bgggcbcbat gggtacccag gtgctgaaga 3840 
attgacccgg ttcctcctgg gccagaaaga agcaggcaca tccccttctc tgtgacacac 3900 
cctgtccacg cccctggttc ttagttccag ccccactcat aggacactca bagcbcagga 3960 
gggctccgcc ttcaatccca cccgcbaaag tacttggagc ggtctctccc tccctcatca 4020 
gcccaccaaa ccaaacctag cctccaagag tgggaagaaa tbaaagcaag abaggcbabt 4080 
aagbgcagag ggagagaaaa bgccbccaac abgtgaggaa gbaabgagag aaatcabaga 414 0 
atbbbaaggc cabgabbbaa ggccabcabg gccbbaabcb bccgcbbccb cgcbcactga 4200 
cbcgcbgcgc bcggtcgbbc ggcbgcggcg agcggbabca gcbcacbcaa aggcggbaab 4260 
acggbbabcc acagaabcag gggabaacgc aggaaagaac abgtgagcaa aaggccagca 4320 
aaaggccagg aaccgbaaaa aggccgcgbt gcbggcgtbb bbccataggc bccgcccccc 4380 
bgacgagcab cacaaaaabc gacgcbcaag tcagaggbgg cgaaacccga caggacbaba 4440 
aagabaccag gcgbtbcccc cbggaagcbc ccbcgbgcgc bcbccbgttc cgacccbgcc 4500 
gcbbaccgga bacctgbccg ccbbtcbccc bbcgggaagc gbggcgcbbt cbcatagcbc 4560 
acgcbgbagg babctcagbb cggbgbaggb cgbbcgcbcc aagcbgggcb gtgbgcacga 4620 
accccccgbb cagcccgacc gcbgcgccbb abccggtaac babcgtcbbg agbccaaccc 4680 
ggtaagacac gacbtatcgc cacbggcagc agccactggb aacaggabta gcagagcgag 4740 
gbabgtaggc ggbgcbacag agbbctbgaa gbggbggccb aacbacggcb acacbagaag 4800 
aacagtabbb ggbatctgcg cbcbgcbgaa gccagbtacc tbcggaaaaa gagtbggbag 4860 
cbcbbgabcc ggcaaacaaa ccaccgcbgg bagcggbggb tbbbbbgbbb gcaagcagca 4920 
gabbacgcgc agaaaaaaag gabcbcaaga agatccbbbg abcbbbbcba cggggtcbga 4980 
cgcbcagbgg aacgaaaacfc cacgtbaagg gatbbbggbc abgagabbab caaaaaggab 5040 
cbbcaccbag abccfcttbaa abbaaaaatg aagtbbbaaa tcaabctaaa gbabatabga 5100 
gbaaacbbgg bcbgacagtb accaatgctt aabcagbgag gcaccbabct cagcgabctg 5160 
bctabtbcgb bcabccabag bbgccbgacb cggggggggg gggcgcbgag gbcbgccbcg 5220 
bgaagaaggb gbbgcbgacb cabaccaggc cbgaabcgcc ccabcabcca gccagaaagb 5280 
gagggagcca cggbbgabga gagcbbtgbb gbaggbggac cagbbggbga btbbgaactb 5340 
bbgcbbbgcc acggaacggb ctgcgbtgbc gggaagabgc gbgabcbgab ccbbcaacbc 5400 
agcaaaagbb cgattbatfcc aacaaagccg ccgbcccgtc aagtcagcgb aabgcbctgc 5460 
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cagtgttaca accaabbaac caattctgat tagaaaaact cabcgagcab caaatgaaac 5520 

bgcaabbbab bcababcagg abtabcaaba ccatatbttt gaaaaagccg bbbcbgbaab 5580 

gaaggagaaa actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg 5640 

atbccgacbc gtccaacatc aatacaacct abbaatbbcc ccbcgbcaaa aataaggtta 5700 

tcaagtgaga aabcaccatg agtgacgact gaatccggtg agaatggcaa aagcbbatgc 5760 

atttctttcc agacttgtbc aacaggccag ccattacgct cgbcabcaaa atcactcgca 5820 

tcaaccaaac cgbbatbcab tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg 5880 

ttaaaaggac aatbacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca 5940 

tcaacaatat bbbcaccbga abcaggabab bcbbcbaaba cctggaatgc tgttttcccg 6000 

gggabcgcag bggbgagbaa ccatgcatca tcaggagtac ggabaaaabg cttgabggtc 6060 

ggaagaggca baaabbccgb cagccagttt agtctgacca bcbcabcbgb aacatcabbg 6120 

gcaacgctac ctttgccatg bbbcagaaac aacbcbggcg cabcgggcbt cccabacaab 6180 

cgabagabtg bcgcaccbga bbgcccgaca bbabcgcgag cccabbbaba cccababaaa 6240 

bcagcatcca bgttggaabb baabcgcggc cbcgagcaag acgbbbcccg tbgaababgg 6300 

ctcabaacac cccbbgbabb actgbbtabg baagcagaca gtbbbabbgb tcabgabgab 6360 

atabbbbbab cbbgbgcaab gbaacatcag agabbbbgag acacaacgbg gcbbtccccc 6420 

cccccccabb abbgaagcab bbabcagggb babbgbcbca bgagcggaba cabatbbgaa 6480 

bgbabbbaga aaaataaaca aabaggggbb ccgcgcacab ttccccgaaa agbgccaccb 6540 

gacgbctaag aaaccabtab babcabgaca bbaacctaba aaaabaggcg batcacgagg 6600 

cccbbbcgbc 6610 



<210> 67 
<211> 6610 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes - 
<400> 67 

bcgcgcgtbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgba cbgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gcgtaaggag aaaabaccgc abcagabbgg 240 
cbabbggcca bbgcabacgb bgbabccaba bcabaababg tacabbbaba bbggcbcabg 300 
bccaacabba ccgccabgbb gacabbgabb abbgacbagb babbaabagb aabcaabbac 360 
ggggbcabba gbbcabagcc cabababgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg tcaabaabga cgbabgbbcc 480 
cabagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagbabb bacggbaaac 54 0 
bgcccacbbg gcagbacabc aagbgtabca babgccaagb acgcccccba bbgacgbcaa 600 
tgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbbabggg acbbbccbac 660 
bbggcagbac abcbacgbab tagbcabcgc babbaccabg gbgabgcggb btbggcagba 720 
cabcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagbcbcc accccabbga 780 
cgbcaabggg agbbbgbbbb ggcaccaaaa bcaacgggac bbbccaaaab gtcgbaacaa 840 
cbccgcccca bbgacgcaaa bgggcggtag gcgtgbacgg bgggaggbcb atabaagcag 900 
agctcgbbba gbgaaccgbc agabcgccbg gagacgccab ccacgcbgbb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggab 1020 
bccccgbgcc aagagbgacg baagbaccgc cbabagacbc babaggcaca ccccbtbggc 1080 
bcbbabgcab gcbabacbgb bbtbggcbbg gggccbabac acccccgcbb ccbbatgcba 1140 
baggbgabgg babagcbbag cctabaggbg tgggbbabbg accabbatbg accacbcccc 1200 
babbggbgac gabacbbbcc abtacbaabc cabaacabgg cbcbbbgcca caacbabcbc 1260 
babbggcbab abgccaabac bcbgbccbbc agagacbgac acggacbcbg babbbbbaca 1320 
ggabggggbc ccabbbabba bbbacaaabt cacababaca acaacgccgb cccccgtgcc 1380 
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cgcagttttt attaaacata gcgbgggatc 
catgggctct tctccggtag cggcggagct 
agcggcbcab ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagbgbgccg. 
gaaaatgagc gtggagabtg ggcbcgcacg 
gcagaagaag atgcaggcag cbgagbbgbb 
gbbgcggbgc bgbbaacggb ggagggcagt 
cgcgccacca gacabaabag cbgacagacb 
tgcagtcacc gtcgtcgaca cgtgtgatca 
aggagaagta ccagcacctg bggcgcbggg 
tgcbgabgab ctgcagcgcc accgagaagc 
tgbggaagga ggccaccacc accctgcbcb 
aggbgcacaa cgtgtgggcc acccacgcct 
bggbgcbggb gaacgtgacc gagaactbcg 
tgcacgagga catcatcagc ctgtgggacc 
cccbgbgcgb gagcctgaag tgcaccgacg 
cccaggccbg ccccaaggtg agcttcgagc 
gcttcgccat cctgaagtgc aaggacaaga 
tgagcaccgt gcagtgcacc cacggcabcc 
acggtagcct ggccgaggag gaggtggtga 
aggbgabcab cgtgcagctg aacgagagcg 
actgcaacct gagccgcgcc aagbggaacg 
gcgagcagtt cggcaacaag accabcgbgb 
bcgbgaccca cagcbtcaac bgcggcggcg 
tcaacagcac cbggbtcaac agcaccbgga 
gcgacaccab cacccbgccc bgccgcabca 
gcaaggccab gbacgccccc cccabcagcg 
gcctgcbgcb gacccgcgac ggcggcaaca 
gcggcggcga cabgcgcgac aacbggcgca 
bcgagccccb gggcgtggcc cccaccaagg 
tgagcggcat cgbgcagcag cagaacaacc 
bgcbgcagcb gaccgbgbgg ggcabcaagc 
gcbaccbgaa ggaccagcag cagctcgagc 
gggaccgcga gabcaacaac bacaccagcc 
accagcagga gaagaacgag caggagctgc 
acbggtbcaa catcaccaac bggcbgbggb 
gccbggbggg ccbgcgcatc gtgbbcgccg 
bgccttcbag tbgccagcca bcbgbbgbbb 
aaggbgccac tcccacbgbc ctbbcctaab 
gtaggbgbca bbcbabbcbg gggggtgggg 
aagacaabag caggcabgcb ggggatgcgg 
abbgacccgg bbccbccbgg gccagaaaga 
ccbgbccacg ccccbggtbc ttagbtccag 
gggcbccgcc bbcaabccca cccgcbaaag 
gcccaccaaa ccaaaccbag ccbccaagag 
aagbgcagag ggagagaaaa tgccbccaac 
atbbbaaggc cabgabtbaa ggccabcabg 
cbcgcbgcgc bcggbcgbbc ggcbgcggcg 
acggbbabcc acagaabcag gggabaacgc 
aaaggccagg aaccgbaaaa aggccgcgbb 
tgacgagcat cacaaaaatc gacgcbcaag 
aagabaccag gcgbbbcccc cbggaagcbc 
gcbbaccgga baccbgbccg ccbtbcbccc 
acgcbgbagg batcbcagbb cggbgbaggb 
accccccgbb cagcccgacc gcbgcgccbb 



tccacgcgaa bctcgggbac gbgbbccgga 1440 
bccacabccg agcccbggbc ccatgcctcc 1500 
cbccbaacag bggaggccag actbaggcac 1560 
cacaaggccg bggcggbagg gbabgbgtcb 1620 
gcbgacgcag atggaagacb baaggcagcg 1680 
gbatbcbgab aagagbcaga ggbaacbccc 1740 
gbagtcbgag cagbacbcgb bgcbgccgcg 1800 
aacagacbgb bccbbbccab gggbcbbbbc 1860 
gababcgcgg ccgcbcbaga abgcgcgbga 1920 
gcbggcgcbg gggcaccabg cbgcbgggca 1980 
tgbgggbgac cgbgbacbac ggcgbgcccg 204 0 
gcgccagcga cgccaaggcc bacgacaccg 2100 
gcgbgcccac cgaccccaac ccccaggagg 2160 
acabgbggaa gaacgacabg gtggagcaga 2220 
agagccbgaa gcccbgcgtg aagcbgaccc 2280 
cbagcaccag ctgcaacacc agcgbgabca 2340 
ccabccccaa ccacbacbgc gcccccgccg 24 00 
agbtcaacgg caagggcccc tgcaccaacg 2460 
gccccgbggb gagcacccag cbgcbgcbga 2520 
bccgcagcgc baacbbcgcc gacaacgcca 25 80 
bggagabcaa cbgcacccgc gcbagcgccc 2640 
acacccbgaa caagabcgbg abcaagcbgc 2700 
bcaagcacag cagcggcggc gaccccgaga 2760 
agbtcbbcba cbgcaacagc acccagcbgb 2820 
gcaccgaggg cagcaacaac accgagggca 2 880 
agcagabcab caacabgbgg cagaaggbgg 2940 
gccagabccg cbgcagcagc aacabcaccg 3000 
gcaacaacga gagcgagabc bbccgbccgg 3060 
gcgagctgba caagtacaag gtggbgaaga 3120 
ccaagctbac cgbccaggcc cgccagcbgc 3180 
tgcbgcgcgc cabcgaggcc cagcagcacc 3240 
agcbgcaggc ccgcacccbg gccgbggagc 3300 
agabcbggaa ccacaccacc bggabggagb 3360 
bgabccacag cctgabcgag gagagccaga 3420 
bggagcbgga caagbgggcc agccbgtgga 34 80 
acabcaagcb gbbcabcabg abcgbgggcg 3540 
bgcbgagcab cbgaggatcc agabcbgcbg 3600 
gccccbcccc cgbgccbbcc btgacccbgg 3660 
aaaatgagga aabbgcabcg cabbgbcbga* 3720 
tggggcagga cagcaagggg gaggabtggg 3780 
bgggcbcbab gggbacccag gbgcbgaaga 3840 
agcaggcaca tccccbbcbc bgbgacacac 3900 
ccccacbcat aggacacbca bagcbcagga 3960 
bacbbggagc ggbcbcbccc bcccbcabca 4020 
bgggaagaaa bbaaagcaag abaggctabb 4080 
abgbgaggaa gbaabgagag aaabcabaga 4140 
gccbbaatcb bccgctbccb cgcbcacbga 4200 
agcggbabca gcbcacbcaa aggcggbaab 4260 
aggaaagaac abgbgagcaa aaggccagca 4320 
gcbggcgbbb bbccabaggc bccgcccccc 4380 
bcagaggbgg cgaaacccga caggacbaba 4440 
ccbcgbgcgc bcbccbgbbc cgacccbgcc 4500 
bbcgggaagc gbggcgcbbb cbcabagcbc 4560 
cgbbcgcbcc aagcbgggcb gbgbgcacga 4 620 
abccggbaac babcgbcbbg agtccaaccc 4680 
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ggtaagacac gacbbabcgc cactggcagc agccacbggb aacaggatta gcagagcgag 4740 

gtatgtaggc ggbgcbacag agttcttgaa gtggtggcct aactacggct acacbagaag 4 BOO 

aacagbabtb ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 4860 

cbcbtgabcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 4920 

gattacgcgc agaaaaaaag gatchcaaga agatcctttg atcttttcta cggggbcbga 4 980 

cgcfccagtgg aacgaaaact cacgttaagg gabbbbggtc abgagattab caaaaaggat 5040 

cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gbabababga 5100 

gtaaacttgg tctgacagtt accaatgcbb aatcagtgag gcacctatct cagcgatcbg 5160 

tcbabbtcgb tcatccatag ttgcctgact cggggggggg gggcgctgag gbcbgccbcg 5220 

bgaagaaggb gttgctgact cataccaggc ctgaatcgcc ccatcatcca gccagaaagt 5280 

gagggagcca cggbtgatga gagctttgtt gtaggtggac cagtbggbga bbbbgaacbb 5340 

btgcbbtgcc acggaacggt ctgcgttgtc gggaagatgc gtgatcbgab ccttcaactc 5400 

agcaaaagtt cgatttattc aacaaagccg ccgtcccgtc aagbcagcgt aatgctctgc 5460 

cagtgttaca accaatbaac caatbcbgab tagaaaaact catcgagcat caaatgaaac 5520 

bgcaabbbab tcababcagg abtabcaaba ccababbttb gaaaaagccg bbbcbgbaab 5580 

gaaggagaaa acbcaccgag gcagbtccab aggatggcaa gabccbggba tcggbcbgcg 5640 

atbccgacbc gbccaacabc aabacaaccb atbaabttcc ccbcgbcaaa aataaggbba 5700 

tcaagbgaga aatcaccabg agtgacgacb gaatccggtg agaatggcaa aagcbbabgc 5760 

abtbcbbbcc agacbbgbbc aacaggccag ccabbacgct cgbcabcaaa abcacbcgca 5820 

tcaaccaaac cgbtatbcat tcgbgabtgc gccbgagcga gacgaaabac gcgabcgcbg 5880 

bbaaaaggac aatbacaaac aggaabcgaa bgcaaccggc gcaggaacac bgccagcgca 5940 

bcaacaatat tbtcaccbga abcaggabab bcbbcbaaba ccbggaabgc bgbbtbcccg 6000 

gggabcgcag bggbgagbaa ccabgcabca bcaggagbac ggabaaaabg ctbgabggbc 6060 

ggaagaggca baaabbccgb cagccagtbt agbctgacca tcbcatctgb aacatcabbg 6120 

gcaacgcbac cbbbgccabg tbbcagaaac aacbctggcg cabcgggcbb cccabacaab 6180 

cgabagabbg bcgcaccbga tbgcccgaca btabcgcgag cccabtbaba cccababaaa 6240 

tcagcabcca bgbbggaabt taabcgcggc ctcgagcaag acgbbtcccg tbgaababgg 6300 

cbcabaacac cccbbgbabt actgbbbabg baagcagaca gbtbbabbgt bcatgabgab 6360 

ababbbbbab cbbgbgcaab gbaacabcag agabbbtgag acacaacgbg gcbtbccccc 6420 

cccccccabb abtgaagcat bbabcagggt babbgbcbca tgagcggaba cababtbgaa 6480 

bgbabbbaga aaaabaaaca aataggggbt ccgcgcacab tbccccgaaa agbgccaccb 6540 

gacgbcbaag aaaccabbab batcabgaca bbaaccbaba aaaabaggcg babcacgagg 6600 

cccbbbcgbc 6610 



<210> 68 
<211> 6613 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 68 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggtca 60 
cagcbbgbcb gtaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbbaacbatg cggcatcaga gcagabbgba ctgagagbgc 180 
accatabgcg gbgbgaaata ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
cbabtggcca bbgcabacgt bgtabccata bcabaatabg bacabbbaba bbggcbcatg 300 
bccaacabba ccgccabgbb gacatbgatb abbgactagb batbaabagb aabcaabbac 360 
ggggtcabba gbbcabagcc cabababgga gbbccgcgbt acabaacbba cggbaaatgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaataabga cgbatgbbcc 480 
cabagbaacg ccaataggga cbbtccabbg acgbcaabgg gbggagtabb bacggbaaac 540 
bgcccacbbg gcagbacatc aagtgbabca babgccaagb acgcccccba bbgacgbcaa 600 



- 159 - 



WO 02/32943 PCT/US01/25721 

tgacggtaaa tggcccgcct ggcattahgc ccagtacabg accbtatggg acbbbccbac 660 

ttggcagtac abcbacgbab tagtcatcgc tattaccatg gtgabgcggb tttggcagta 720 

catcaatggg cgtggatagc ggtthgacfcc acggggattt ccaagtctcc accccattga 780 

cgtcaatggg agbtbgbbbb ggcaccaaaa bcaacgggac tttccaaaat gtcgtaacaa 840 

cbccgcccca tbgacgcaaa bgggcggbag gcgtgtacgg bgggaggbcb atataagcag 900 

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggab 1020 

tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 

bcbtabgcab gcbabacbgb ttttggcttg gggcctatac acccccgctb ccbbabgcba 1140 

taggtgatgg tatagcttag cctataggtg bgggbbabbg accabbabbg accacbcccc 1200 

fcabbggbgac gatactttcc attactaafcc cataacatgg cbctbbgcca caacbatcbc 1260 

tattggctat atgccaatac bcbgbccbbc agagactgac acggacbcbg tatttttaca 1320 

ggatggggbc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt abbaaacaba gcgtgggabc tccacgcgaa tcbcgggbac gtgttccgga 1440 

catgggctct tctccggtag cggcggagct tccacatccg agcccbggbc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaabgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 

gaaaatgagc gbggagabbg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag cbgagbbgbb gtattcbgat aagagtcaga ggtaacbccc 1740 

gtbgcggtgc tgbbaacggb ggagggcagt gtagbcbgag cagtactcgt tgcbgccgcg 1800 

cgcgccacca gacataabag ctgacagact aacagacbgt tccbbtccab gggbcbbbbc 1860 

tgcagbcacc gbcgtcgaca cgtgtgatca gatabcgcgg ccgctcbaga abgcgcgbga 1920 

aggagaagta ccagcaccbg bggcgcbggg gctggcgctg gggcaccabg cbgctgggca 1980 

tgctgabgat ctgcagcgcc accgagaagc bgbgggbgac cgtgbacbac ggcgtgcccg 2040 

tgbggaagga ggccaccacc acccbgcbcb gcgccagcga cgccaaggcc bacgacaccg 2100 

aggtgcacaa cgtgtgggcc acccacgccb gcgbgcccac cgaccccaac ccccaggagg 2160 

bggtgctggt gaacgbgacc gagaacbtcg acatgbggaa gaacgacabg gtggagcaga 2220 

bgcacgagga cabcatcagc cbgbgggacc agagccbgaa gcccbgcgbg aagctgaccc 22 80 

ccctgtgcgb gagccbgaag bgcaccgacg cbagcaccag cbgcaacacc agcgtgatca 2340 

cccaggccbg ccccaaggbg agcbbcgagc ccatccccaa ccacbacbgc gcccccgccg 24 00 

gcbbcgccab ccbgaagbgc aaggacaaga agbtcaacgg caagggcccc tgcaccaacg 2460 

bgagcaccgt gcagbgcacc cacggcabcc gccccgtggt gagcacccag ctgcbgcbga 2520 

acggtagcct ggccgaggag gaggbggtga bccgcagcgc baacbbcgcc gacaacgcca 2580 

aggbgabcab cgbgcagcbg aacgagagcg bggagabcaa cbgcacccgc cccaacaaca 2640 

acacccgcaa gagcabccac abcggccccg gccgcgcctb cbacaccacc ggcgagatca 2700 

bcggcgacab ccgccaggcc cacbgcaacc bgagccgcgc caagbggaac gacacccbga 27 60 

acaagabcgb gabcaagcbg cgcgagcagb bcggcaacaa gaccabcgbg bbcaagcaca 2820 

gcagcggcgg cgaccccgag abcgbgaccc acagctbcaa cbgcggcggc gagttcbtcb 2880 

acbgcaacag cgcbagccbg cccbgccgca bcaagcagab cabcaacabg bggcagaagg 2940 

bgggcaaggc cabgbacgcc ccccccatca gcggccagat ccgctgcagc agcaacabca 3000' 

ccggccbgcb gcbgacccgc gacggcggca acagcaacaa cgagagcgag abcbtccgbc 3060 

cgggcggcgg cgacabgcgc gacaacbggc gcagcgagcb gtacaagbac aaggbggtga 3120 

agabcgagcc ccbgggcgbg gcccccacca aggccaagct baccgbccag gcccgccagc 3180 

tgctgagcgg cabcgbgcag cagcagaaca accbgcbgcg cgccabcgag gcccagcagc 3240 

accbgctgca gcbgaccgtg bggggcabca agcagcbgca ggcccgcacc cbggccgbgg 3300 

agcgctaccb gaaggaccag cagcagcbcg agcagabcbg gaaccacacc accbggabgg 3360 

agbgggaccg cgagabcaac aacbacacca gccbgatcca cagcctgabc gaggagagcc 3420 

agaaccagca ggagaagaac gagcaggagc tgcbggagcb ggacaagbgg gccagcctgt 3480 

ggaactggbt caacabcacc aacbggcbgt ggtacabcaa gcbgtbcabc abgabcgbgg 3540 

gcggccbggt gggcctgcgc abcgtgtbcg ccgbgcbgag cabcbgagga tccagatcbg 3600 

ctgbgccbtc bagbtgccag ccabcbgbbg bbtgccccbc ccccgbgccb tccbbgaccc 3660 

bggaaggbgc cacbcccacb gbccbbbcct aataaaatga ggaaabbgca tcgcabbgbc 3720 

bgagtaggbg bcabbcbabb cbggggggbg gggbggggca ggacagcaag ggggaggabb 3780 

gggaagacaa bagcaggcab gcbggggabg cggbgggcbc tatgggfcacc caggbgcbga 3840 

agaatbgacc cggbbccbcc tgggccagaa agaagcaggc acabccccbb cbctgbgaca 3900 
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caccctgtcc acgcccctgg ttcttagttc cagccccact cataggacac tcatagc.tca 3960 

ggagggctcc gccttcaatc ccacccgcta aagtacttgg agcggtctct ccctccctca 4020 

tcagcccacc aaaccaaacc tagcctccaa gagtgggaag aaattaaagc aagataggct 4080 

attaagtgca gagggagaga aaatgcctcc aacatgtgag gaagtaatga gagaaatcat 414 0 

agaattttaa ggccatgatt taaggccatc atggccttaa tcttccgctt cctcgctcac 4200 

tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt 4260 

aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca 4320 

gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttfctccata ggctccgccc 4380 

ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact 4440 

ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct 4500 

gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag 4560 

ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca 4 620 

cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa 4680 

cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc 4740 

gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag 4800 

aagaacagta tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg 4 860 

tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca 4920 

gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc 4980 

tgacgctcag tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag 5040 

gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct aaagtatata 5100 

tgagtaaact tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat 5160 

ctgtctattt cgttcatcca tagttgcctg actcgggggg ggggggcgct gaggtctgcc 5220 

tcgtgaagaa ggfcgttgctg actcatacca ggcctgaatc gccccatcat ccagccagaa 5280 

agtgagggag ccacggttga cgagagcttt gttgtaggtg gaccagttgg tgattttgaa 5340 

cttttgcttt gccacggaac ggtctgcgtt gtcgggaaga tgcgtgatct gatccttcaa 5400 

ctcagcaaaa gttcgattta ttcaacaaag ccgccgtccc gtcaagtcag cgtaatgctc 5460 

tgccagtgtt acaaccaatt aaccaattct gattagaaaa actcatcgag catcaaatga 5520 

aactgcaatt tattcatatc aggattatca ataccatatt tttgaaaaag ccgtttctgt 5580 

aatgaaggag aaaactcacc gaggcagttc cataggatgg caagatcctg gtatcggtct 5640 

gcgattccga ctcgtccaac atcaatacaa cctattaatt tcccctcgtc aaaaataagg 5700 

ttatcaagtg agaaatcacc atgagtgacg actgaatccg gtgagaatgg caaaagctta 5760 

tgcatttctt tccagacttg ttcaacaggc cagccattac gctcgtcatc aaaatcactc 5820 

gcatcaacca aaccgttatt cattcgtgat tgcgcctgag cgagacgaaa tacgcgatcg 5880 

ctgttaaaag gacaattaca aacaggaatc gaatgcaacc ggcgcaggaa cactgccagc 5940 

gcatcaacaa tattttcacc tgaatcagga tattcttcta atacctggaa tgctgttttc 6000 

ccggggatcg cagtggtgag taaccatgca tcatcaggag tacggataaa atgcttgatg 6060 

gtcggaagag gcataaattc cgtcagccag tttagtctga ccatctcatc tgtaacatca 6120 

ttggcaacgc tacctttgcc atgtttcaga aacaactctg gcgcatcggg cttcccatac 6180 

aatcgataga ttgtcgcacc tgattgcccg acattatcgc gagcccattt atacccatat 6240 

aaatcagcat ccatgttgga atttaatcgc ggcctcgagc aagacgtttc ccgttgaata 6300 

tggctcataa caccccttgt attactgttt atgtaagcag acagttttat tgttcatgat 6360 

gatatatttt tatcttgtgc aatgtaacat cagagatttt gagacacaac gtggctttcc 64 20 

cccccccccc attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt 6480 

gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca 6540 

cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg 6600 

aggccctttc gtc 6613 



<210> 69 

<211> 6532 

<212> DNA 

<213> Artificial Sequence 
c220> 
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<400> 69 

tcgcgcgttt cggtgatgac ggtgaaaacc 
cagcttgtct gtaagcggat gccgggagca 
ttggcgggtg tcggggctgg cttaactatg 
accatatgcg gtgtgaaata ccgcacagab 
ctattggcca ttgcatacgb bgtabccaba 
bccaacabba ccgccabgtb gacahtgatt 
ggggbcabba gttcatagcc catatatgga 
cccgcctggc tgaccgccca acgacccccg 
catagtaacg ccaataggga ctbbccattg 
tgcccacttg gcagtacatc aagtgtatca 
bgacggbaaa tggcccgcct ggcattatgc 
ttggcagtac atcbacgbab tagtcatcgc 
catcaatggg cgtggatagc ggbbbgacbc 
cgtcaatggg agtttgtttt ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agctcgttta gtgaaccgtc agatcgcctg 
tagaagacac cgggaccgat ccagcctccg 
bccccgbgcc aagagbgacg taagtaccgc 
tcttatgcat gctatactgb bbbbggctbg 
taggtgatgg tatagcttag cctataggtg 
tattggtgac gatactttcc attactaatc 
batbggctab atgccaatac tctgtccttc 
ggatggggtc ccatttatta tttacaaatt 
cgcagbbtbt atbaaacata gcgtgggatc 
catgggcbcb tctccggtag cggcggagct 
agcggctcat ggtcgctcgg cagcbccbbg 
agcacaatgc ccaccaccac cagbgbgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgth 
gttgcggbgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gbcgbcgaca cgtgbgabca 
aggagaagta ccagcacctg tggcgctggg 
bgcbgatgat ctgcagcgcc accgagaagc 
bgbggaagga ggccaccacc accctgctct 
aggtgcacaa cgtgtgggcc acccacgcct 
tggtgctggt gaacgtgacc gagaacttcg 
tgcacgagga catcabcagc cbgbgggacc 
cccbgfcgcgt gagcctgaag tgcaccgacg 
cccaggcctg ccccaaggbg agcttcgagc 
gcbbcgccab cctgaagtgc aaggacaaga 
bgagcaccgb gcagbgcacc cacggcabcc 
acggtagcct ggccgaggag gaggtggtga 
aggtgatcat cgtgcagcbg aacgagagcg 
achgcaacct gagccgcgcc aagtggaacg 
gcgagcagtt cggcaacaag accabcgbgb 
tcgtgaccca cagcbbcaac tgcggcggcg 
cctgccgcat caagcagatc atcaacatgt 
cccccatcag cggccagatc cgctgcagca 
acggcggcaa cagcaacaac gagagcgaga 
acaactggcg cagcgagctg tacaagtaca 
cccccaccaa ggccaagctt accgtccagg 



bctgacacat gcagctcccg gagacggtca 60 
gacaagcccg tcagggcgcg tcagcgggtg 120 
cggcatcaga gcagattgta ctgagagtgc 180 
gcgtaaggag aaaataccgc atcagattgg 24 0 
bcabaababg tacatttata ttggctcatg 300 
atbgacbagb tattaatagt aatcaatfcac 360 
gttccgcgtt acataactta cggtaaatgg 420 
cccattgacg tcaataatga cgtatgttcc 480 
acgtcaatgg gtggagtatt tacggtaaac 540 
tatgccaagb acgcccccta bbgacgtcaa 600 
ccagbacabg acctbatggg acbtbcctac 660 
babbaccatg gbgabgcggb tbbggcagba 720 
acggggatbb ccaagtctcc accccabtga 780 
bcaacgggac tttccaaaat gtcgbaacaa 840 
gcgtgbacgg bgggaggbcb abataagcag 900 
gagacgccat ccacgcbgbb btgacctcca 960 
cggccgggaa cggtgcabbg gaacgcggab 1020 
ctatagactc tabaggcaca ccccbttggc 1080 
gggccbabac acccccgcbb ccbtatgcba 1140 
bgggbtabbg accatbatbg accactcccc 1200 
cabaacabgg cbcbbbgcca caacbabctc 1260 
agagacbgac acggacbcbg battbtbaca 1320 
cacababaca acaacgccgt cccccgtgcc 1380 
bccacgcgaa bcbcgggbac gbgbbccgga 1440 
bccacabccg agccctggtc ccatgcctcc 1500 
cbccbaacag tggaggccag acbbaggcac 1560 
cacaaggccg tggcggbagg gbatgbgbcb 1620 
gcbgacgcag atggaagacb baaggcagcg 1680 
gbabbcbgab aagagbcaga ggtaacbccc 1740 
gbagbcbgag cagtactcgt bgcbgccgcg 1800 
aacagacbgb bcctbbccat gggbcbbtbc 1860 
gabatcgcgg ccgcbcfcaga abgcgcgtga 1920 
gcbggcgctg gggcaccabg cbgcbgggca 1980 
tgtgggbgac cgbgbacbac ggcgtgcccg 2040 
gcgccagcga cgccaaggcc tacgacaccg 2100 
gcgbgcccac cgaccccaac ccccaggagg 2160 
acabgtggaa gaacgacabg gbggagcaga 2220 
agagccbgaa gcccbgcgbg aagcbgaccc 2280 
cbagcaccag cbgcaacacc agcgtgatca 2340 
ccabccccaa ccacbacbgc gcccccgccg 2400 
agbbcaacgg caagggcccc bgcaccaacg 2460 
gccccgbggb gagcacccag cbgcbgcbga 2520 
bccgcagcgc baacttcgcc gacaacgcca 2580 
bggagabcaa ctgcacccgc gcbagcgccc 2640 
acaccctgaa caagabcgbg abcaagctgc 2700 
bcaagcacag cagcggcggc gaccccgaga 2760 
agfcbctbcba cbgcaacagc gcbagccbgc 2820 
ggcagaaggt gggcaaggcc atgbacgccc 2880 
gcaacatcac cggcctgctg cbgacccgcg 2940 
tcbbccgtcc gggcggcggc gacabgcgcg 3000 
aggbggbgaa gabcgagccc ctgggcgbgg 3060 
cccgccagct gctgagcggc atcgbgcagc 3120 
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agcagaacaa cctgctgcgc gccatcgagg cccagcagca ccbgctgcag cbgaccgbgb 3180 
ggggcatcaa gcagctgcag gcccgcaccc tggccgtgga gcgctacctg aaggaccagc 3240 
agcagctcga gcagatctgg aaccacacca ccbggatgga gtgggaccgc gagatcaaca 33O0 
actacaccag cctgatccac agccbgabcg aggagagcca gaaccagcag gagaagaacg 33 60 
agcaggagcb gctggagctg gacaagtggg ccagccbgtg gaactggbbc aacabcacca 3420 
acbggcbgtg gbacabcaag ctgbbcabca bgabcgtggg cggccbggtg ggccbgcgca 34 80 
bcgbgbtcgc cgtgctgagc atcbgaggat ccagatctgc tgtgccbbct agbtgccagc 3540 
catcbgttgb bfcgccccbcc cccgtgcctt ccttgaccct ggaaggtgcc actcccacbg 3600 
fccctttccta abaaaatgag gaaabbgcab cgcabtgbcb gagbaggtgb cattcbatbc 3660 
tggggggtgg ggtggggcag gacagcaagg gggaggabbg ggaagacaab agcaggcabg 3720 
cbggggatgc ggtgggcbct atgggbaccc aggtgcbgaa gaattgaccc ggttccbccb 3780 
gggccagaaa gaagcaggca catccccttc bcbgbgacac accctgbcca cgcccctggb 3840 
bctbagbbcc agccccacbc ataggacact catagctcag gagggcbccg ccbtcaatcc 3900 
cacccgctaa agbactbgga gcggbcbctc cctccctcat cagcccacca aaccaaaccb 3960 
agccbccaag agtgggaaga aattaaagca agabaggcba bbaagbgcag agggagagaa 4 020 
aatgccbcca acabgtgagg aagbaabgag agaaabcata gaabbttaag gccabgabbb 4080 
aaggccabca bggcctbaab cttccgcttc cbcgctcact gactcgcbgc gcbcggbcgb 4140 
bcggcbgcgg cgagcggbat cagctcactc aaaggcggba atacggbbab ccacagaatc 42 00 
aggggataac gcaggaaaga acatgbgagc aaaaggccag caaaaggcca ggaaccgtaa 4260 
aaaggccgcg btgctggcgt tttbccatag gctccgcccc cctgacgagc abcacaaaaa 4320 
bcgacgctca agtcagaggb ggcgaaaccc gacaggacba baaagabacc aggcgbbbcc 4380 
cccbggaagc bcccbcgbgc gcbcbcctgb bccgacccbg ccgcbtaccg gabaccbgtc 4440 
cgccbtbcbc ccbbcgggaa gcgtggcgcb btcfccatagc bcacgcbgba ggbabcbcag 4500 
btcggbgbag gbcgttcgct ccaagcbggg cbgtgtgcac gaaccccccg bbcagcccga 4560 
ccgcbgcgcc bbatccggba acbabcgbct tgagbccaac ccggbaagac acgacbbabc 4620 
gccactggca gcagccacbg gtaacaggat bagcagagcg aggtatgbag gcggtgcbac 4680 
agagbtcbtg aagbggbggc cbaactacgg ctacactaga agaacagtab bbggbabcbg 4740 
cgctcbgcbg aagccagbba ccbbcggaaa aagagtbggt agcbctbgab ccggcaaaca 4 8 00 
aaccaccgcb ggbagcggbg gbbtbbbbgt bbgcaagcag cagabtacgc gcagaaaaaa 4B60 
aggabcbcaa gaagatcctb tgabctbbtc tacggggtcb gacgcbcagb ggaacgaaaa 4 920 
ctcacgbtaa gggattbbgg bcabgagatt abcaaaaagg atcbbcaccb agabccttbb 4980 
aaabtaaaaa tgaagtbbta aabcaabcba aagbahatab gagbaaacbb ggbcbgacag 5040 
tbaccaabgc tbaabcagbg aggcacctat cbcagcgabc bgbcbatbbc gbbcabccab 5100 
agbtgcctga ctcggggggg gggggcgcbg aggbcbgccb cgbgaagaag gbgbbgcbga 5160 
cbcabaccag gccbgaabcg ccccabcabc cagccagaaa gbgagggagc cacggbbgab 5220 
gagagcbbbg ttgbaggbgg accagbbggt gatbbbgaac bbbtgcbbtg ccacggaacg 5280 
gbcbgcgbbg tcgggaagab gcgbgatcbg atccbbcaac bcagcaaaag bbcgabtbab 5340 
tcaacaaagc cgccgbcccg bcaagtcagc gbaabgcbcb gccagbgbba caaccaabba 5400 
accaabbcbg abbagaaaaa cbcabcgagc abcaaabgaa acbgcaabbb abbcababca 5460 
ggabbabcaa baccababbb btgaaaaagc cgttbcbgba atgaaggaga aaacbcaccg 5520 
aggcagbbcc abaggabggc aagabccbgg babcggtcbg cgabtccgac bcgbccaaca 5580 
tcaatacaac ctabbaabbt ccccbcgtca aaaabaaggb babcaagbga gaaabcacca 5640 
tgagtgacga ctgaabccgg bgagaatggc aaaagcttab gcabttcbtb ccagacbbgb 5700 
tcaacaggcc agccabbacg ctcgtcabca aaabcacbcg cabcaaccaa accgbbabbc 5760 
abbcgbgabb gcgccbgagc gagacgaaat acgcgatcgc bgbbaaaagg acaabbacaa 582 0 
acaggaabcg aabgcaaccg gcgcaggaac acbgccagcg cabcaacaab abbbbcaccb 5880 
gaatcaggab atbcbbcbaa baccbggaat gctgbbbtcc cggggabcgc agbggtgagb 5940 
aaccabgcab cabcaggagt acggataaaa tgcbbgatgg bcggaagagg cabaaatbcc 6000 
gbcagccagb tbagtctgac cabctcabct gbaacabcab bggcaacgcb accbbbgcca 6060 
tgbbbcagaa acaacbcbgg cgcabcgggc bbcccabaca aCcgabagab bgbcgcaccb 6120 
gabbgcccga cabbabcgcg agcccabbba tacccataba aabcagcabc catgbbggaa 6180 
tbbaabcgcg gccbcgagca agacgbbbcc cgbtgaatab ggctcataac accccbbgba 6240 
tbacbgbtba tgtaagcaga cagbbtbatt gbbcabgabg atatabttbb atcbtgbgca 6300 
abgbaacabc agagabbtbg agacacaacg bggcbtbccc ccccccccca bbafcbgaagc 6360 
abbbabcagg gbbabbgtcb cabgagcgga bacatabbbg aatgbabbba gaaaaabaaa 6420 
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caaatagggg ttccgcgcac atttccccga aaagtgccac ctgacgtcta agaaaccatt 6480 
attatcatga cattaaccta taaaaatagg cgtatcacga ggccctttcg tc 6532 



<210> 70 
<211> 7181 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVRl012x/s containing HIV genes 
<400> 70 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta ttfcacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga acaccatgcg 1920 
cgtgaaggag aagtaccagc acctgtggcg ctggggctgg cgctggggca ccatgctgct 1980 
gggcatgctg atgatctgca gcgccaccga gaagctgtgg ghgaccgtgt actacggcgt 2040 
gcccgtgtgg aaggaggcca ccaccaccct gctctgcgcc agcgacgcca aggcctacga 2100 
caccgaggtg cacaacgtgt gggccaccca cgcctgcgtg cccaccgacc ccaaccccca 2160 
ggaggtggtg ctggtgaacg tgaccgagaa cttcgacabg tggaagaacg acatggtgga 222 0 
gcagatgcac gaggacatca tcagcctgtg ggaccagagc ctgaagccct gcgtgaagct 2280 
gacccccctg tgcgtgagcc tgaagtgcac cgacctgaag aacgacacca acaccaacag 234 0 
cagcagcggc cgcatgatca tggagaaggg cgagatcaag aactgcagct tcaacatcag 2400 
caccagcatc cgcggcaagg tgcagaagga gtacgccttc ttctacaagc tggacatcat 24 60 
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ccccabcgac aacgacacca ccagcbacag ccbgaccagc tgcaacacca gcgtgatcac 2520 
ccaggccbgc cccaaggtga gcttcgagcc catccccaac cactactgcg cccccgccgg 2580 
cttcaccatc ctgaagtgca aggacaagaa gbbcaacggc aagggcccct gcaccaacgt 2640 
gagcaccgtg cagtgcaccc acggcatccg ccccgtggtg agcacccagc bgcbgcbgaa 2700 
cgaccgcctg gccgaggagg aggtggtgat ccgcagcgcc aacbbcgccg acaacgccaa 2760 
ggbgatcabc gbgcagcbga acgagagcgt ggagatcaac tgcacccgcc ccaacaacaa 2 82 0 
cacccgcaag agcatccaca bcggccccgg ccgcgccttc tacaccaccg gcgagatcat 2 880 
cggcgacatc cgccaggccc actgcaaccb gagccgcgcc aagtggaacg acaccctgaa 2 94 0 
caagatcgtg atcaagctgc gcgagcagtt cggcaacaag accabcgbgb tcaagcacag 3000 
cagcggcggc gaccccgaga tcgtgaccca cagcttcaac bgcggcggcg agbbcbbcba 3060 
ctgcaacagc acccagctgt hcaacagcac ctggttcaac agcacctgga gcaccgaggg 3120 
cagcaacaac accgagggca gcgacaccat caccctgccc tgccgcatca agcagatcat 3180 
caacatgbgg cagaaggtgg gcaaggccat gtacgccccc cccatcagcg gccagatccg 3240 
ctgcagcagc aacabcaccg gcctgctgct gacccgcgac ggcggcaaca gcaacaacga 33 00 
gagcgagatc ttccgcctgg gcggcggcga catgcgcgac aactggcgca gcgagctgta 3360 
caagtacaag gtggtgaaga bcgagccccb gggcgtggcc cccaccaagg ccaagcgccg 3420 
cgtggtgcag cgcgagaagc gcgccgbggg catcggcgcc cbgbtccbgg gcttcctggg 3480 
cgccgccggc agcaccatgg gcgccgccag cabgacccbg accgtgcagg cccgccagct 354 0 
gctgagcggc atcgtgcagc agcagaacaa cctgctgcgc gccatcgagg cccagcagca 3600 
cctgctgcag cbgaccgbgb ggggcatcaa gcagctgcag gcccgcatcc bggccgbgga 3660 
gcgctacctg aaggaccagc agctgctggg catctggggc bgcagcggca agctbabcbg 3720 
caccaccgcc gtgccctgga acgccagctg gagcaacaag agcctggagc agatctggaa 3780 
ccacaccacc tggatggagt gggaccgcga gatcaacaac tacaccagcc tgatccacag 384 0 
cctgabcgag gagagccaga accagcagga gaagaacgag caggagcbgc tggagctgga 3900 
caagtgggcc agccbgbgga actggttcaa catcaccaac tggctgtggt acabcaagcb 3960 
gttcatcatg atcgtgggcg gcctggtggg cctgcgcatc gtgttcgccg tgctgagcat 4020 
cgtgaaccgc gtgcgccagg gcbacagccc cctgagcttc cagacccacc tgcccatccc 4080 
ccgcggcccc gaccgccccg agggcatcga ggaggagggc ggcgagcgcg accgcgaccg 414 0 
cacgahccgc cbggbgaacg gcagctgagg atccagatct gctgtgcctt ctagttgcca 42 00 
gccatctgtt gbtbgccccb cccccgtgcc bbccbbgacc ctggaaggtg ccactcccac 4260 
bgbccbbbcc taataaaatg aggaaattgc abcgcabbgb cbgagbaggb gtcattctat 4320 
tctggggggt ggggbggggc aggacagcaa gggggaggat tgggaagaca atagcaggca 4380 
bgcbggggab gcggtgggct ctatgggtac ccaggtgctg aagaattgac ccggbbcctc 444 0 
ctgggccaga aagaagcagg cacatcccct tctctgtgac acaccctgtc cacgcccctg 4500 
gbbcbtagbb ccagccccac tcataggaca ctcatagctc aggagggctc cgccttcaat 4560 
cccacccgct aaagtacttg gagcggtctc tccctccctc atcagcccac 1 caaaccaaac 4620 
ctagcctcca agagtgggaa gaaattaaag caagataggc tattaagtgc agagggagag 4 680 
aaaatgcctc caacatgtga ggaagtaatg agagaaatca tagaatttta aggccatcat 4740 
ggccttaatc tbccgcttcc tcgctcacbg acbcgcbgcg ctcggbcgtb cggctgcggc 4800 
gagcggbabc agcbcactca aaggcggbaa bacggtbabc cacagaabca ggggataacg 4860 
caggaaagaa cabgbgagca aaaggccagc aaaaggccag gaaccgbaaa aaggccgcgb 4920 
bgcbggcgbb btbccabagg cbccgccccc cbgacgagca bcacaaaaat cgacgcbcaa 4980 
gbcagaggbg gcgaaacccg acaggacbab aaagabacca ggcgbbbccc cctggaagcb 5040 
cccbcgbgcg cbcbccbgtb ccgacccbgc cgcbtaccgg ataccbgbcc gccbbbcbcc 5100 
cbbcgggaag cgbggcgcbb bctcabagct cacgcbgbag gtabcbcagb bcggbgbagg 5160 
bcgtbcgcbc caagcbgggc bgbgtgcacg aaccccccgb bcagcccgac cgctgcgccb 5220 
babccggtaa cbabcgbcbb gagtccaacc cggbaagaca cgacbbabcg ccacbggcag 5280 
cagccacbgg baacaggabb agcagagcga ggbabgbagg cggtgcbaca gagbtcbbga 5340 
agbggbggcc baacbacggc bacacbagaa gaacagbabb bggtabcbgc gctcbgcbga 5400 
agccagbbac cbbcggaaaa agagtbggba gcbctbgabc cggcaaacaa accaccgcbg 5460 
gbagcggbgg bbbbbbbgbt bgcaagcagc agatbacgcg cagaaaaaaa ggabcbcaag 5520 
aagabccbbb gabcbbbbcb acggggbcbg acgcbcagbg gaacgaaaac bcacgbbaag 5580 
ggabbbbggb cabgagabba bcaaaaagga bcbbcaccba gabccbbbta aabbaaaaab 5640 
gaagbbbbaa atcaabcbaa agbabababg agbaaacbbg gbcbgacagb baccaabgcb 5700 
baabcagbga ggcaccbabc bcagcgatcb gtcbabbbcg bbcabccaba gtbgccbgac 5760 
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tcgggggggg ggggcgctga ggbcbgccbc gtgaagaagg bgbbgcbgac tcataccagg 5820 
cctgaatcgc cccatcatcc agccagaaag tgagggagcc acggttgatg agagctbbgb 5880 
tgtaggtgga ccagttggtg atbbtgaacb bbbgcbbtgc cacggaacgg tctgcgttgt 5940 
cgggaagatg cgbgabcbga tccttcaact cagcaaaagt bcgabbbabb caacaaagcc 6000 
gccgtcccgt caagtcagcg taatgctctg ccagtgttac aaccaattaa ccaattctga 6060 
ttagaaaaac tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gabbabcaab 6120 
accababbbb bgaaaaagco gtttctgtaa tgaaggagaa aactcaccga ggcagttcca 6180 
baggabggca agatcctggt abcggbcbgc gattccgact cgtccaacat caatacaacc 6240 
tabbaabbtc ccctcgtcaa aaataaggth atcaagtgag aaatcaccat gagtgacgac 6300 
bgaabccggb gagaatggca aaagcttatg cabbtctbbc cagacttgtt caacaggcca 6360 
gccattacgc tcgtcatcaa aabcactcgc atcaaccaaa ccgttattca bbcgtgabbg 6420 
cgcctgagcg agacgaaata cgcgatcgct gttaaaagga caattacaaa caggaatcga 64 80 
ahgcaaccgg cgcaggaaca ctgccagcgc atcaacaata ttbtcacctg aatcaggata 6540 
ttcttctaat acctggaatg ctgttttccc ggggatcgca gtggtgagta accatgcatc 6600 
atcaggagba cggataaaat gcttgatggt cggaagaggc ataaattccg tcagccagtt 6660 
tagtctgacc atctcatctg taacatcatt ggcaacgcta ccbbbgccab gbbbcagaaa 6720 
caacbcbggc gcabcgggcb tcccatacaa tcgatagatt gtcgcacctg attgcccgac 6780 
attatcgcga gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg 6840 
cctcgagcaa gacgtttccc gttgaatatg gctcataaca ccccttgtat bacbgbtbab 6900 
gtaagcagac agttttattg bbcabgabga batatbbbba tctbgtgcaa tgtaacatca 6960 
gagattttga gacacaacgt ggctttcccc ccccccccat tattgaagca tttatcaggg 7020 
ttattgtctc atgagcggat acatabbbga atgtatttag aaaaabaaac aaataggggt 7080 
tccgcgcaca btbccccgaa aagtgccacc tgacgtctaa gaaaccatta ttatcatgac 7140 
attaacctat aaaaabaggc gtatcacgag gcccbbbcgb c 7181 



<210> 71 
<211> 7493 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 71 

bcgcgcgtbb cggbgatgac ggbgaaaacc tcbgacacab gcagctcccg gagacggtca 60 
cagcbbgbcb gtaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggtg 120 
btggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgba ctgagagbgc 180 
accabatgcg gtgtgaaaba ccgcacagat gcgbaaggag aaaabaccgc atcagattgg 24 0 
cbatbggcca btgcabacgt bgbabccaba bcabaababg bacabbbaba btggcbcabg 300 
bccaacabba ccgccabgbt gacabbgatb abbgacbagb tabbaabagb aabcaatbac 360 
ggggbcabba gtbcabagcc cabababgga gbbccgcgbt acataacbba cggtaaabgg 420 
cccgccbggc tgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 480 
cabagbaacg ccaabaggga cbbbccabtg acgbcaabgg gbggagbabb bacggbaaac 540 
bgcccacbbg gcagbacabc aagbgbabca babgccaagb acgcccccba bbgacgbcaa 600 
bgacggbaaa bggcccgcct ggcabbabgc ccagbacabg accbbabggg actbbccbac 660 
btggcagbac atcbacgbat bagtcabcgc tabbaccabg gbgabgcggb bttggcagba 720 
cabcaabggg cgbggatagc ggttbgacbc acggggabbb ccaagbcbcc accccabbga 780 
cgtcaabggg agbbbgtbbt ggcaccaaaa tcaacgggac bbtccaaaab gtcgbaacaa 840 
ctccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbct ababaagcag 900 
agcbcgbtba gbgaaccgbc agabcgccbg gagacgccab ccacgcbgbb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggab 1020 
tccccgtgcc aagagbgacg baagbaccgc ctabagacbc tabaggcaca ccccbbbggc 1080 
bcbbabgcab gcbabacbgt tbbbggcbbg gggccbabac acccccgcbb ccbbabgcba 1140 



- 166 - 



WO 02/32943 PCT/US01/25721 

ta 99tgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gabacbbbcc abtacbaabc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaabac bctgbccbbc agagactgac acggactctg batbbtbaca 1320 
ggatggggtc ccabtbabba bbtacaaabt cacatataca acaacgccgt cccccgbgcc 13 80 
cgcagttttt attaaacaba gcgbgggabc tccacgcgaa tctcgggtac gtgbbccgga 1440 
cabgggcbcb tcbccggbag cggcggagct bccacabccg agccctggtc ccabgccbcc 1500 
agcggctcab ggbcgcbcgg cagcbccbtg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgtgccg cacaaggccg bggcggbagg gtatgtgbcb 1620 
gaaaabgagc gtggagabbg ggctcgcacg gctgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgbb gbabbcbgab aagagbcaga ggtaacbccc 1740 
gtbgcggbgc bgbbaacggb ggagggcagb gtagbcbgag cagbacbcgb tgctgccgcg 1800 
cgcgccacca gacabaabag cbgacagacb aacagacbgb bcctbbccab gggbcbbttc 1860 
tgcagbcacc gtcgtcgaca cgtgbgabca gababcgcgg ccgcbcbaga acaccabgcg 1920 
cgbgaaggag aagbaccagc accbgbggcg ctggggcbgg cgctggggca ccabgcbgcb 1980 
gggcabgcbg atgatcbgca gcgccaccga gaagctgtgg gbgaccgtgb acbacggcgb 2040 
gcccgtgtgg aaggaggcca ccaccaccct gcbctgcgcc agcgacgcca aggccbacga 2100 
caccgaggbg cacaacgbgb gggccaccca cgccbgcgtg cccaccgacc ccaaccccca 2160 
ggaggbggbg ctggtgaacg bgaccgagaa ctbcgacatg bggaagaacg acabggtgga 2220 
gcagabgcac gaggacabca bcagccbgbg ggaccagagc cbgaagcccb gcgtgaagcb 2280 
gaccccccbg bgcgtgagcc bgaagbgcac cgaccbgaag aacgacacca acaccaacag 2340 
cagcagcggc cgcabgabca tggagaaggg cgagabcaag aacbgcagct bcaacatcag 24 00 
caccagcabc cgcggcaagg bgcagaagga gtacgccbbc tbcbacaagc bggacatcat 2460 
ccccabcgac aacgacacca ccagcbacag cctgaccagc bgcaacacca gcgbgatcac 252 0 
ccaggccbgc cccaaggbga gcbbcgagcc catccccaac cacbacbgcg cccccgccgg 2580 
cbtcaccabc cbgaagbgca aggacaagaa gtbcaacggc aagggccccb gcaccaacgt 264 0 
gagcaccgtg cagbgcaccc acggcatccg ccccgtggtg agcacccagc bgcbgctgaa 270 0 
cgaccgcctg gccgaggagg aggtggbgab ccgcagcgcc aacbbcgccg acaacgccaa 2760 
ggbgabcabc gtgcagcbga acgagagcgb ggagabcaac bgcacccgcc ccaacaacaa 282 0 
cacccgcaag agcabccaca bcggccccgg ccgcgcctbc tacaccaccg gcgagabcat 288 0 
c 99 c gacatc cgccaggccc actgcaacct gagccgcgcc aagbggaacg acacccbgaa 2 94 0 
caagabcgbg atcaagcbgc gcgagcagbb cggcaacaag accabcgtgb bcaagcacag 3 00 0 
cagcggcggc gaccccgaga bcgbgaccca cagcttcaac bgcggcggcg agbbcbbcba 3060 
cbgcaacagc acccagcbgb bcaacagcac cbggtbcaac agcaccbgga gcaccgaggg 3120 
cagcaacaac accgagggca gcgacaccab cacccbgccc bgccgcabca agcagabcab 3180 
caacatgtgg cagaaggtgg gcaaggccab gbacgccccc cccabcagcg gccagatccg 324 0 
cbgcagcagc aacabcaccg gccbgcbgcb gacccgcgac ggcggcaaca gcaacaacga 3300 
gagcgagabc bbccgcctgg gcggcggcga cabgcgcgac aacbggcgca gcgagctgba 33 60 
caagbacaag gbggbgaaga bcgagccccb gggcgbggcc cccaccaagg ccaagcgccg 342 0 
cgbggbgcag cgcgagaagc gcgccgbggg cabcggcgcc cbgbbccbgg gcbbcctggg 34 8 0 
cgccgccggc agcaccabgg gcgccgccag cabgacccbg accgbgcagg cccgccagcb 354 0 
gcbgagcggc abcgbgcagc agcagaacaa ccbgcbgcgc gccabcgagg cccagcagca 3600 
cctgctgcag cbgaccgbgb ggggcabcaa gcagcbgcag gcccgcabcc bggccgbgga 3660 
gcgcbaccbg aaggaccagc agcbgcbggg cabcbggggc bgcagcggca agcttabcbg 3720 
caccaccgcc gbgccctgga acgccagctg gagcaacaag agccbggagc agatcbggaa 3780 
ccacaccacc bggabggagb gggaccgcga gatcaacaac bacaccagcc bgatccacag 384 0 
cctgabcgag gagagccaga accagcagga gaagaacgag caggagcbgc bggagcbgga 3900 
caagbgggcc agcctgbgga actggbbcaa cabcaccaac bggcbgbggb acabcaagcb 3 960 
gbbcabcabg abcgbgggcg gccbggbggg ccbgcgcabc gbgbbcgccg bgcbgagcat 4 020 
cgbgaaccgc gbgcgccagg gcbacagccc ccbgagcttc cagacccacc bgcccabccc 408 0 
ccgcggcccc gaccgccccg agggcabcga ggaggagggc ggcgagcgcg accgcgaccg 414 0 
cacgabccgc cbggbgaacg gcagccbggc ccbgabctgg gacgaccbgc gcagccbgbg 4200 
ccbgbtcagc baccaccgcc bgcgcgaccb gcbgctgabc gbgacccgca tcgbggagct 4260 
gcfcgggccgc cgcggctggg aggcccbgaa gtacbggtgg aaccbgcbgc agbacbggag 432 0 
ccaggagcbg aagaacagcg ccgbgagccb gcbgaacgcc accgccabcg ccgbggccga 4380 
gggcaccgac cgcgtgabcg aggbggbgca gggcgccbgc cgcgccatcc gccacatccc 4440 
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ccgccgcabc cgccagggcc bggagcgcab cctgctgtga ggabccagab ctgctgtgcc 4500 
ttctagttgc cagccabcbg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg 4560 
bgccacbccc acbgbccbbb ccbaabaaaa tgaggaaatt gcatcgcatt gtctgagtag 4620 
gtgbcabbcb attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga 46B0 
caatagcagg catgctgggg atgcggtggg cbcbabgggb acccaggtgc tgaagaattg 4740 
acccggttcc tcctgggcca gaaagaagca ggcacabccc cttctctgtg acacaccctg 4800 
tccacgcccc bggbbcbbag ttccagcccc actcatagga cactcatagc tcaggagggc 4860 
tccgccttca atcccacccg ctaaagtact tggagcggtc tctccctccc tcatcagccc 4920 
accaaaccaa accbagccbc caagagtggg aagaaattaa agcaagatag gctattaagt 4980 
gcagagggag agaaaabgcc tccaacafcgt gaggaagbaa fcgagagaaab catagaattt 5040 
taaggccatc atggccbbaa bcbbccgcbb ccbcgcbcac tgactcgctg cgctcggtcg 5100 
ttcggctgcg gcgagcggta tcagctcact caaaggcggb aatacggtta tccacagaat 5160 
caggggataa cgcaggaaag aacatghgag caaaaggcca gcaaaaggcc aggaaccgta 5220 
aaaaggccgc gttgctggcg tttttccata ggcbccgccc ccctgacgag catcacaaaa 5280 
abcgacgcbc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgbbbc 5340 
cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggabacctgb 5400 
ccgccbbbcb cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca 5460 
gbbcggbgba ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg 5520 
accgcbgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacbbab 5580 
cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 5640 
cagagttctt gaagbggbgg cctaactacg gctacactag aagaacagta tttggtatct 5700 
gcgctctgct gaagccagfct accbbcggaa aaagagttgg tagctcbbga fcccggcaaac 5760 
aaaccaccgc tggbagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa 5820 
aaggatctca agaagabcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa 5880 
actcacgbta agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 5940 
taaattaaaa abgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 6000 
gttaccaatg cttaatcagt gaggcaccta tctcagcgafc ctgtctattt cgfctcatcca 6060 
tagttgcctg actcgggggg ggggggcgct gaggtctgcc tcgtgaagaa ggtgttgctg 6120 
actcatacca ggcctgaatc gccccatcat ccagccagaa agtgagggag ccacggttga 6180 
tgagagcttt gttgtaggbg gaccagbbgg tgabbbbgaa cbbbtgcbbt gccacggaac 6240 
ggbctgcgbt gbcgggaaga tgcgbgabcb gabccbbcaa cbcagcaaaa gbbcgabbba 6300 
btcaacaaag ccgccgbccc gtcaagbcag cgbaabgcbc bgccagbgbt acaaccaatb 6360 
aaccaabbcb gattagaaaa actcatcgag cabcaaabga aacbgcaatt battcatabc 6420 
aggabbabca abaccababb btbgaaaaag ccgtbbcbgb aabgaaggag aaaactcacc 6480 
gaggcagbbc cabaggabgg caagabccbg gbabcggbcb gcgabbccga cbcgbccaac 6540 
abcaabacaa ccbabbaabb bccccbcgbc aaaaabaagg bbabcaagbg agaaabcacc 6600 
atgagbgacg acbgaabccg gbgagaabgg caaaagcbba bgcatbbcbb bccagacbbg 6660 
btcaacaggc cagccabbac gcbcgbcabc aaaabcacbc gcabcaacca aaccgtbabt 6720 
cabbcgbgab bgcgccbgag cgagacgaaa bacgcgabcg cbgtbaaaag gacaatbaca 6780 
aacaggaabc gaabgcaacc ggcgcaggaa cactgccagc gcabcaacaa babbtbcacc 6840 
bgaatcagga babbcbbcba abaccbggaa bgcbgbbbtc ccggggabcg cagbggbgag 6900 
baaccabgca bcabcaggag bacggabaaa abgcbbgatg gbcggaagag gcabaaabbc 6960 
cgbcagccag btbagbcbga ccabcbcabc bgbaacabca tbggcaacgc baccbbbgcc 7020 
abgbbbcaga aacaacbctg gcgcabcggg cbbcccabac aatcgabaga bbgbcgcacc 7080 
bgabtgcccg acatbabcgc gagcccabbb abacccatab aaabcagcat ccabgbbgga 7140 
abbbaabcgc ggccbcgagc aagacgtbbc ccgtbgaaba bggcbcabaa cacccctbgb 7200 
abbacbgtbb atgbaagcag acagbbbbab bgbbcabgab gabababbbt babcbbgbgc 7260 
aabgbaacab cagagabbbb gagacacaac gbggcbbbcc cccccccccc abbabbgaag 7320 
cabttabcag ggbtabbgbc bcabgagcgg abacababbb gaabgbabbt agaaaaabaa 7380 
acaaabaggg gtbccgcgca cabbbccccg aaaagbgcca cctgacgtct aagaaaccab 7440 
tabbabcabg acabbaacct abaaaaabag gcgtabcacg aggcccbbbc gtc 7493 
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<211> 7186 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVRl0l2x/s containing HIV genes 
<400> 72 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 72 0 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 12 00 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 162 0 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 204 0 
cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtgaacgt gaccgagaac ttcgacatgt ggaagaacga catggtggag 222 0 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 2280 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcat ggagaagggc gagatcaaga actgcagctt caacatcagc 24 00 
accagcatcc gcggcaaggt gcagaaggag tacgccttct tctacaagct ggacatcatc 2460 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgtgatcacc 2520 
caggcctgcc ccaaggtgag cttcgagccc atccccaacc actactgcgc ccccgccggc 258 0 
ttcaccatcc tgaagtgcaa caacaagacc ttcaacggca ccggcccctg caccaacgtg 264 0 
agcaccgtgc agtgcaccca cggcatccgc cccgtggtga gcacccagct gctgctgaac 2700 
ggcagcctgg ccgaggagga ggtggtgatc cgcagcgtga acttcaccga caacgccaag 2760 
accatcatcg tgcagctgaa caccagcgtg gagatcaact gcacccgccc caacaacaac 2820 
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acccgcaagc gaattcgcat ccagcgcggc cccggccgcg ccttcgtgac catcggcaag 2880 
atcggcaaca tgcgccaggc ccactgcaac atcagccgcg ccaagbggaa caacacccbg -2940 
aagcagatcg acagcaagct gcgcgagcag tbcggcaaca acaagaccat cabcttcaag 3000 
cagagcagcg gcggcgaccc cgagatcgbg acccacagct tcaactgcgg cggcgagttc 3060 
ttctactgca acagcaccca gctgttcaac agcaccbggb bcaacagcac ctggagcacc 3120 
9 a 999 c agca acaacaccga gggcagcgac accatcaccc tgccctgccg catcaagcag 3180 
atcatcaaca tgtggcagaa ggtgggcaag gccatgtacg ccccccccat cagcggccag 3240 
atccgctgca gcagcaacat caccggcctg ctgctgaccc gcgacggcgg caacagcaac 3300 
aacgagagcg agatcttccg cctgggcggc ggcgacatgc gcgacaactg gcgcagcgag 3360 
ctgtacaagb acaaggtggt gaagabcgag cccctgggcg tggcccccac caaggccaag 3420 
cgccgcgtgg tgcagcgcga gaagcgcgcc gtgggcatcg gcgccctgtt cctgggcttc 3480 
ctgggcgccg ccggcagcac catgggcgcc gccagcatga ccctgaccgt gcaggcccgc 3540 
cagctgctga gcggcatcgt gcagcagcag aacaacctgc bgcgcgccat cgaggcccag 3600 
cagcacctgc tgcagctgac cgtgtggggc atcaagcagc tgcaggcccg catcctggcc 3660 
gtggagcgcb acctgaagga ccagcagctg cbgggcabct ggggctgcag cggcaagcbb 3720 
atcbgcacca ccgccgtgcc ctggaacgcc agcbggagca acaagagccb ggagcagabc 3780 
bggaaccaca ccacctggat ggagtgggac cgcgagabca acaactacac cagcctgatc 384 0 
cacagccbga tcgaggagag ccagaaccag caggagaaga acgagcagga gcbgcbggag 3900 
ctggacaagt gggccagcct gtggaactgg tbcaacatca ccaactggct gfcggtacatc 3960 
aagctgttca tcabgabcgb gggcggcctg gtgggcctgc gcabcgtgtt cgccgtgctg 4 020 
agcatcgtga accgcgtgcg ccagggctac agccccctga gcbbccagac ccacctgccc 4080 
abcccccgcg gccccgaccg ccccgagggc atcgaggagg agggcggcga gcgcgaccgc 4140 
gaccgcacga tccgcctggt gaacggcagc tgaggatcca gatctgctgt gccttctagt 42 00 
bgccagccab cbgbbgbbbg cccctccccc gbgccbbccb tgaccctgga aggtgccact 4260 
cccactgtcc tbtccbaaba aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4320 
bctabtcbgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4380 
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa bbgacccggb 4440 
tcctcctggg ccagaaagaa gcaggcacat cccctfcctct gtgacacacc ctgtccacgc 4500 
ccctggttct bagtbccagc cccactcata ggacactcat agctcaggag ggctccgcct 4560 
tcaatcccac ccgcbaaagb acttggagcg gtcfcctccct ccctcatcag cccaccaaac 4620 
caaacctagc ctccaagagt gggaagaaat fcaaagcaaga taggchafcta agtgcagagg 4680 
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 474 0 
atcatggcct taatcbtccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 4800 
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 4860 
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 4920 
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 4980 
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 5 040 
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 5100 
tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt 5160 
gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 5220 
cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact 5280 
ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 5340 
cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct 5400 
gctgaagcca gtbaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 5460 
cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 5520 
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 5580 
ttaagggatt htggfccatga gattatcaaa aaggafccttc acctagatcc ttttaaatta 5640 
aaaabgaagt tttaaatcaa bctaaagtat atatgagtaa acfctggtctg acagttacca 5700 
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 5760 
ctgactcggg gggggggggc gctgaggtct gcctcgtgaa gaaggtgttg ctgactcata 5 820 
ccaggcctga atcgccccat catccagcca gaaagtgagg gagccacggt tgatgagagc 5880 
tttgttgtag gtggaccagt tggtgatttt gaacttttgc tttgccacgg aacggtcbgc 5940 
gtbgbcggga agabgcgtga bcbgatcctt caacbcagca aaagbtcgat btatbcaaca 6000 
aagccgccgt cccgbcaagt cagcgtaabg cbcbgccagb gbbacaacca abtaaccaat 6060 
bcbgabtaga aaaacbcabc gagcatcaaa tgaaacbgca atbbatbcab abcaggabta 6120 
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tcaataccat atttttgaaa aagccgtttc tgtaatgaag gagaaaactc accgaggcag 6180 
ttccatagga tggcaagatc ctggtatcgg tctgcgattc cgactcgtcc aacatcaata 624 0 
caacctatta atttcccctc gtcaaaaata aggttatcaa gtgagaaatc accatgagtg 6300 
acgactgaat ccggtgagaa hggcaaaagc ttatgcattt ctttccagac ttgttcaaca 6360 
ggccagccat tacgctcgtc atcaaaatca ctcgcatcaa ccaaaccgtt attcattcgt 6420 
gattgcgcct gagcgagacg aaatacgcga tcgctgttaa aaggacaatt acaaacagga 6480 
atcgaatgca accggcgcag gaacactgcc agcgcatcaa caatattttc acctgaatca 6540 
ggatattctt ctaatacctg gaatgctgtt ttcccgggga tcgcagtggt gagtaaccat 6600 
gcatcatcag gagtacggat aaaatgcttg atggtcggaa gaggcataaa ttccgtcagc 6660 
cagtttagtc tgaccatctc atctgtaaca tcattggcaa cgctaccttt gccatgtttc 6720 
agaaacaact ctggcgcatc gggcttccca tacaatcgat agattgtcgc acctgattgc 6780 
ccgacattat cgcgagccca tttataccca tataaatcag catccatgtt ggaatttaat 6840 
cgcggcctcg agcaagacgt ttcccgttga atatggctca taacacccct tgtattactg 6900 
tttatgtaag cagacagbtt battgttcat gatgatatat ttttatcttg tgcaatgtaa 6960 
catcagagat tttgagacac aacgtggctt tccccccccc cccattattg aagcatttat 7020 
cagggttatt gtctcabgag cggabacaba btbgaatgta bbtagaaaaa taaacaaata 7080 
ggggbtccgc gcacatbbcc ccgaaaagtg ccaccbgacg bctaagaaac cattabtatc 714 0 
abgacattaa ccbabaaaaa baggcgbabc acgaggccct bbcgtc 7186 



<210> 73 
<211> 7003 
<212> DNA : 

<213> Artificial Sequence 

<220> 

<223> plaamid pVR1012x/s containing HIV genes 
<400> 73 

bcgcgcgtbt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaabaccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
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agcggctcab ggbcgcbcgg cagcbccbbg ctcctaacag tggaggccag acttaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gbggagabbg ggctcgcacg gctgacgcag abggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagacbgb tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgbgbgabca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga bgabcbgcag cgccaccgag aagctgbggg bgaccgbgba cbacggcgtg 2040 
cccgtgtgga aggaggccac caccaccctg cbctgcgcca gcgacgccaa ggccbacgac 2100 
accgaggtgc acaacgbgbg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggbggbgc bggbgaacgb gaccgagaac ttcgacabgt ggaagaacga catggtggag 2220 
cagatgcacg aggacabcab cagccfcgtgg gaccagagcc bgaagccctg cgtgaagctg 2280 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2 34 0 
agcagcggcc gcatgatcat ggagaagggc gagatcaaga actgcagctt caacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag tacgccttct tctacaagct ggacabcabc 2460 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgtgatcacc 2520 
caggcctgcc ccaaggtgag cttcgagccc atccccaacc actactgcgc ccccgccggc 2580 
ttcaccatcc tgaagtgcaa caacaagacc bbcaacggca ccggcccctg caccaacgtg 2640 
agcaccgtgc agbgcaccca cggcatccgc cccgbggbga gcacccagct gcbgcbgaac 2700 
ggcagcctgg ccgaggagga ggtggtgatc cgcagcgtga acttcaccga caacgccaag 2760 
accatcatcg bgcagcbgaa caccagcgtg gagabcaacb gcacccgccc caacaacaac 2 820 
acccgcaagc gaabbcgcat ccagcgcggc cccggccgcg cctfccgtgac catcggcaag 2880 
atcggcaaca tgcgccaggc ccactgcaac atcagccgcg ccaagtggaa caacaccctg 2940 
aagcagabcg acagcaagcb gcgcgagcag ttcggcaaca acaagaccat catcttcaag 3000 
cagagcagcg gcggcgaccc cgagatcgtg acccacagct tcaacbgcgg cggcgagttc 3 060 
ttctactgca acagcaccca gctgttcaac agcacctggt tcaacagcac ctggagcacc 312 0 
gagggcagca acaacaccga gggcagcgac accatcaccc tgccctgccg catcaagcag 3180 
atcatcaaca tgtggcagaa ggtgggcaag gccatgtacg ccccccccat cagcggccag 3240 
atccgctgca gcagcaacab caccggcctg ctgctgaccc gcgacggcgg caacagcaac 3300 
aacgagagcg agabcbbccg cebgggcggc ggcgacatgc gcgacaactg gcgcagcgag 3360 
ctgtacaagt acaaggbggb gaagatcgag cccctgggcg tggcccccac caaggccaag 342 0 
cttaccgtgc aggcccgcca gctgctgagc ggcatcgtgc agcagcagaa caacctgctg 34 80 
cgcgccatcg aggcccagca gcacctgctg cagctgaccg tgtggggcat caagcagctg 354 0 
caggcccgca fccctggccgt ggagcgcbac ctgaaggacc agcagctgga gcagatctgg 3 600 
aaccacacca cctggatgga gtgggaccgc gagatcaaca actacaccag cctgatccac 3660 
agcctgatcg aggagagcca gaaccagcag gagaagaacg agcaggagct gctggagctg 372 0 
gacaagtggg ccagcctgtg gaactggbtc aacatcacca actggctgbg gtacatcaag 3780 
cbgtbcabca tgatcgtggg cggcctggtg ggcctgcgca tcgtgttcgc cgtgctgagc 3 84 0 
atcgtgaacc gcgtgcgcca gggctacagc cccctgagct tccagaccca ccbgcccabc 3900 
ccccgcggcc ccgaccgccc cgagggcatc gaggaggagg gcggcgagcg cgaccgcgac 3960 
cgcacgatcc gcctggtgaa cggcagctga ggatccagat ctgctgtgcc ttctagttgc 402 0 
cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 4080 
actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct 4140 
attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg 4200 
catgctgggg atgcggtggg ctctatgggt acccaggtgc tgaagaattg acccggttcc 4260 
tccfcgggcca gaaagaagca ggcacatccc cttctcbgtg acacaccctg tccacgcccc 4320 
tggttcttag ttccagcccc acfccatagga cactcatagc tcaggagggc tccgccttca 43 80 
atcccacccg ctaaagtact tggagcggtc tctccctccc tcatcagccc accaaaccaa 444 0 
acctagcctc caagagfcggg aagaaattaa agcaagatag gcbattaagt gcagagggag 4 500 
agaaaatgcc tccaacabgb gaggaagbaa bgagagaaab cabagaatbt baaggccabc 4560 
abggccbbaa bcbbccgctt ccbcgcbcac bgactcgcbg cgcbcggbcg btcggcbgcg 4620 
gcgagcggba tcagcbcacb caaaggcggb aatacggtta tccacagaat caggggabaa 4680 
cgcaggaaag aacabgtgag caaaaggcca gcaaaaggcc aggaaccgba aaaaggccgc 4740 
gbbgcbggcg bbbbbccata ggcbccgccc cccbgacgag catcacaaaa atcgacgcbc 4800 



~ 172 



WO 02/32943 PCT/USO 1/25721 

aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag 4860 
ctccctcgtg cgcbcbccbg ttccgaccct gccgcttacc ggatacctgt ccgccbbbcb 4920 
cccttcggga agcgtggcgc bbbcbcabag ctcacgctgt aggtatctca gbbcggbgba 4980 
ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc ghtcagcccg accgctgcgc 5040 
ctbatccggt aacbabcgbc btgagtccaa cccggtaaga cacgactbab cgccactggc 5100 
agcagccact ggtaacagga ttagcagagc gaggbabgba ggcggtgcta cagagbbcbb 5160 
gaagtggtgg ccbaacbacg gcbacacbag aagaacagta bbbggtabcb gcgctctgct 5220 
gaagccagtt accbbcggaa aaagagttgg tagctcttga bccggcaaac aaaccaccgc 5280 
tggtagcggt ggtttttttg bbbgcaagca gcagattacg cgcagaaaaa aaggatctca 5340 
agaagatcct bbgabcbtbb ctacggggtc tgacgctcag tggaacgaaa actcacgtta 5400 
agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa 5460 
atgaagtttt aaatcaatct aaagtabata tgagtaaact tggtctgaca gttaccaatg 5520 
cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg 5580 
actcgggggg ggggggcgct gaggtctgcc bcgbgaagaa ggbgbbgctg acbcabacca 5640 
ggcctgaatc gccccatcat ccagccagaa agtgagggag ccacggttga tgagagcttt 5700 
gbbgtaggbg gaccagbtgg bgabbbbgaa cbbbtgcbbb gccacggaac ggtctgcgtt 5760 
gtcgggaaga bgcgbgatcb gabcctbcaa ctcagcaaaa gttcgattta bbcaacaaag 5820 
ccgccgtccc gtcaagtcag cgtaatgctc bgccagtgbb acaaccaatb aaccaattct 5880 
gabbagaaaa acbcabcgag cabcaaabga aacbgcaabb tattcatatc aggatbabca 5940 
ataccatatt bbbgaaaaag ccgtttctgt aatgaaggag aaaactcacc gaggcagttc 6000 
cataggatgg caagatcctg gtatcggtct gcgabbccga ctcgtccaac atcaatacaa 6060 
cctattaatt tcccctcgbc aaaaabaagg bbabcaagbg agaaabcacc abgagbgacg 6120 
acbgaatccg gtgagaatgg caaaagctba bgcatbbcbb bccagacbtg bbcaacaggc 6180 
cagccatbac gctcgtcabc aaaabcacbc gcabcaacca aaccgttatb cattcgbgab 6240 
bgcgcctgag cgagacgaaa bacgcgabcg cbgbbaaaag gacaabbaca aacaggaabc 6300 
gaabgcaacc ggcgcaggaa cacbgccagc gcatcaacaa tatbbtcacc tgaatcagga 6360 
babbcbtcba abacctggaa bgcbgbbbbc ccggggabcg cagtggtgag baaccabgca 6420 
bcabcaggag tacggabaaa abgcttgatg gbcggaagag gcataaatbc cgbcagccag 6480 
btbagtcbga ccabctcatc bgbaacabca bbggcaacgc bacctbtgcc abgbbbcaga 6540 
aacaacbcbg gcgcabcggg cbtcccatac aabcgabaga btgbcgcacc bgabbgcccg 6600 
acabtabcgc gagcccatbt abacccatab aaabcagcab ccabgbtgga abtbaabcgc 6660 
ggccbcgagc aagacgbbbc ccgbbgaaba bggcbcabaa cacccctbgb abtacbgtbb 6720 
abgbaagcag acagbtbbab bgbtcabgab gabababbtt batcttgtgc aabgtaacab 67B0 
cagagabbbb gagacacaac gtggcbbbcc cccccccccc abtatbgaag cabtbabcag 6840. 
ggbbabbgbc tcatgagcgg atacatabtb gaatgbabtb agaaaaabaa acaaataggg 6900 
gbbccgcgca cabbbccccg aaaagbgcca ccbgacgbcb aagaaaccab tabbabcabg 6960 
acabbaaccb abaaaaabag gcgbabcacg aggcccbbbc gbc 7003 



<210> 74 
<211> 7186 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVRl012x/s conbaining HIV gene6 
<400> 74 

bcgcgcgbbb cggtgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg tcagcgggbg 120 
ttggcgggbg bcggggcbgg ctbaacbabg cggcabcaga gcagabbgba cbgagagbgc 180 
accababgcg gbgtgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
cbabbggcca bbgcabacgb bgbabccaba bcabaababg bacabbbaba tbggcbcabg 300 
bccaacatba ccgccabgbb gacabbgabb abbgacbagb babbaabagb aabcaabbac 360 
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ggggbcabba gttcatagcc cabababgga 
cccgcctggc tgaccgccca acgacccccg 
catagbaacg ccaataggga cbbtccabbg 
tgcccacbtg gcagtacatc aagtgbabca 
tgacggtaaa tggcccgcct ggcabbabgc 
ttggcagtac atcbacgbab tagtcatcgc 
catcaatggg cgtggatagc ggtttgactc 
cgtcaatggg agtttgtttt ggcaccaaaa 
ctccgcccca ttgacgcaaa bgggcggbag 
agctcgttta gtgaaccgtc agatcgccfcg 
tagaagacac cgggaccgab ccagcctccg 
tccccgtgcc aagagtgacg taagtaccgc 
bcbbabgcab gctatactgt ttttggcttg 
baggbgabgg tatagcttag cctataggtg 
tattggtgac gatactttcc abbacbaabc 
tabtggcbab abgccaabac bcbgbccbbc 
ggatggggtc ccatttatta bbbacaaabb 
cgcagttttt attaaacata gcgtgggatc 
catgggctct tctccggtag cggcggagct 
agcggcbcab ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gbggagabbg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gtbgcggtgc bgbbaacggb ggagggcagb 
cgcgccacca gacabaabag cbgacagacb 
bgcagtcacc gtcgbcgaca cgbgbgabca 
gbgaaggaga agbaccagca ccbgtggcgc 
ggcabgcbga tgabcbgcag cgccaccgag 
cccgbgbgga aggaggccac caccacccbg 
accgaggbgc acaacgbgbg ggccacccac 
gaggtggbgc bggbcgacgb gaccgagaac 
cagatgcacg aggacabcab cagccbgbgg 
acccccctgb gcgbgagccb gaagbgcacc 
agcagcggcc gcabgabcab ggagaagggc 
accagcatcc gcggcaaggb gcagaaggag 
cccabcgaca acgacaccac cagcbacagc 
caggccbgcc ccaaggbgag cbbcgagccc 
bbcaccabcc tgaagbgcaa cgacaagacc 
agcaccgtgc agbgcaccca cggcabccgc 
ggcagcctgg ccgaggagga ggbggbgabc 
accabcabcg bgcaggbgga caccacggbg 
acccgcaagc gaabbcgcab ccagcgcggc 
abcggcaaca bgcgccaggc ccacbgcgab 
aagcagabcg acagcaagcb gcgcgagcag 
cagagcagcg gcggcgaccc cgagabcgbg 
bbcbacbgcg acagcaccca gcbgbbcaac 
gagggcagca acaacaccga gggcagcgac 
abcatcaaca bgbggcagaa ggbgggcaag 
abccgcbgca gcagcgabab caccggccbg 
aacgagagcg agabcbbccg cctgggcggc 
cbgbacaagb acaaggbggb gaagabcgag 
cgccgcgbgg tgcagcgcga gaagcgcgcc 
cbgggcgccg ccggcagcac cabgggcgcc 
cagcbgcbga gcggcabcgb gcagcagcag 
cagcaccbgc bgcagcbgac cgtgbggggc 



gbbccgcgtt acabaacbba cggbaaabgg 420 
cccabbgacg bcaabaabga cgtabgbbcc 480 
acgbcaabgg gbggagbabb bacggbaaac 540 
babgccaagb acgcccccba bbgacgbcaa 600 
ccagtacabg accbbatggg actbbccbac 660 
tabbaccabg gbgabgcggt bbbggcagba 720 
acggggabbb ccaagbcbcc accccabbga 780 
bcaacgggac bbbccaaaab gbcgbaacaa 840 
gcgtgbacgg bgggaggbcb ababaagcag 900 
gagacgccat ccacgcbgbt bbgaccbcca 960 
cggccgggaa cggbgcabbg gaacgcggab 1020 
cbabagactc bataggcaca ccccbbbggc 1080 
gggccbabac acccccgcbb ccbtatgcba 1140 
bgggbbabbg accabbabbg accacbcccc 1200 
cabaacabgg cbcbbbgcca caacbabcbc 1260 
agagacbgac acggacbcbg tatbbbbaca 1320 
cacababaca acaacgccgt cccccgtgcc 1380 
bccacgcgaa bcbcgggbac gbgbtccgga 1440 
bccacabccg agcccbggbc ccabgccbcc 1500 
cbccbaacag bggaggccag acbbaggcac 1560 
cacaaggccg bggcggbagg gbabgbgtcb 1620 
gcbgacgcag abggaagacb taaggcagcg 1680 
gtatbcbgab aagagbcaga ggbaacbccc 1740 
gbagbcbgag cagbacbcgb bgcbgccgcg 1800 
aacagacbgb bccbbbccab gggbcbbbbc 1860 
gababcgcgg ccgcbcbaga caccabgcgc 1920 
bggggctggc gcbggggcac cabgcbgcbg 1980 
aagcbgtggg tgaccgbgta cbacggcgbg 2040 
cbcbgcgcca gcgacgccaa ggccbacgac 2100 
gccbgcgbgc ccaccgaccc caacccccag 2160 
bbcgacabgt ggaagaacga cabggtggag 2220 
gaccagagcc bgaagcccbg cgbgaagctg 22 80 
gaccbgaaga acgacaccaa caccaacagc 2340 
gagabcaagg acbgbagcbb cgacabcagc 2400 
bacgccbbct bcbacaagcb ggacabcabc 2460 
cbgaccagcb gcgaaacbag bgbgabcacc 2520 
abccccaacc acbacbgcgc ccccgccggc 2580 
bbcgacggca ccgggcccbg caccgacgbg 2640 
cccgbggbga gcacccagct gcbgcbgaac 2700 
cgcagcgbcg acbbcaccga caacgccaag 2760 
gagabcgabb gcacccgccc caacaacaac 2820 
cccggccgcg ccbbcgbgac cabcggcaag 2880 
abcagccgcg ccaagbggga caacacccbg 2940 
bbcggcaacg acaagaccab cabcbbcaag 3000 
acccacagcb bcaacbgcgg cggcgagbbc 3060 
agcaccbggb bcaacagcac cbggagcacc 3120 
accabcaccc bgcccbgccg cabcaagcag 3180 
gccabgbacg ccccccccab cagcggccag 3240 
cbgcbgaccc gcgacggcgg caacagcaac 3300 
ggcgacabgc gcgacaacbg gcgcagcgag 3360 
ccccbgggcg bggcccccac caaggccaag 342 0 
gbgggcabcg gcgcccbgbt ccbgggcbbc 3480 
gccagcabga ccctgaccgt gcaggcccgc 354 0 
aacaaccbgc bgcgcgccab cgaggcccag 3600 
atcaagcagc bgcaggcccg cabccbggcc 3660 
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gtggagcgct accbgaagga ccagcagcbg ctgggcatct ggggcbgcag cggcaagcbb 3720 
atctgcacca ccgccgbgcc ctggaacgcc agctggagca acaagagcct ggagcagatc 3780 
tggaaccaca ccacctggat ggagtgggac cgcgagatca acaactacac cagcctgatc 3840 
cacagcctga tcgaggagag ccagaaccag caggagaaga acgagcagga gctgctggag 3900 
ctggacaagt gggccagcct gtggaactgg ttcaacatca ccaactggct gbggbacabc 3960 
aagctgttca tcatgatcgt gggcggcctg gbgggccbgc gcabcgbgbt cgccgtgcbg 4020 
agcabcgtga accgcgtgcg ccagggcbac agccccctga gcttccagac ccacctgccc 4080 
atcccccgcg gccccgaccg ccccgagggc atcgaggagg agggcggcga gcgcgaccgc 414 0 
gaccgcacga tccgccbggfc gaacggcagc bgaggabcca gatcbgcbgb gccttcbagt 4200 
bgccagccat cbgbtgbtbg ccccbccccc gtgccbbcct bgacccbgga aggtgccact 4260 
cccacbgbcc bbbccbaaba aaabgaggaa abbgcabcgc abtgbcbgag baggtgbcat 4320 
bcbabbcbgg ggggbggggb ggggcaggac agcaaggggg aggabbggga agacaabagc 43 80 
aggcabgcbg gggabgcggb gggcbctabg ggbacccagg tgcbgaagaa btgacccggb 4440 
bccbccbggg ccagaaagaa gcaggcacab ccccbbcbcb gbgacacacc cbgtccacgc 4500 
ccctggbbct tagtbccagc cccacbcaba ggacacbcab agcbcaggag ggctccgccb 4560 
bcaabcccac ccgcbaaagb acttggagcg gtcbcbcccb cccbcabcag cccaccaaac 4620 
caaaccbagc ctccaagagb gggaagaaab taaagcaaga baggcbabba agbgcagagg 4680 
gagagaaaab gccbccaaca bgtgaggaag taabgagaga aabcabagaa bbbtaaggcc 4740 
abcabggccb baabcttccg cbtcctcgcb cacbgacbcg cbgcgcbcgg bcgttcggcb 4 800 
gcggcgagcg gbabcagcbc acbcaaaggc ggbaabacgg tbabccacag aatcagggga 4 860 
baacgcagga aagaacatgb gagcaaaagg ccagcaaaag gccaggaacc gbaaaaaggc 4920 
cgcgttgcbg gcgbbbtbcc abaggcbccg ccccccbgac gagcatcaca aaaatcgacg 4980 
cbcaagbcag aggbggcgaa acccgacagg acbabaaaga taccaggcgb btccccctgg 504 0 
aagcbcccbc gbgcgctcbc cbgbbccgac ccbgccgcbb accggabacc bgbccgccbb 5100 
bctcccbbcg ggaagcgbgg cgcbbbcbca bagcbcacgc bgtaggbabc bcagttcggb 5160 
gbaggbcgbb cgcbccaagc bgggcbgbgb gcacgaaccc cccgbtcagc ccgaccgcbg 522 0 
cgccbtabcc ggtaactiatc gbctbgagbc caacccggba agacacgacb babcgccacb 5280 
ggcagcagcc acbggbaaca ggabbagcag agcgaggbab gbaggcggbg ctacagagbb 5340 
cbtgaagbgg bggccbaacb acggcbacac bagaagaaca gbabbbggba bcbgcgcbcb 5400 
gcbgaagcca gtbacctbcg gaaaaagagt bggbagcbcb bgabccggca aacaaaccac 5460 
cgcbggbagc ggbggbttbb bbgbbbgcaa gcagcagabb acgcgcagaa aaaaaggabc 5520 
bcaagaagab ccttbgabct btbcbacggg gtctgacgcb cagbggaacg aaaactcacg 5580 
bbaagggabb btggbcabga gabbatcaaa aaggabctbc accbagabcc tbtbaaabba 5640 
aaaabgaagb btbaaatcaa bcbaaagbab atatgagbaa actbggbcbg acagbbacca 5700 
abgcbbaabc agbgaggcac cbabcbcagc gabcbgbcba bbtcgbbcab ccabagbbgc 5760 
cbgacbcggg gggggggggc gcbgaggbcb gccbcgbgaa gaaggbgbbg cbgacbcaba 5820 
ccaggccbga atcgccccab cabccagcca gaaagbgagg gagccacggb tgabgagagc 5880 
bbtgbtgbag gtggaccagb bggbgabbbb gaacbbbtgc bbtgccacgg aacggtcbgc 594 0 
gbtgbcggga agabgcgbga bcbgabccbb caacbcagca aaagbbcgab bbabtcaaca 6000 
aagccgccgb cccgbcaagb cagcgbaabg cbcbgccagb gbbacaacca atbaaccaab 6060 
bctgabbaga aaaacbcabc gagcabcaaa bgaaacbgca abbtabbcab atcaggabba 6120 
bcaabaccab abbbbbgaaa aagccgbbbc bgbaabgaag gagaaaacfcc accgaggcag 6180 
bbccabagga bggcaagatc cbggtabcgg bcbgcgabbc cgactcgbcc aacabcaata 6240 
caaccbabba attbcccctc gbcaaaaaba aggbbabcaa gbgagaaabc accabgagbg 6300 
acgacbgaab ccggbgagaa bggcaaaagc btabgcabbb cbtbccagac bbgbbcaaca 6360 
ggccagccab' bacgcbcgbc abcaaaabca ctcgcabcaa ccaaaccgbb abbcabbcgb 6420 
gatbgcgccb gagcgagacg aaatacgcga bcgcbgbbaa aaggacaabb acaaacagga 64 80 
abcgaabgca accggcgcag gaacacbgcc agcgcatcaa caababbbbc accbgaabca 654 0 
ggabatbcbb ctaabacctg gaabgctgbb bbcccgggga bcgcagbggt gagbaaccab 6600 
gcabcabcag gagbacggab aaaatgctbg abggbcggaa gaggcabaaa bbccgbcagc 6660 
cagbbbagbc bgaccatcbc abcbgbaaca bcabbggcaa cgcbaccbbt gccabgbtbc 6720 
agaaacaacb ctggcgcabc gggcttccca bacaabcgab agabbgbcgc accbgabbgc 6780 
ccgacabbab cgcgagccca bbbataccca babaaabcag cabccabgbt ggaabbbaab 684 0 
cgcggccbcg agcaagacgb tbcccgbbga ababggcbca baacacccct bgbabbacbg 6900 
bbbatgbaag cagacagbtb babbgbtcab gabgababab bbtbabcbbg bgcaabgtaa 6960 
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catcagagat tttgagacac aacgtggctt tccccccccc cccattattg aagcatttat 7020 

cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 7080 

ggggbtccgc gcacatfctcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc 7140 

atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtc 7186 



<210> 75 
<211> 7186 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 75 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac ga.tactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 204 0 
cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtcgacgt gaccgagaac ttcgacatgt ggaagaacga catggtggag 2220 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgfcgaagctg 2280 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 234 0 
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agcagcggcc gcatgabcat ggagaagggc gagatcaagg actgtagctt cgacabcagc 2400 
accagcatcc gcggcaaggt gcagaaggag fcacgccbbcb tcfcacaagct ggacatcatc 2460 
cccatcgaca acgacaccac cagctacagc ctgaccagcb gcgaaactag bgbgabcacc 2520 
caggcctgcc ccaaggtgag cbbcgagccc atccccaacc actactgcgc ccccgccggc 2580 
ttcaccatcc tgaagtgcaa cgacaagacc ttcgacggca ccgggccctg caccgacgtg 2640 
agcaccgtgc agtgcaccca cggcatccgc cccgtggtga gcacccagcb gctgctgaac 2700 
ggcagcctgg ccgaggagga ggtggtgatc cgcagcgtcg acttcaccga caacgccaag 2760 
accahcatcg tgcaggtgga caccacggbg gagatcgabb gcacccgccc caacaacaac 2820 
acccgcaagc gaabtcgcafc ccagcgcggc cccggccgcg ccttcgtgac catcggcaag 2880 
atcggcaaca hgcgccaggc ccactgcaac atcagccgcg ccaagtggaa caacaccctg 2940 
aagcagatcg acagcaagct gcgcgagcag tbcggcaaca acaagaccat catcttcaag 3000 
cagagcagcg gcggcgaccc cgagatcgtg acccacagct tcaactgcgg cggcgagttc 3060 
tbcbacbgca acagcaccca gcbgttcaac agcacctggfc tcaacagcac ctggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accatcaccc tgccctgccg catcaagcag 3180 
abcabcaaca tgtggcagaa ggtgggcaag gccatgtacg ccccccccat cagcggccag 3240 
atccgctgca gcagcaacat caccggcctg ctgctgaccc gcgacggcgg caacagcaac 3300 
aacgagagcg agatcttccg cctgggcggc ggcgacatgc gcgacaactg gcgcagcgag 3360 
ctgtacaagt acaaggtggt gaagatcgag cccctgggcg tggcccccac caaggccaag 3420 
cgccgcgtgg tgcagcgcga gaagcgcgcc gtgggcatcg gcgcccbgtb cctgggcttc 3480 
ctgggcgccg ccggcagcac catgggcgcc gccagcatga ccctgaccgt gcaggcccgc 3540 
cagctgctga gcggcatcgt gcagcagcag aacaacctgc tgcgcgccat cgaggcccag 3600 
cagcacctgc tgcagctgac cgtgtggggc atcaagcagc tgcaggcccg catcctggcc 3660 
9tggagcgct accbgaagga ccagcagctg cbgggcabct ggggctgcag cggcaagcbb 3720 
abcbgcacca ccgccgbgcc ctggaacgcc agcbggagca acaagagcct ggagcagatc 3780 
tggaaccaca ccacctggat ggagfcgggac cgcgagatca acaactacac cagccbgabc 3840 
cacagcctga tcgaggagag ccagaaccag caggagaaga acgagcagga gcbgcbggag 3900 
cbggacaagb gggccagcct gtggaactgg ttcaacatca ccaactggct gbggbacabc 3960 
aagctgttca tcatgatcgt gggcggcctg gtgggcctgc gcatcgtgtt cgccgtgctg 4020 
agcatcgtga accgcgtgcg ccagggctac agccccctga gcttccagac ccacctgccc 4080 
atcccccgcg gccccgaccg ccccgagggc atcgaggagg agggcggcga gcgcgaccgc 4140 
gaccgcacga tccgcctggt gaacggcagc tgaggabcca gatcbgctgt gccbbcbagt 4 2 00 
tgccagccat cbgbtgttbg cccctccccc gtgccttcct tgaccctgga aggtgccact 4260 
cccactgtcc bbbccbaaba aaatgaggaa attgcatcgc atbgbcbgag taggtgtcat 4320 
. tctattctgg ggggtggggt ggggcaggac agcaaggggg aggabbggga agacaatagc 4380 
aggcatgctg gggatgcggt gggctctatg ggbacccagg tgctgaagaa ttgacccggt 4440 
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gbgacacacc ctgtccacgc 4500 
cccbggbbcb bagbbccagc cccactcata ggacactcat agctcaggag ggctccgcct 4560 
tcaatcccac ccgctaaagt acttggagcg gtctctccct cccfccatcag cccaccaaac 4620 
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 4680 
gagagaaaat gcctccaaca tgtgaggaag fcaatgagaga aatcatagaa ttttaaggcc 4740 
atcahggcct taatcttccg cttcctcgcfc cactgactcg ctgcgctcgg tcgttcggct 4 800 
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 4860 
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 4 920 
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 4980 
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 5040 
aagctccctc gtgcgctctc ctgttccgac ccbgccgctt accggatacc tgtccgcctt 5100 
tctcccttcg ggaagcgtgg cgcttfcctca tagcbcacgc bgbaggbabc tcagbbcggt 5160 
gbaggbcgbt cgcbccaagc bgggcbgbgb gcacgaaccc cccgttcagc ccgaccgcbg 5220 
cgccbbabcc ggbaacbabc gbcbbgagbc caacccggba agacacgacb tatcgccacb 5280 
ggcagcagcc acbggbaaca ggabbagcag agcgaggtab gbaggcggtg cbacagagbb 5340 
cbtgaagbgg bggccbaact acggcbacac bagaagaaca gtabtbggba tctgcgcbcb 54 00 
gctgaagcca gbtaccbbcg gaaaaagagb bggbagctcb bgabccggca aacaaaccac 5460 
cgcbggtagc ggtggbttbt bbgbbbgcaa gcagcagatt acgcgcagaa aaaaaggatc 5520 
bcaagaagab ccttbgabcb bb.bcbacggg gbcbgacgct cagbggaacg aaaactcacg 5580 
tbaagggatb bbggbcabga gabbatcaaa aaggabctbc acctagabcc tbbbaaabba 5640 
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aaaatgaagfc bbtaaatcaa bcbaaagbat atatgagtaa acttggtctg acagttacca 5700 

atgcttaatc agbgaggcac cbabcbcagc gatctgtcta bbbcgbbcab ccatagttgc 5760 

ctgactcggg gggggggggc gctgaggtct gcctcgtgaa gaaggbgbbg ctgactcata 5820 

ccaggcctga atcgccccab catccagcca gaaagtgagg gagccacggt tgatgagagc 5880 

tttgttgtag gtggaccagt tggtgatttt gaacbbbbgc tttgccacgg aacggtctgc 5940 

gtbgtcggga agatgcgtga bcbgatccbt caacbcagca aaagbbcgab ttattcaaca 6000 

aagccgccgb cccgbcaagb cagcgbaabg ctctgccagb gbtacaacca atbaaccaab 6060 

bctgatbaga aaaacbcabc gagcatcaaa bgaaacbgca abtbabbcab atcaggabba 6120 

bcaabaccab abbbtbgaaa aagccgbbbc bgbaabgaag gagaaaacbc accgaggcag 6180 

bbccabagga bggcaagabc cbggbabcgg bcbgcgabbc cgacbcgbcc aacabcaaba 6240 

caaccbabba abbbccccbc gbcaaaaaba aggtbabcaa gbgagaaabc accabgagtg 6300 

acgacbgaat ccggbgagaa bggcaaaagc btatgcattt cttbccagac btgbbcaaca 6360 

ggccagccab bacgcbcgbc abcaaaabca ctcgcabcaa ccaaaccgbb atbcabbcgb 6420 

gabbgcgccb gagcgagacg aaabacgcga bcgcbgbbaa aaggacaabb acaaacagga 6480 ' 

abcgaabgca accggcgcag gaacacbgcc agcgcabcaa caababbbbc accbgaabca 6540 

ggababbcbt cbaabaccbg gaabgcbgbb btcccgggga bcgcagbggb gagbaaccab 6600 

gcabcatcag gagbacggab aaaabgcbbg atggbcggaa gaggcabaaa bbccgbcagc 6660 

cagtbbagbc bgaccatcbc abcbgbaaca bcatbggcaa cgcbaccbtb gccabgbbtc 6720 

agaaacaacb cbggcgcatc gggcbbccca bacaatcgab agabbgbcgc accbgabbgc 6780 

ccgacabbab cgcgagccca bbbabaccca babaaabcag catccabgtb ggaabbbaab 6840 

cgcggccbcg agcaagacgb bbcccgbbga atatggcbca taacaccccb bgbabtactg 6900 

bbbabgtaag cagacagbbb babbgbbcab gabgababab bbbbabcbbg bgcaabgbaa 6960 

cabcagagab btbgagacac aacgbggcbb bccccccccc cccattabbg aagcabtbab 7020 

cagggbtabb gbcbcatgag cggabacaba btbgaabgba bbbagaaaaa baaacaaaba 70B0 

ggggbbccgc gcacabttcc ccgaaaagbg ccaccbgacg bcbaagaaac cabbabbabc 7140 
abgacatbaa ccbabaaaaa baggcgbabc acgaggccct bbcgtc 7186 



<210> 76 
<211> 7498 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 

• 

<400> 76 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 

cagcbbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 120 

bbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgta ctgagagbgc 180 

accabatgcg gbgbgaaaba ccgcacagab gcgtaaggag aaaabaccgc abcagabbgg 240 

cbabbggcca bbgcabacgb bgbabccaba bcabaababg tacabbbaba bbggcbcabg 300 

bccaacabba ccgccabgbb gacabbgabb abbgacbagb babbaabagb aabcaabbac 360 

ggggbcabba gbbcabagcc catababgga gbbccgcgbt acabaacbta cggbaaatgg 420 

cccgccbggc bgaccgccca acgacccccg cccabbgacg tcaabaabga cgbabgbtcc 4 80 

cabagbaacg ccaataggga cbbtccabbg acgbcaabgg gbggagbatb bacggbaaac 540 

bgcccacbbg gcagbacabc aagtgbabca babgccaagb acgcccccta bbgacgbcaa 600 

bgacggbaaa bggcccgcct ggcabbabgc ccagbacabg accbbabggg acbbbccbac 660 

btggcagbac abcbacgbat bagtcabcgc babbaccabg gbgabgcggb bbbggcagba 720 

cabcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagtcbcc accccattga 780 

cgbcaatggg agbbtgbbbt ggcaccaaaa bcaacgggac tbbccaaaab gbcgbaacaa 840 

ctccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg tgggaggbcb ababaagcag 900 

agcbcgtbba gbgaaccgbc agabcgccbg gagacgccab ccacgcbgtb bbgaccbcca 960 

bagaagacac cgggaccgat ccagccbccg cggccgggaa cggtgcabtg gaacgcggab 1020 
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bccccgbgcc aagagtgacg taagtaccgc 
tcttatgcat gcbabacbgb tbbbggcbbg 
taggbgabgg tatagcttag cctataggtg 
tattggtgac gatactttcc abbactaabc 
tabbggcbab abgccaabac tcbgbccbbc 
ggabggggbc ccabtbabba tttacaaabt 
cgcagtbbbb attaaacata gcgtgggatc 
catgggctct tctccggtag cggcggagct 
agcggctcat ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag abgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
bgcagbcacc gbcgbcgaca cgtgtgatca 
gtgaaggaga agtaccagca cctgtggcgc 
ggcatgctga tgatctgcag cgccaccgag 
cccgbgbgga aggaggccac caccaccctg 
accgaggtgc acaacgtgbg ggccacccac 
gaggtggtgc bggbgaacgb gaccgagaac 
cagatgcacg aggacabcab cagcctgtgg 
acccccctgt gcgtgagcct gaagtgcacc 
agcagcggcc gcatgatcat ggagaagggc 
accagcatcc gcggcaaggb gcagaaggag 
cccabcgaca acgacaccac cagctacagc 
caggccbgcc ccaaggtgag cttcgagccc 
ttcaccatcc tgaagtgcaa caacaagacc 
agcaccgtgc agtgcaccca cggcatccgc 
ggcagcctgg ccgaggagga ggtggtgatc 
accatcatcg tgcagctgaa caccagcgtg 
acccgcaagc gaattcgcat ccagcgcggc 
atcggcaaca tgcgccaggc ccactgcaac 
aagcagatcg acagcaagct gcgcgagcag 
cagagcagcg gcggcgaccc cgagatcgtg 
ttctactgca acagcaccca gcbgbbcaac 
gagggcagca acaacaccga gggcagcgac 
atcatcaaca bgbggcagaa ggtgggcaag 
atccgctgca gcagcaacat caccggcctg 
aacgagagcg agatcttccg cctgggcggc 
ctgtacaagt acaaggtggt gaagatcgag 
cgccgcgtgg tgcagcgcga gaagcgcgcc 
cbgggcgccg ccggcagcac cabgggcgcc 
cagctgctga gcggcatcgt gcagcagcag 
cagcacctgc tgcagctgac cgtgtggggc 
gbggagcgcb acctgaagga ccagcagctg 
atctgcacca ccgccgtgcc ctggaacgcc 
bggaaccaca ccacctggat ggagtgggac 
cacagcctga tcgaggagag ccagaaccag 
ctggacaagt gggccagcct gbggaacbgg 
aagctgttca tcatgatcgt gggcggcctg 
agcatcgtga accgcgtgcg ccagggctac 
atcccccgcg gccccgaccg ccccgagggc 
gaccgcacga bccgccbggb gaacggcagc 
ctgtgcctgt bcagcbacca ccgcctgcgc 
gagctgctgg gccgccgcgg ctgggaggcc 



cbabagacbc babaggcaca cccctttggc 1080 
gggcctatac acccccgctt ccttatgcta 1140 
tgggttattg accattattg accactcccc 1200 
cataacatgg ctctttgcca caacbabctc 1260 
agagactgac acggachctg babtbbbaca 1320 
cacatataca acaacgccgt cccccgtgcc 1380 
tccacgcgaa tctcgggtac gtgttccgga 1440 
tccacatccg agccctggtc ccatgcctcc 1500 
ctcctaacag tggaggccag acbbaggcac 1560 
cacaaggccg bggcggbagg gtabgbgbct 1620 
gctgacgcag atggaagact taaggcagcg 1680 
gtattctgat aagagtcaga ggtaacfcccc 1740 
gtagtctgag cagtactcgt tgctgccgcg 1800 
aacagactgt tcctttccat gggtcttttc 1860 
gatatcgcgg ccgctctaga caccatgcgc 1920 
bggggcbggc gctggggcac catgctgctg 1980 
aagctgbggg tgaccgtgta ctacggcgtg 204 0 
ctctgcgcca gcgacgccaa ggcctacgac 2100 
gcctgcgtgc ccaccgaccc caacccccag 2160 
ttcgacatgt ggaagaacga catggtggag 22 20 
gaccagagcc tgaagccctg cgtgaagctg 22 80 
gacctgaaga acgacaccaa caccaacagc 2340 
gagatcaaga actgcagctt caacatcagc 2400 
tacgccttct tctacaagct ggacatcatc 2460 
ctgaccagcfc gcaacaccag cgtgatcacc 2520 
atccccaacc actactgcgc ccccgccggc 2580 
fctcaacggca ccggccccfcg caccaacgtg 2640 
cccgtggtga gcacccagct gctgctgaac 2700 
cgcagcgtga acttcaccga caacgccaag 2760 
gagatcaacb gcacccgccc caacaacaac 2820 
cccggccgcg cctbcgbgac catcggcaag 2880 
abcagccgcg ccaagtggaa caacacccbg 2940 
btcggcaaca acaagaccab catcbbcaag 3000 
acccacagct bcaacbgcgg cggcgagtbc 3060 
agcaccbggt bcaacagcac cbggagcacc 3120 
accabcaccc bgcccbgccg cabcaagcag 3180 
gccatgbacg ccccccccab cagcggccag 3240 
cbgcbgaccc gcgacggcgg caacagcaac 3 3 00 
ggcgacatgc gcgacaactg gcgcagcgag 3360 
ccccbgggcg bggcccccac caaggccaag 3420 
gtgggcatcg gcgccctgtb ccbgggctbc 34 80 
gccagcatga ccctgaccgb gcaggcccgc 3 540 
aacaacctgc bgcgcgccab cgaggcccag 3600 
abcaagcagc bgcaggcccg cabcctggcc 3660 
cbgggcatcb ggggcbgcag cggcaagctt 3 720 
agcbggagca acaagagccb ggagcagabc 3780 
cgcgagabca acaacbacac cagccbgabc 3 840 
caggagaaga acgagcagga gcbgcbggag 3900 
ttcaacabca ccaacbggcb gbggbacabc 3960 
gtgggcctgc gcabcgbgbb cgccgbgctg 4 020 
agcccccbga gcbtccagac ccacctgccc 4080 
abcgaggagg agggcggcga gcgcgaccgc 414 0 
cbggcccbga bcbgggacga ccbgcgcagc 4200 
gaccbgcbgc bgabcgtgac ccgcatcgbg 4260 
cbgaagbacb ggbggaacct gcbgcagbac 4320 
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.tggagccagg agctgaagaa cagcgccgtg 
gccgagggca ccgaccgcgt gatcgaggtg 
atcccccgcc gcatccgcca gggcctggag 
gtgccttcta gttgccagcc atctgbtgtt 
gaaggtgcca ctcccactgt ccfcttcctaa 
agtaggtgtc attctabtct ggggggtggg 
gaagacaata gcaggcatgc tggggatgcg 
aabbgacccg gbbccbccbg ggccagaaag 
ccctgtccac gcccctggtt cttagttcca 
agggcbccgc ctbcaabccc acccgctaaa 
agcccaccaa accaaaccta gccbccaaga 
taagtgcaga gggagagaaa atgcctccaa 
aattttaagg ccabcabggc cbbaabcbbc 
ggtcgttcgg ctgcggcgag cggtatcagc 
agaatcaggg gataacgcag gaaagaacat 
ccgtaaaaag gccgcgttgc tggcgttttt 
caaaaabcga cgctcaagtc agaggtggcg 
gtttccccct ggaagctccc bcgbgcgcbc 
cctgtccgcc tttctccctt cgggaagcgt 
tctcagttcg gtgbaggbcg ttcgctccaa 
gcccgaccgc bgcgcctbat ccggfcaacta 
cttatcgcca ctggcagcag ccactggtaa 
tgctacagag bbcbtgaagt ggtggcctaa 
tatctgcgct ctgctgaagc cagttacctt 
caaacaaacc accgctggta gcggtggttt 
aaaaaaagga tcbcaagaag abcctttgab 
cgaaaacbca cgbbaaggga bbbbggtcat 
ccbbtbaaab baaaaabgaa gbbttaaatc 
bgacagtbac caabgcbbaa bcagbgaggc 
abccatagbb gcctgactcg gggggggggg 
bgcbgacbca baccaggccb gaatcgcccc 
gtbgabgaga gctbbgttgb aggtggacca 
ggaacggbct gcgbbgtcgg gaagabgcgb 
abbbabbcaa caaagccgcc gtcccgtcaa 
caabbaacca abbcbgatba gaaaaacbca. 
atabcaggab babcaabacc atabbbtbga 
bcaccgaggc agtbccabag gabggcaaga 
ccaacabcaa bacaaccbat baabbbcccc 
bcaccabgag bgacgacbga abccggbgag 
actbgbbcaa caggccagcc atbacgcbcg 
btabbcabbc gbgabtgcgc ctgagcgaga 
bbacaaacag gaabcgaabg caaccggcgc 
tcaccbgaab caggababbc bbctaatacc 
gbgagbaacc atgcabcabc aggagtacgg 
aabbccgbca gccagbbbag bcbgaccabc 
bbgccabgbb bcagaaacaa cbcbggcgca 
gcaccbgabb gcccgacabb abcgcgagcc 
tbggaabbba abcgcggccb cgagcaagac 
cbbgbabbac bgtbbabgba agcagacagb 
bgbgcaabgt aacabcagag abbbbgagac 
bgaagcabbb abcagggbba bbgbcbcabg 
aabaaacaaa baggggbbcc gcgcacabbb 
accattatba bcabgacabb aaccbabaaa 



agccbgcbga acgccaccgc cabcgccgbg 43 80 
gtgcagggcg ccbgccgcgc cabccgccac 4440 
cgcabcctgc bgbgaggabc cagabcbgcb 4500 
bgccccbccc ccgbgccbbc cbbgacccbg 4560 
baaaabgagg aaattgcabc gcatbgbcbg 4620 
gtggggcagg acagcaaggg ggaggabbgg 4680 
gbgggcbcba bgggtaccca ggtgctgaag 4740 
aagcaggcac abccccbbcb cbgtgacaca 4 800 
gccccacbca taggacacbc abagcbcagg 4860 
gbacbbggag cggbcbcbcc cbcccbcabc 4920 
gbgggaagaa abbaaagcaa gabaggcbab 4 980 
cabgbgagga agbaabgaga gaaabcabag 5040 
cgcbbccbcg cbcacbgacb cgctgcgctc 5100 
tcacbcaaag gcggbaatac ggbbabccac 5X60 
gbgagcaaaa ggccagcaaa aggccaggaa 5220 
ccabaggctc cgccccccbg acgagcabca 5280 
aaacccgaca ggacbabaaa gabaccaggc 5340 
tccbgbbccg accctgccgc bbaccggaba 5400 
ggcgcbbbcb cabagcbcac gcbgbaggta 54 60 
gcbgggcbgb gbgcacgaac cccccgbbca 5520 
bcgbcbbgag tccaacccgg baagacacga 55 80 
caggabtagc agagcgaggb abgbaggcgg 5640 
ctacggcbac acbagaagaa cagbabbbgg 5700 
cggaaaaaga gbbggbagcb cbbgabccgg 5760 
bbbbgbbbgc aagcagcaga btacgcgcag 5820 
cbbbbcbacg gggbcbgacg cbcagbggaa 5880 
gagatbabca aaaaggabcb bcaccbagab 5940 
aabctaaagb abababgagb aaacbbggbc 6000 
accbabcbca gcgabcbgbc batttcgbbc 6060 
gcgcbgaggb cbgcctcgbg aagaaggbgt 6120 
abcabccagc cagaaagbga gggagccacg 6180 
gttggbgabb tbgaacbbbb gctbbgccac 6240 
gatcbgabcc bbcaacbcag caaaagbbcg 6300 
gbcagcgbaa bgctcbgcca gbgbtacaac 6360 
bcgagcabca aabgaaacbg caabbbabbc 6420 
aaaagccgbt bcbgbaabga aggagaaaac 64 8 0 
bccbggbabc ggbcbgcgab bccgacbcgb 6540 
tcgbcaaaaa baaggbtatc aagbgagaaa 6600 
aatggcaaaa gcbbabgcab bbcbbbccag 6660 
tcabcaaaab cacbcgcabc aaccaaaccg 6720 
cgaaabacgc gabcgctgbb aaaaggacaa 6780 
aggaacacbg ccagcgcabc aacaabatbb 6840 
tggaabgcbg btbbcccggg gatcgcagbg 6900 
abaaaabgcb bgabggtcgg aagaggcaba 6960 
tcabcbgtaa cabcatbggc aacgcbaccb 7020 
bcgggcbtcc cabacaabcg atagatbgbc 7080 
cabbbabacc cababaaatc agcabccabg 714 0 
gbtbcccgbb gaababggcb cabaacaccc 7200 
bbtabtgtbc abgabgabab abbbbbabcb 7260 
acaacgbggc bbbccccccc cccccatbab 7320 
agcggabaca babbbgaabg babbbagaaa 7380 
ccccgaaaag bgccaccbga cgbcbaagaa 7440 
aabaggcgba bcacgaggcc cbbbcgbc 7498 
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<210> 77 
<211> 7315 
<212> DNA 

<2X3> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 

4 

<400> 77 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 18 0 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 42 0 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccactfcg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 72 0 
catcaatggg cgtggatagc ggtttgacfcc acggggattt ccaagtctcc accccattga 78 0 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa. tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc lagatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 10 80 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tafcagcttag cctataggtg tgggttattg accattattg accactcccc 12 00 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 12 60 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 13 20 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgb cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 14 4 0 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 162 0 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 2040 
cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtgaacgt gaccgagaac ttcgacatgt ggaagaacga catggtggag 222 0 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 2280 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcat ggagaagggc gagatcaaga actgcagctt caacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag tacgccttct tctacaagct ggacatcatc 24 6 0 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcaacaccag cgtgatcacc 2520 
caggcctgcc ccaaggtgag cttcgagccc atccccaacc actactgcgc ccccgccggc 2580 
ttcaccatcc tgaagtgcaa caacaagacc ttcaacggca ccggcccctg caccaacgtg 264 0 
agcaccgtgc agtgcaccca cggcatccgc cccgtggtga gcacccagct gctgctgaac 27 00 
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ggcagcctgg ccgaggagga ggtggbgabc cgcagcgtga acttcaccga caacgccaag 2760 
accatcatcg tgcagctgaa caccagcgtg gagabcaacb gcacccgccc caacaacaac 2820 
acccgcaagc gaattcgcat ccagcgcggc cccggccgcg ccttcgtgac cabcggcaag 2880 
atcggcaaca tgcgccaggc ccactgcaac atcagccgcg ccaagtggaa caacacccbg 2940 
aagcagatcg acagcaagct gcgcgagcag btcggcaaca acaagaccat cabcbtcaag 3000 
cagagcagcg gcggcgaccc cgagatcgtg acccacagcb tcaactgcgg cggcgagttc 3060 
ttctactgca acagcaccca gctgttcaac agcacctggt tcaacagcac ctggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accatcaccc tgccctgccg catcaagcag 3180 
atcahcaaca tgtggcagaa ggtgggcaag gccatgtacg ccccccccat cagcggccag 3240 
abccgctgca gcagcaacat caccggcctg ctgcbgaccc gcgacggcgg caacagcaac 3 300 
aacgagagcg agabcttccg cctgggcggc ggcgacatgc gcgacaactg gcgcagcgag 33 60 
cbgbacaagb acaaggtggt gaagabcgag ccccbgggcg tggcccccac caaggccaag 3420 
cttaccgtgc aggcccgcca gctgctgagc ggcatcgtgc agcagcagaa caacctgctg 34 80 
cgcgccatcg aggcccagca gcaccbgctg cagcbgaccg tgbggggcat caagcagctg 3540 
caggcccgca tcctggccgt ggagcgctac ctgaaggacc agcagctgga gcagatctgg 3600 
aaccacacca cctggatgga gtgggaccgc gagabcaaca acbacaccag ccbgatccac 3660 
agcctgabcg aggagagcca gaaccagcag gagaagaacg agcaggagct gctggagcbg 3720 
gacaagbggg ccagcctgbg gaacbggbtc aacabcacca acbggcbgbg gbacatcaag 3780 
cbgtbcatca bgabcgbggg cggccbggtg ggcctgcgca bcgbgttcgc cgtgctgagc 3 840 
abcgtgaacc gcgbgcgcca gggctacagc ccccbgagcb tccagaccca ccbgcccabc 3900 
ccccgcggcc ccgaccgccc cgagggcatc gaggaggagg gcggcgagcg cgaccgcgac 3960 
cgcacgatcc gccbggtgaa cggcagcctg gcccbgatct gggacgacct gcgcagccbg 4020 
bgcctgttca gcbaccaccg cctgcgcgac cbgcbgcbga bcgbgacccg cabcgbggag 4080 
cbgcbgggcc gccgcggcbg ggaggcccbg aagtactggt ggaaccbgct gcagbacbgg 414 0 
agccaggagc bgaagaacag cgccgbgagc cbgctgaacg ccaccgccat cgccgbggcc 4200 
gagggcaccg accgcgfcgab cgaggtggtg cagggcgcct gccgcgccat ccgccacatc 4260 
ccccgccgca bccgccaggg cctggagcgc abcctgcbgt gaggabccag atctgcbgbg 4320 
ccbbctagtt gccagccabc bgtbgbtbgc ccctcccccg bgccbbcctt gacccbggaa 4380 
ggtgccacbc ccacbgbccb tbccbaataa aabgaggaaa bbgcabcgca bbgtctgagb 4440 
aggbgtcatb cbabbcbggg gggbggggbg gggcaggaca gcaaggggga ggatbgggaa 4500 
gacaabagca ggcatgcbgg ggatgcggtg ggcbctatgg gtacccaggb gctgaagaat 4560 
bgacccggtt cctcctgggc cagaaagaag caggcacatc cccbbctctg bgacacaccc 4620 
bgbccacgcc ccbggbbcbb agtbccagcc ccacbcabag gacacbcata gcbcaggagg 4680 
gctccgccbb caabcccacc cgcbaaagba cbbggagcgg tctcbcccbc ccbcabcagc 4740 
ccaccaaacc aaaccbagcc bccaagagbg ggaagaaatb aaagcaagat aggcbabbaa 4 800 
gbgcagaggg agagaaaabg cctccaacat gtgaggaagb aabgagagaa abcabagaat 4 860 
ttbaaggcca tcatggcctt aabcbbccgc ttcctcgctc acbgactcgc bgcgcbcggt 4920 
cgtbcggcbg cggcgagcgg batcagcbca cbcaaaggcg gbaabacggt babccacaga 4980 
atcaggggat aacgcaggaa agaacabgtg agcaaaaggc cagcaaaagg ccaggaaccg 5040 
baaaaaggcc gcgbbgcbgg cgtbbbbcca baggcbccgc cccccbgacg agcabcacaa 5100 
aaabcgacgc bcaagtcaga ggtggcgaaa cccgacagga cbabaaagab accaggcgtt 5160 
tccccctgga agcbccctcg bgcgcbctcc tgbtccgacc cbgccgctba ccggabaccb 5220 
gbccgccbbt ctccctbcgg gaagcgbggc gcbtbctcab agcbcacgct gbaggtatct 5280 
cagbbcggbg taggbcgbbc gctccaagcb gggcbgbgbg cacgaacccc ccgtbcagcc 5340 
cgaccgcbgc gccbtabccg gbaacbatcg bcbtgagbcc aacccggbaa gacacgacbt 5400 
atcgccacbg gcagcagcca cbggbaacag gabbagcaga gcgaggbabg baggcggbgc 54 60 
tacagagbtc bbgaagbggb ggccbaacba cggcbacacb agaagaacag babbbggbat 5520 
ctgcgctcbg cbgaagccag bbacctbcgg aaaaagagbb ggbagcbcbb gatccggcaa 5580 
acaaaccacc gcbggtagcg gbggbbbbbb bgtbbgcaag cagcagabta cgcgcagaaa 5640 
aaaaggabcb caagaagabc cbtbgabcbb tbcbacgggg bctgacgcbc agbggaacga 5700 
aaacbcacgt baagggabbb bggbcabgag atbabcaaaa aggatctbca ccbagabccb 5760 
bbtaaatbaa aaabgaagbb bbaaabcaab cbaaagbaba babgagbaaa cbbggbcbga 5820 
cagtbaccaa bgcbbaabca gtgaggcacc batcbcagcg abcbgbcbat bbcgtbcatc 5880 
catagbtgcc tgactcgggg ggggggggcg cbgaggbcbg ccbcgfcgaag aaggfcgbbgc 5940 
bgacbcabac caggcctgaa bcgccccabc abccagccag aaagtgaggg agccacggbb 6000 
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gatgagagct ttgttgbagg tggaccagtt ggbgabtbbg aacbbbbgcb ttgccacgga 6060 
acggtctgcg ttgtcgggaa gabgcgbgab ctgatccttc aactcagcaa aagttcgatt 6120 
tatbcaacaa agccgccgtc ccgtcaagtc agcgbaabgc tctgccagtg ttacaaccaa 6180 
ttaaccaatt ctgattagaa aaactcatcg agcatcaaat gaaactgcaa tttattcata 6240 
tcaggattat caataccata bbbbbgaaaa agccgttbct gtaatgaagg agaaaacbca 6300 
ccgaggcagt bccabaggab ggcaagatcc bggbatcggb ctgcgattcc gactcgtcca 6360 
acabcaabac aaccbabbaa tt tec cc teg bcaaaaabaa ggttatcaag bgagaaabca 6420 
ccatgagtga cgactgaatc eggbgagaab ggcaaaagct babgcabbbc tttccagact 6480 
tgbbcaacag gccagccabt acgctcgtca tcaaaatcac tcgcatcaac caaacegtta 6540 
tbcabtcgbg attgegecbg agegagaega aabacgegat cgctgttaaa aggacaatta 6600 
caaacaggaa tegaatgeaa ccggcgcagg aacactgcca gcgcatcaac aatabbbtca 6660 
cctgaatcag gatattctbc taabacctgg aabgctgttt beceggggat cgcagtggtg 6720 
agtaaccabg cabcabcagg agbaeggaba aaabgcttga bggtcggaag aggcabaaat 6780 
tccgbcagcc agbbtagbcb gaccabctca bebgtaacab cabtggcaac gcbaccbttg 6840 
ccabgbbbca gaaacaaebe bggegcabeg ggcbtcccat acaatcgaba gabbgbegea 6900 
ccbgabbgcc cgacabbabc gcgagcccab tbabacccab abaaabcagc abccabgbbg 6960 
gaabbbaabc gcggccbcga geaagaegbt becegbbgaa batggebcab aacaccccbb 7020 
gbabtacbgb btatgbaagc agacagbbbb attgbbcabg abgabababb bbbatctbgb 7080 
gcaatgtaac atcagagabb bbgagacaca acgtggcbbt cccccccccc ccabbabbga 714 0 
agcabbbatc agggbtabbg bctcabgagc ggabacabat bbgaabgtab bbagaaaaat 7200 
aaacaaabag gggtbccgcg cacabbbccc cgaaaagbgc caccbgacgb cbaagaaacc 7260 
abbabbabca tgacabbaac cbataaaaab aggegbabca cgaggcccbb begbc 7315 



<210> 78 
<211> 7498 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 78 

begegegtbb eggbgatgae ggbgaaaacc bebgacacat gcagcbcccg gagaeggbca 60 
cagctbgbcb gtaageggab geegggagea gacaagcccg beagggegeg beagegggbg 120 
bbggcgggbg beggggebgg cbtaacbabg eggcabcaga gcagabbgba cbgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gegbaaggag aaaabacege abcagabbgg 240 
cbabtggcca ttgcabacgb tgbabecaba bcabaababg bacabbbaba bbggcbcabg 3 00 
becaacabba ccgccabgbb gacatbgatb abbgactagb babbaabagb aabcaabtac 360 
ggggbcabba gttcabagee cabababgga gbbccgcgbb acabaaebba eggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga egbabgbbec 480 
cabagbaacg ccaabaggga ebbbecabbg aegbcaabgg gbggagbabt baeggbaaac 540 
bgcccacbbg gcagtacabc aagbgbabca babgecaagb acgcccccba tbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagtacabg accbbabggg acbbbccbac 660 
bbggcagbac abebaegbab tagbcabege batbaccabg gbgabgcggb bbbggcagba 720 
cabcaabggg egbggabage ggbbbgactc aeggggabbb ccaagbcbcc accccabbga 780 
cgbcaabggg agbbbgbbbb ggcaccaaaa beaaegggae bbbecaaaab gbcgbaacaa 840 
cbccgcccca btgacgcaaa tgggeggbag gcgbgbacgg bgggaggbcb atabaagcag 900 
agebegtbba gbgaaccgbc agabegecbg gagacgecab ccacgcbgbb bbgacctcca 960 
bagaagacac egggaccgab ccagccbccg eggcegggaa eggbgcabbg gaacgeggat 1020 
bccccgbgcc aagagbgacg taagbacege cbabagaebe babaggcaca ccccbbbggc 1080 
bebbabgcab gcbabacbgb bbbtggcbtg gggecbatae acccccgcbb cebbabgeba 1140 
baggbgabgg babagebbag ccbabaggbg bgggbbabbg accabbabbg accacbcccc 1200 
batbggtgac gataebbbec abbacbaabc cabaacabgg cbcbbbgcca caacbabcbc 1260 
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babbggcbab atgccaatac tctgtccttc agagactgac acggacbcbg babbbbbaca 1320 
ggabggggbc ccatttatta bbbacaaabb cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbtbbb abbaaacaba gcgbgggabc bccacgcgaa bcbcgggbac gbgbbccgga 1440 
catgggcbcb tcbccggtag cggcggagcb tccacatccg agccctggbc ccabgcctcc 1500 
agcggcbcab ggbcgctcgg cagctccbbg cbccbaacag tggaggccag actbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgbct 1620 
gaaaabgagc gbggagatbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagtbgbb gbabbcbgab aagagbcaga ggbaacbccc 1740 
gbbgcggbgc bgbbaacggb ggagggcagb gbagtcbgag cagtacbcgb bgcbgccgcg 1800 
cgcgccacca gacabaatag cbgacagacb aacagactgb bcctbbccab gggbcbtbbc 1860 
bgcagbcacc gtcgbcgaca cgbgbgabca gababcgcgg ccgcbcbaga caccabgcgc 1920 
gbgaaggaga agtaccagca ccbgbggcgc bggggcbggc gcbggggcac cabgcbgcbg 1980 
ggcatgcbga bgabcbgcag cgccaccgag aagcbgbggg tgaccgbgba cbacggcgbg 2040 
cccgtgbgga aggaggccac caccacccbg ctcbgcgcca gcgacgccaa ggccbacgac 2100 
accgaggbgc acaacgbgbg ggccacccac gccbgcgbgc ccaccgaccc caacccccag 2160 
gaggtggbgc bggbcgacgb gaccgagaac bbcgacabgb ggaagaacga cabggbggag 2220 
cagatgcacg aggacabcat cagccbgbgg gaccagagcc tgaagcccbg cgbgaagcbg 2280 
acccccctgt gcgbgagccb gaagbgcacc gaccbgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcabgabcab ggagaagggc gagabcaagg acbgbagcbb cgacabcagc 24 00 
accagcabcc gcggcaaggb gcagaaggag bacgcctbcb bcbacaagcb ggacabcabc 2460 
cccatcgaca acgacaccac cagcbacagc ctgaccagcb gcgaaacbag bgbgabcacc 2520 
caggcctgcc ccaaggbgag cbbcgagccc abccccaacc acbacbgcgc ccccgccggc 2580 
bbcaccabcc bgaagbgcaa cgacaagacc bbcgacggca ccgggcccbg caccgacgbg 2640 
agcaccgbgc agbgcaccca cggcabccgc cccgtggbga gcacccagcb gcbgcbgaac 2 700 
ggcagccbgg ccgaggagga ggbggbgabc cgcagcgbcg acbbcaccga caacgccaag 2 760 
accabcatcg bgcaggbgga caccacggbg gagabcgabt gcacccgccc caacaacaac 2820 
acccgcaagc gaabbcgcab ccagcgcggc cccggccgcg ccttcgbgac catcggcaag 2 880 
abcggcaaca bgcgccaggc ccacbgcgab abcagccgcg ccaagbggga caacacccbg 2940 
aagcagabcg acagcaagcb gcgcgagcag tbcggcaacg acaagaccab catcbbcaag 3000 
cagagcagcg gcggcgaccc cgagabcgbg acccacagcb bcaacbgcgg cggcgagbbc 3 060 
btcbacbgcg acagcaccca gcbgbbcaac agcaccbggb bcaacagcac cbggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accabcaccc tgcccbgccg catcaagcag 3180 
abcabcaaca bgbggcagaa ggbgggcaag gccabgbacg ccccccccab cagcggccag 3240 
abccgcbgca gcagcgatab caccggccbg cbgcbgaccc gcgacggcgg caacagcaac 3300 
aacgagagcg agabcbbccg ccbgggcggc ggcgacabgc gcgacaactg gcgcagcgag 3360 
cbgbacaagb acaaggbggt gaagabcgag cccctgggcg bggcccccac caaggccaag 342 0 
cgccgcgbgg bgcagcgcga gaagcgcgcc gtgggcatcg gcgccctgtb ccbgggcbbc 34 80 
ctgggcgccg ccggcagcac cabgggcgcc gccagcabga cccbgaccgb gcaggcccgc 3540 
cagcbgcbga gcggcabcgt gcagcagcag aacaaccbgc tgcgcgccab cgaggcccag 3600 
cagcaccbgc bgcagcbgac cgbgbggggc abcaagcagc bgcaggcccg cabccbggcc 3660 
gtggagcgcb accbgaagga ccagcagcbg cbgggcabct ggggcbgcag cggcaagcbb 3720 
abcbgcacca ccgccgbgcc cbggaacgcc agcbggagca acaagagccb ggagcagabc 37 80 
bggaaccaca ccaccbggab ggagbgggac cgcgagatca acaacbacac cagccbgabc 3B4 0 
cacagccbga bcgaggagag ccagaaccag caggagaaga acgagcagga gcbgcbggag 3900 
cbggacaagb gggccagcct gbggaacbgg tbcaacabca ccaacbggcb gbggbacabc 3960 
aagcbgbbca bcabgabcgb gggcggccbg gbgggccbgc gcabcgbgtb cgccgbgcbg 4020 
agcabcgtga accgcgbgcg ccagggcbac agcccccbga gcbbccagac ccaccbgccc 4 080 
abcccccgcg gccccgaccg ccccgagggc abcgaggagg agggcggcga gcgcgaccgc 414 0 
gaccgcacga tccgccbggb gaacggcagc ctggcccbga bctgggacga cctgcgcagc 4 200 
cbgbgccbgb tcagcbacca ccgccbgcgc gaccbgcbgc bgabcgbgac ccgcabcgbg 4260 
gagcbgctgg gccgccgcgg cbgggaggcc cbgaagbacb ggbggaaccb gctgcagbac 4320 
bggagccagg agcbgaagaa cagcgccgbg agccbgcbga acgccaccgc cabcgccgbg 4380 
gccgagggca ccgaccgcgb gabcgaggbg gbgcagggcg ccbgccgcgc catccgccac 4440 
abcccccgcc gcabccgcca gggccbggag cgcabccbgc bgbgaggatc cagabcbgcb 4500 
gbgccbbcba gbbgccagcc abcbgbbgbb tgccccbccc ccgbgccbtc cbtgacccbg 4560 
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gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaatbgcabc gcattgtctg 4620 
agbaggtgtc attctattct ggggggbggg gbggggcagg acagcaaggg ggaggabtgg 4680 
gaagacaata gcaggcatgc tggggatgcg gtgggctcba tgggtaccca ggtgctgaag 4740 
aattgacccg gttcctcctg ggccagaaag aagcaggcac atccccttct ctgbgacaca 4 800 
ccctgtccac gcccctggtt cttagttcca gccccactca taggacactc atagctcagg 4860 
agggctccgc cttcaatccc acccgctaaa gtacttggag cggtctctcc ctccctcatc 4920 
agcccaccaa accaaaccta gcctccaaga gtgggaagaa attaaagcaa gataggctat 4980 
taagtgcaga gggagagaaa abgcctccaa catgtgagga agtaatgaga gaaatcatag 5040 
aatbttaagg ccatcatggc cttaatcttc cgcttcctcg cbcactgact cgcfcgcgctc 5100 
gghcgttcgg ctgcggcgag cggbabcagc tcactcaaag. gcggtaatac ggttatccac 5160 
agaatcaggg gafcaacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 5220 
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgccccccbg acgagcatca 52 80 
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 5340 
gtbbcccccb ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 5400 
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 5460 
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 5520 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 55B0 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 5640 
bgcbacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 5700 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 5760 
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 5820 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtcbgacg ctcagtggaa 5880 
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 5940 
ccbbttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 6000 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgahctgtc tat'ttcgtbc 6060 
atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg aagaaggtgt 612 0 
tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga gggagccacg 6180 
gttgatgaga gctttgtfcgt aggfcggacca gttggtgatt ttgaactttt gcttfcgccac 6240 
ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag caaaagttcg 63 00 
atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctchgcca gtgttacaac 6360 
caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg caabttattc 6420 
atatcaggat tatcaatacc abattbttga aaaagccgbt tcbgtaatga aggagaaaac 6480 
tcaccgaggc agtbccatag gabggcaaga tcctggtabc ggtctgcgat tccgactcgt 6540 
ccaacatcaa tacaacctat taatttcccc tcgbcaaaaa taaggbbatc aagtgagaaa 6600 
tcaccatgag tgacgactga atccggtgag aabggcaaaa gctbatgcat fctctttccag 6660 
acbtgttcaa caggccagcc abbacgctcg tcabcaaaat cactcgcatc aaccaaaccg 6720 
ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt aaaaggacaa 6780 
ttacaaacag gaatcgaatg caaccggcgc aggaacacbg ccagcgcatc aacaabattt 684 0 
tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg gatcgcagtg 6900 
gbgagtaacc atgcatcatc aggagbacgg ataaaabgct tgatggtcgg aagaggcaba 6960 
aattccgtca gccagtbtag tcbgaccatc tcatctgtaa catcattggc aacgcbacct 7020 
btgccatgtb tcagaaacaa ctctggcgca tcgggctfccc catacaatcg atagattgbc 7080 
gcacctgatt gcccgacatt atcgcgagcc catbtatacc catataaatc agcatccabg 7140 
tbggaattta atcgcggcct cgagcaagac gtttcccgbt gaababggct cataacaccc 7200 
ctbgbattac fcgtbbatgta agcagacagt tbtattgttc atgatgatat atttttabct 7260 
tgtgcaatgb aacatcagag atttbgagac acaacgtggc tttccccccc cccccatbat 7320 
tgaagcattt atcagggtta ttgtctcatg agcggataca batttgaatg tatbbagaaa 7380 
aataaacaaa taggggbtcc gcgcacabtt ccccgaaaag tgccacctga cgtctaagaa 744 0 
accattatta tcabgacatt aacctataaa aataggcgta bcacgaggcc cttbcgtc 74 98 
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<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 79 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgjtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 10 80 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga caccatgcgc 1920 
gtgaaggaga agtaccagca cctgtggcgc tggggctggc gctggggcac catgctgctg 1980 
ggcatgctga tgatctgcag cgccaccgag aagctgtggg tgaccgtgta ctacggcgtg 2040 
cccgtgtgga aggaggccac caccaccctg ctctgcgcca gcgacgccaa ggcctacgac 2100 
accgaggtgc acaacgtgtg ggccacccac gcctgcgtgc ccaccgaccc caacccccag 2160 
gaggtggtgc tggtcgacgt gaccgagaac ttcgacatgt ggaagaacga catggtggag 2220 
cagatgcacg aggacatcat cagcctgtgg gaccagagcc tgaagccctg cgtgaagctg 2280 
acccccctgt gcgtgagcct gaagtgcacc gacctgaaga acgacaccaa caccaacagc 2340 
agcagcggcc gcatgatcat ggagaagggc gagatcaagg actgtagctt cgacatcagc 2400 
accagcatcc gcggcaaggt gcagaaggag tacgccttct tctacaagct ggacatcatc 24 60 
cccatcgaca acgacaccac cagctacagc ctgaccagct gcgaaactag tgtgatcacc 2520 
caggcctgcc ccaaggtgag cttcgagccc atccccaacc actactgcgc ccccgccggc 2580 
ttcaccatcc tgaagtgcaa cgacaagacc ttcgacggca ccgggccctg caccgacgtg 2640 
agcaccgtgc agtgcaccca cggcatccgc cccgtggtga gcacccagct gctgctgaac 2700 
ggcagcctgg ccgaggagga ggtggtgatc cgcagcgtcg acttcaccga caacgccaag 2 7 60 
accatcatcg tgcaggtgga caccacggtg gagatcgatt gcacccgccc caacaacaac 2820 
acccgcaagc gaattcgcat ccagcgcggc cccggccgcg ccttcgtgac catcggcaag 2880 
atcggcaaca tgcgccaggc ccactgcaac atcagccgcg ccaagtggaa caacaccctg 2 94 0 
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aagcagatcg acagcaagcb gcgcgagcag ttcggcaaca acaagaccat cabcbbcaag 3000 
cagagcagcg gcggcgaccc cgagatcgbg acccacagcb tcaactgcgg cggcgagttc 3060 
ttctactgca acagcaccca gctgttcaac agcacctggt tcaacagcac ctggagcacc 3120 
gagggcagca acaacaccga gggcagcgac accatcaccc tgccctgccg catcaagcag 3180 
abcabcaaca fcgtggcagaa ggtgggcaag gccatgtacg ccccccccat cagcggccag 3240 
atccgctgca gcagcaacab caccggcctg ctgctgaccc gcgacggcgg caacagcaac 3300 
aacgagagcg agatcbtccg cctgggcggc ggcgacatgc gcgacaactg gcgcagcgag 3360 
ctgbacaagt acaaggbggb gaagabcgag ccccbgggcg bggcccccac caaggccaag 3420 
cgccgcgbgg bgcagcgcga gaagcgcgcc gbgggcabcg gcgcccbgbb ccbgggcbbc 34 80 
cbgggcgccg ccggcagcac catgggcgcc gccagcatga cccbgaccgb gcaggcccgc 3540 
cagcbgcbga gcggcabcgb gcagcagcag aacaacctgc bgcgcgccab cgaggcccag 3600 
cagcaccbgc bgcagcbgac cgtgbggggc abcaagcagc tgcaggcccg cabcctggcc 3660 
9bggagcgcb accbgaagga ccagcagcbg cbgggcatct ggggcbgcag cggcaagcbt 3720 
atcbgcacca ccgccgbgcc ctggaacgcc agcbggagca acaagagccb ggagcagabc 3 7 80 
bggaaccaca ccaccbggab ggagbgggac cgcgagabca acaactacac cagccbgabc 3 840 
cacagcctga bcgaggagag ccagaaccag caggagaaga acgagcagga gctgcbggag 3 900 
cbggacaagb gggccagccb gtggaactgg bbcaacatca ccaactggcb gtggtacabc 3 960 
aagcbgtbca bcatgabcgb gggcggccbg gtgggcctgc gcatcgbgbb cgccgbgcbg' 4020 
agcabcgbga accgcgbgcg ccagggctac agcccccbga gcbtccagac ccaccbgccc 4080 
abcccccgcg gccccgaccg ccccgagggc abcgaggagg agggcggcga gcgcgaccgc 4140 
gaccgcacga bccgccbggb gaacggcagc cbggccctga bcbgggacga ccbgcgcagc 42 00 
cbgtgcctgb bcagcbacca ccgccbgcgc gaccbgctgc bgatcgbgac ccgcabcgtg 4260 
gagcbgcbgg gccgccgcgg ctgggaggcc cbgaagbacb ggbggaaccb gcbgcagbac 4320 
bggagccagg agcbgaagaa cagcgccgbg agccbgcbga acgccaccgc catcgccgbg 4380 
gccgagggca ccgaccgcgb gabcgaggbg gbgcagggcg ccbgccgcgc cabccgccac 4440 
abcccccgcc gcabccgcca gggccbggag cgcabccbgc tgtgaggabc cagabcbgcb 4500 
gbgccbbcba gbbgccagcc abcbgtbgbb bgccccbccc ccgtgccbbc cbbgacccbg 4560 
gaaggbgcca cbcccacbgb ccbbbccbaa baaaabgagg aaabbgcabc gcabbgbcbg 4620 
agbaggtgtc attcbattct ggggggbggg gbggggcagg acagcaaggg ggaggatbgg 4680 
gaagacaaba gcaggcabgc bggggabgcg gbgggctcba bgggbaccca ggbgcbgaag 474 0 
aabbgacccg gtbccbccbg ggccagaaag aagcaggcac atccccbbcb ctgbgacaca 4800 
cccbgbccac gccccbggbb cbbagbbcca gccccactca baggacactc atagcbcagg 4860 
• agggcbccgc cttcaatccc acccgcbaaa gbacbtggag cggbcbctcc ctcccbcatc 4920 
agcccaccaa accaaaccba gcctccaaga gbgggaagaa atbaaagcaa gabaggcbab 4980 
baagbgcaga gggagagaaa abgccbccaa cabgbgagga agtaatgaga gaaabcatag 5040 
aabbbbaagg ccabcabggc cbbaabcbtc cgctbccbcg ctcacbgacb cgcbgcgcbc 5100 
ggbcgbbcgg cbgcggcgag cggbabcagc bcacbcaaag gcggbaabac ggbbabccac 5160 
agaabcaggg gabaacgcag gaaagaacab gtgagcaaaa ggccagcaaa aggccaggaa 5220 
ccgbaaaaag gccgcgbbgc bggcgbbttb ccabaggcbc cgcccccctg acgagcatca 5280 
caaaaabcga cgcbcaagbc agaggbggcg aaacccgaca ggacbabaaa gabaccaggc 5340 
gbbbcccccb ggaagcbccc bcgbgcgcbc bccbgbbccg acccbgccgc ttaccggaba 5400 
ccbgbccgcc bbbcbcccbb cgggaagcgb ggcgcbbbcb cabagcbcac gcbgbaggba 5460 
bcbcagbbcg gtgtaggbcg bbcgcbccaa gcbgggctgb gtgcacgaac cccccgbbca 5520 
gcccgaccgc bgcgccbbat ccggbaacta bcgbcbbgag bccaacccgg taagacacga 5580 
cbbabcgcca cbggcagcag ccacbggtaa caggatbagc agagcgaggb abgbaggcgg 5640 
bgcbacagag bbcbbgaagt ggbggccbaa cbacggcbac acbagaagaa cagbabbbgg 5700 
babcbgcgcb cbgcbgaagc cagbbacctb cggaaaaaga gtbggtagcb ctbgabccgg 5760 
caaacaaacc accgcbggba gcggbggbbb bbfcbgbtbgc aagcagcaga btacgcgcag 5820 
aaaaaaagga bcbcaagaag abccbbbgab cbbbtcbacg gggbcbgacg cbcagbggaa 5880 
cgaaaacbca cgbbaaggga bbbbggbcat gagabbabca aaaaggatcb tcaccbagab 5940 
ccbbbbaaab taaaaabgaa gbbbbaaatc aatcbaaagb atababgagb aaachbggbc 6000 
bgacagbbac caabgcbbaa bcagbgaggc accbabctca gcgabcbgtc babbbcgbbc 6060 
abccabagbb gccbgacbcg gggggggggg gcgcbgaggb ctgcctcgtg aagaaggbgb 6120 
bgcbgacbca taccaggccb gaabcgcccc abcabccagc cagaaagbga gggagccacg 6180 
gbbgabgaga gcbbbgbbgt aggbggacca gbtggtgabb btgaacbbbb gcfcbtgccac 6240 
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ggaacggtct gcgttgtcgg gaagatgcgt gabcbgabcc ttcaactcag caaaagttcg 6300 

atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa bgcbcbgcca gbgbb.acaac 6360 

caattaacca attctgatta gaaaaactca tcgagcatca aabgaaactg caabtbabtc 6420 

atatcaggat babcaabacc atatttttga aaaagccgtt bcbgtaabga aggagaaaac 6480 

tcaccgaggc agttccatag gatggcaaga tcctggtatc ggbctgcgab bccgacbcgb 6540 

ccaacatcaa bacaacctab baabbbcccc bcgtcaaaaa taaggttatc aagtgagaaa 6600 

tcaccatgag bgacgactga atccggtgag aatggcaaaa gcttatgcat bbcbbbccag 6660 

acttgttcaa caggccagcc attacgctcg tcabcaaaat cactcgcatc aaccaaaccg 6720 

bbabbcabbc gbgabtgcgc ctgagcgaga cgaaatacgc gatcgctgtt aaaaggacaa 6780 

bbacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc aacaababbt 6840 

tcacctgaafc caggatatbc ttctaahacc tggaatgctg ttttcccggg gabcgcagbg 6900 

gbgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg aagaggcata 6960 

aattccgtca gccagtttag tctgaccatc bcabctgtaa catcatbggc aacgcbacct 7020 

btgccatgbb tcagaaacaa ctctggcgca bcgggcbtcc catacaatcg atagatbgbc 7080 

gcacctgatt gcccgacabt abcgcgagcc cabbbatacc cabataaatc agcatccabg 714 0 

tbggaatbta atcgcggcct cgagcaagac gbtbcccgbb gaababggcb cataacaccc 7200 

cbbgtatbac tgbtbabgba agcagacagb bbtabtgtbc abgafcgabab atbbtbabcb 7260 

tgbgcaabgb aacabcagag abbttgagac acaacgtggc bbbccccccc cccccattab 7320 

tgaagcattt abcagggbba bbgtcbcabg agcggataca tabbbgaatg tabbtagaaa 7380 

aabaaacaaa taggggbbcc gcgcacabtt ccccgaaaag tgccaccbga cgtctaagaa 744 0 

accatbabba tcabgacabb aaccbabaaa aataggcgba bcacgaggcc cbtbcgtc 7498 



<210> 80 
<211> 5549 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 80 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagctbgbcb gtaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 12 0 
bbggcgggbg bcggggcbgg ctbaacbatg cggcabcaga gcagabbgba cbgagagbgc 180 
accatabgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc atcagabbgg 24 0 
cbabtggcca btgcabacgt bgbabccata bcabaatabg tacattbata fcbggcbcabg 300 
tccaacabba ccgccabgbb gacatbgatb abtgactagb tatbaabagt aabcaabbac 360 
ggggbcabta gbtcabagcc cabatatgga gbbccgcgtb acabaacbta cggbaaabgg 420 
cccgcctggc bgaccgccca acgacccccg cccabtgacg bcaabaabga cgbabgbbcc 480 
cabagbaacg ccaabaggga cbtbccabbg acgbcaabgg gbggagbabb tacggbaaac 54 0 
bgcccacbbg gcagtacabc aagbgtabca babgccaagb acgcccccba tbgacgbcaa 600 
bgacggtaaa tggcccgcct ggcabbabgc ccagbacatg accttabggg acbbbccbac 660 
bbggcagbac abcbacgbat bagtcatcgc babbaccatg gbgatgcggb btbggcagba 720 
cabcaatggg cgtggabagc ggttbgacbc acggggatbb ccaagbcbcc accccabbga 7 80 
cgbcaatggg agbbbgbbbt ggcaccaaaa bcaacgggac btbccaaaab gtcgbaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb abataagcag 900 
agcbcgbbba gtgaaccgbc agabcgccbg gagacgccab ccacgctgtt bbgacctcca 960 
bagaagacac cgggaccgat ccagccbccg cggccgggaa cggfcgcabbg gaacgcggab 102 0 
bccccgtgcc aagagbgacg baagbaccgc cbabagacbc babaggcaca ccccbbtggc 1080 
bcbbatgcab gctabacbgt bbbbggcbbg gggccbatac acccccgcbb ccbbabgcba 1140 
baggbgabgg babagcbbag ccbabaggbg bgggbbatbg accabtabbg accacbcccc 1200 
babbggbgac gabacbbbcc abbactaabc cataacatgg ctctttgcca caactabcbc 1260 
babbggctab abgccaabac bcbgbccbtrc agagacbgac acggacbcbg babbtbbaca 1320 
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ggatggggbc ccatttatta tbbacaaabb 
cgcagbbbbb abbaaacata gcgbgggabc 
catgggctct tctccggtag cggcggagct 
agcggctcat ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag abgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
bgcagbcacc gbcgtcgaca cgtgtgatca 
cggcaagtgg agcaagagca gcgtgatcgg 
cgccgagccc gccgccgacg gcgbgggcgc 
cafccaccagc agcaacaccg ccgccaacaa 
ggaggaggag gbgggcbbcc ccgtgacccc 
ggccgccgtg gacctgagcc acbbccbgaa 
cagccagcgc cgccaggaca tcctggacct 
cgactggcag aactacaccc ccggccccgg 
ctacaagctg gtgcccgtgg agcccgacaa 
cagcctgctg caccccgtga gcctgcacgg 
gtggcgcttc gacagccgcc tggccttcca 
cttcaagaac tgcbgaggab ccagatctgc 
ttgcccctcc cccgtgcctt ccttgaccct 
ataaaatgag gaaattgcat cgcattgtct 
ggtggggcag gacagcaagg gggaggatfcg 
ggtgggctch atgggtaccc aggtgctgaa 
gaagcaggca catccccbbc tctgtgacac 
agccccactc abaggacacb catagctcag 
agtactbgga gcggtctctc cctccctcat 
agtgggaaga aatbaaagca agabaggcba 
acabgbgagg aagbaabgag agaaabcata 
ccgcbbccbc gctcacbgac tcgcbgcgcb 
cbcacbcaaa ggcggbaaba cggtbabcca 
bgbgagcaaa aggccagcaa aaggccagga 
bccabaggcb ccgccccccb gacgagcatc 
gaaacccgac aggactabaa agataccagg 
cbccbgtbcc gacccbgccg cbbaccggab 
bggcgctbbc tcabagctca cgctgbaggb 
agcbgggcbg tgbgcacgaa ccccccgbbc 
abcgbctbga gbccaacccg gbaagacacg 
acaggabbag cagagcgagg babgbaggcg 
acbacggcba cacbagaaga acagtabbbg 
bcggaaaaag agbbggbagc tcbtgabccg 
bbbbbgbbbg caagcagcag abbacgcgca 
bcbbbbcbac ggggbcbgac gcbcagbgga 
bgagabbabc aaaaaggabc bbcaccbaga 
caabcbaaag babababgag baaacbbggb 
caccbabcbc agcgatcbgb cbabbbcgbb 
ggcgcbgagg bcbgccbcgb gaagaaggbg 
cabcabccag ccagaaagbg agggagccac 
agbbggbgat bbbgaacbbb bgcbbbgcca 
bgabcbgabc cbbcaacbca gcaaaagbbc 
agbcagcgta abgcbcbgcc agbgbbacaa 
abcgagcabc aaabgaaacb gcaabbbabb 
aaaaagccgb bbcbgtaabg aaggagaaaa 
abccbggbab cggbctgcga bbccgacbcg 



cacababaca acaacgccgb cccccgbgcc 13 80 
bccacgcgaa bcbcgggbac gtgbbccgga 144 0 
bccacabccg agccctggbc ccatgccbcc 1500 
cbccbaacag bggaggccag acbtaggcac 1560 
cacaaggccg bggcggbagg gbabgbgbcb 1620 
gcbgacgcag abggaagacb baaggcagcg 1680 
gbabbcbgab aagagbcaga ggbaacbccc 1740 
gbagbcbgag cagtacbcgb bgcbgccgcg 1800 
aacagacbgb bcctbtccab gggbcbbbbc 1860 
gababcgcgg ccgcbcbaga ccaccabggg 1920 
cbggcccgcc gbgcgcgagc gcabgcgccg 1980 
cgbgagccgc gaccbggaga agcacggcgc 2 04 0 
cgccgccbgc gccbggcbgg aggcccagga 2100 
ccaggbgccc cbgcgcccca tgaccbacaa 2160 
ggagaagggc ggccbggagg gccbgabcca 2220 
gbggabcbac cacacccagg gcbacbbccc 2280 
cgbgcgctac ccccbgaccb bcggcbggbg 2340 
ggbggaggag gccaacaagg gcgagaacac 2400 
catggacgac cccgagcgcg aggbgcbgga 2460 
ccacgbggcc cgcgagcbgc accccgagba 2520 
bgbgccbbcb agbtgccagc cabcbgtbgt 2580 
ggaaggbgcc acbcccacbg bccbbbccba 2640 
gagbaggbgb cabbcbabbc bggggggbgg 2700 
ggaagacaab agcaggcabg cbggggabgc 2760 
gaabbgaccc ggbbccbccb gggccagaaa 2820 
acccbgbcca cgccccbggb bcbbagbbcc 2880 
gagggcbccg ccbbcaabcc cacccgcbaa 2940 
cagcccacca aaccaaaccb agccbccaag 3000 
bbaagbgcag agggagagaa aabgccbcca 3060 
gaabbtbaag gccabcabgg ccbbaabcbb 3120 
cggbcgbbcg gcbgcggcga gcggbabcag 3180 
cagaatcagg ggabaacgca ggaaagaaca 3240 
accgbaaaaa ggccgcgbtg cbggcgbbbt 3300 
acaaaaabcg acgcbcaagb cagaggbggc 3360 
cgbbbccccc bggaagcbcc cbcgbgcgcb 3420 
accbgbccgc cbbbcbcccb bcgggaagcg 3480 
abcbcagbbc ggbgbaggbc gtbcgcbcca 3540 
agcccgaccg cbgcgccbba bccggbaacb 3600 
acbbabcgcc acbggcagca gccacbggba 3660 
gbgcbacaga gbbcbbgaag bggbggccba 3720 
gbabcbgcgc bcbgcbgaag ccagbbaccb 3780 
gcaaacaaac caccgcbggb agcggbggbb 3840 
gaaaaaaagg abcbcaagaa gabccbbbga 3900 
acgaaaacbc acgbbaaggg abbbbggbca 3960 
bccbbbbaaa bbaaaaabga agbbbbaaab 4020 
cbgacagbba ccaabgcbba abcagbgagg 4080 
cabccabagb bgccbgacbc gggggggggg 4140 
tbgcbgacbc abaccaggcc bgaabcgccc 4200 
ggbbgabgag agcbbbgbbg baggbggacc 4260 
cggaacggbc bgcgbbgbcg ggaagabgcg 4320 
gabbbabbca acaaagccgc cgbcccgbca 4380 
ccaabbaacc aabbcbgabb agaaaaacbc 4440 
cabatcagga bbabcaabac cababbbbbg 4500 
cbcaccgagg cagbbccaba ggabggcaag 4560 
bccaacabca abacaaccba bbaabbbccc 4620 
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ctcgtcaaaa ataaggttat caagtgagaa atcaccatga gtgacgactg aatccggtga 4680 
gaatggcaaa agcttatgca tttctttcca gacttgttca acaggccagc cattacgctc 4740 
gtcatcaaaa tcactcgcat caaccaaacc gttattcatt cgtgattgcg cctgagcgag 4 800 
acgaaatacg cgatcgctgt taaaaggaca attacaaaca ggaatcgaafc gcaaccggcg 4860 
caggaacact gccagcgcat caacaatatt ttcacctgaa tcaggatatt cttctaatac 4 920 
ctggaatgct gttttcccgg ggatcgcagt ggtgagtaac catgcatcat caggagtacg 4980 
gataaaatgc ttgatggtcg gaagaggcat aaattccgtc agccagttta gtcbgaccat 5040 
ctcatctgta acatcattgg caacgctacc tttgccatgt ttcagaaaca actctggcgc 5100 
atcgggcttc ccatacaatc gatagattgt cgcacctgat tgcccgacat tatcgcgagc 5160 
ccattfcatac ccabataaat cagcatccat gttggaattt aatcgcggcc tcgagcaaga 5220 
cgtttcccgt tgaatatggc tcataacacc ccttgtatta ctgtttatgt aagcagacag 5280 
ttttattgtt catgatgata tatttttatc ttgtgcaatg taacatcaga gattttgaga 5340 
cacaacgtgg ctttcccccc ccccccatta ttgaagcatt tatcagggtt attgtctcat 5400 
gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt 5460 
tccccgaaaa gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa 5520 
aaataggcgt atcacgaggc cctttcgtc 5549 



<210> 81 
<211> 5549 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 81 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 7 80 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatcfcc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
99atggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gfcgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
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gaaaatgagc gtggagattg ggcbcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtabbctgab aagagtcaga ggtaactccc 1740 
gthgcggtgc bgbbaacggb ggagggcagt gtagbctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataahag ctgacagact aacagactgt bccbbbccab gggbcbbbbc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatabcgcgg ccgctctaga ccaccatggg 1920 
cggcaagtgg agcaagagca gcgtgatcgg ctggcccgcc gtgcgcgagc gcatgcgccg 1980 
cgccgagccc gccgccgacg gcgtgggcgc cgtgagccgc gacctggaga agcacggcgc 204 0 
catcaccagc agcaacaccg ccgccaacaa cgccgcctgc gcctggctgg aggcccagga 2100 
ggaggaggag gbgggcbbcg cagbgaccgc acaggbggca cbgcgcgcaa bgaccbacaa 2160 
ggccgccgbg gaccbgagcc acbbccbgaa ggagaagggc ggccbggagg gcctgatcca 2220 
cagccagcgc cgccaggaca bccbggaccb gbggatcbac cacacccagg gcbacttccc 2280 
cgactggcag aacbacaccc ccggccccgg cgbgcgcbac ccccbgaccb bcggcbggtg 2340 
ctacaagcbg gtgcccgbgg agcccgacaa ggfcggaggag gccaacaagg gcgagaacac 2400 
cagccbgcbg caccccgbga gccbgcacgg cabggcagca cccgagcgcg aggtgctgga 24 60 
gtggcgcbbc gacagccgcc bggccttcca ccacgbggcc cgcgagcbgc accccgagba 2520 
cbtcaagaac tgcbgaggab ccagabctgc tgbgccbtcb agbbgccagc cabcbgtbgb 2580 
bbgcccctcc cccgbgccbb ccbbgacccb ggaaggbgcc actcccacbg bccbtbccba 2640 
abaaaabgag gaaabbgcab cgcabtgtcb gagtaggtgb cabbctabbc bggggggbgg 2700 
ggbggggcag gacagcaagg gggaggabtg ggaagacaab agcaggcabg cbggggatgc 2760 
ggtgggctct abgggbaccc aggbgcbgaa gaabtgaccc ggbbccbcct gggccagaaa 2820 
gaagcaggca cabccccbbc bcbgbgacac acccbgtcca cgccccbggt bcbbagbbcc 2880 
agccccacbc ataggacacb catagcbcag gagggcbccg ccbbcaabcc cacccgcbaa 2940 
agbacbbgga gcggbcbcbc ccbccctcat cagcccacca aaccaaaccb agccbccaag 3 000 
agbgggaaga aabtaaagca agabaggcta btaagbgcag agggagagaa aabgcctcca 3060 
acabgtgagg aagbaatgag agaaabcaba gaatbbbaag gccabcabgg ccbbaatcbt 3120 
ccgcbbcctc gctcactgac tcgcbgcgct cggtcgbbcg gcbgcggcga gcggtabcag 3180 
cbcactcaaa ggcggbaaba cggbbabcca cagaatcagg ggabaacgca ggaaagaaca 3240 
bgbgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgbbg cbggcgbbbt 3300 
tccabaggcb ccgccccccb gacgagcatc acaaaaabcg acgcbcaagb cagaggbggc 3 360 
gaaacccgac aggacbabaa agabaccagg cgbbfcccccc tggaagcbcc ctcgtgcgcb 3420 
cbcctgbbcc gacccbgccg cbbaccggab accbgtccgc cbbbcbcccb bcgggaagcg 34 80 
bggcgcbbtc bcabagcbca cgcbgbaggb abcbcagbbc ggbgbaggbc gbbcgcbcca 3540 
agcbgggcbg bgbgcacgaa ccccccgbbc agcccgaccg cbgcgccbba bccggbaacb 3600 
abcgbcbbga gbccaacccg gbaagacacg acbbabcgcc acbggcagca gccacbggba 3660 
acaggabbag cagagcgagg babgbaggcg gbgcbacaga gbbcbtgaag bggbggccba 3720 
acbacggcba cacbagaaga acagbatbbg gbabcbgcgc bcbgctgaag ccagbbaccb 3780 
bcggaaaaag agbbggbagc tcbbgatccg gcaaacaaac caccgctggb agcggbggbt 3 84 0 
bbbbtgbbbg caagcagcag abbacgcgca gaaaaaaagg abcbcaagaa gatccbbbga 3900 
bcbbbbcbac ggggtcbgac gcbcagbgga acgaaaacbc acgtbaaggg abbbbggbca 3960 
bgagabbatc aaaaaggabc bbcaccbaga bccbbbtaaa bbaaaaabga agbbbbaaab 4020 
caatcbaaag babababgag baaacbbggb cbgacagbba ccaabgcbba abcagbgagg 4080 
caccbabcbc agcgabcbgb cbabbbcgbb catccabagt bgccbgacbc gggggggggg 4140 
ggcgcbgagg bctgccbcgt gaagaaggbg bbgcbgacbc abaccaggcc bgaabcgccc 4200 
cabcabccag ccagaaagbg agggagccac ggtbgabgag agctbbgbbg baggbggacc 4260 
agbbggtgab bbbgaacbbb bgcbbbgcca cggaacggbc bgcgbbgbcg ggaagabgcg 4320 
bgabcbgabc cbtcaacbca gcaaaagbbc gabtbabbca acaaagccgc cgbcccgbca 4380 
agbcagcgba abgcbcbgcc agbgbbacaa ccaabbaacc aabbcbgatb agaaaaacbc 4440 
abcgagcatc aaatgaaacb gcaabbbabb cababcagga btatcaabac cababbbbtg 4500 
aaaaagccgb bbcbgbaabg aaggagaaaa cbcaccgagg cagbbccaba ggabggcaag 4560 
abccbggbab cggbcbgcga bbccgacbcg bccaacabca atacaaccta bbaabbbccc 4620 
cbcgbcaaaa abaaggbbab caagbgagaa abcaccabga gtgacgacbg aabccggbga 4680 
gaabggcaaa agcbbabgca btbcbbbcca gacbbgbtca acaggccagc cabbacgcbc 4740 
gbcabcaaaa bcacbcgcab caaccaaacc gbtabbcabfc cgbgabbgcg ccbgagcgag 4800 
acgaaatacg cgabcgctgb baaaaggaca abtacaaaca ggaabcgaab gcaaccggcg 4860 
caggaacacb gccagcgcab caacaababb bbcaccbgaa bcaggababb cbbcbaafcac 4920 
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ctggaatgct gttttcccgg ggatcgcagt ggtgagtaac catgcatcat caggagtacg 4980 
gataaaatgc ttgatggtcg gaagaggcat aaattccgtc agccagttta gtctgaccat 5040 
ctcatctgta acatcattgg caacgctacc tttgccatgt ttcagaaaca actctggcgc 5100 
atcgggcttc ccatacaatc gatagattgt cgcacctgat tgcccgacat tatcgcgagc 51S0 
ccatttatac ccatataaat cagcatccat gttggaattt aatcgcggcc tcgagcaaga 5220 
cgtttcccgt tgaatatggc tcataacacc ccttgtatta ctgtttatgt aagcagacag 5280 
ttttattgtt catgatgata tafcttttatc ttgtgcaatg taacatcaga gattttgaga 5340 
cacaacgtgg ctttcccccc ccccccatta ttgaagcabt tatcagggtt attgtctcat 5400 
gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt 5460 
tccccgaaaa gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa 5520 
aaataggcgt atcacgaggc cctttcgtc 5549 



<210> 82 
<211> 5549 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 82 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 72 0 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gfcagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga ccaccatggg 1920 
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cggcaagfcgg agcaagagca gcgtgatcgg ctggcccgcc gtgcgcgagc gcatgcgccg 1980 

cgccgagccc gccgccgacg gcgtgggcgc cgtgagccgc gaccbggaga agcacggcgc 2040 

cabcaccagc agcaacaccg ccgccaacaa cgccgcctgc gcctggctgg aggcccagga 2100 

9939939939 gtgggctbcc ccgtgacccc ccaggtgccc ctgcgcccca tgacctacaa 2160 

ggccgccgbg gacctgagcc acbbccbgaa ggagaagggc ggcctggagg gcctgatcca 2220 

cagccagcgc cgccaggaca tcctggacct gtggatctac cacacccagg gctacttccc 2280 

cgactggcag aactacaccc ccggccccgg cgtgcgctac ccccbgaccb tcggcbggbg 2340 

cbacaagcbg gtgcccgtgg agcccgacaa ggtggaggag gccaacaagg gcgagaacac 2400 

cagcctgctg caccccgtga gcctgcacgg catggcagca cccgagcgcg aggtgcbgga 2460 

gtggcgcttc gacagccgcc tggccttcca ccacgtggcc cgcgagctgc accccgagta 2520 

cttcaagaac bgcbgaggat ccagatctgc bgbgccbbcb agttgccagc cabcbgbbgb 2580 

ttgcccctcc cccgtgcctt ccbbgacccb ggaaggtgcc acbcccacbg tcctttccta 2640 

ataaaatgag gaaabtgcat cgcabbgbcb gagbaggbgb cabbctatbc tggggggtgg 2700 

ggtggggcag gacagcaagg gggaggattg ggaagacaab agcaggcabg cbggggabgc 2760 

ggtgggctcb abgggbaccc aggbgctgaa gaabtgaccc ggttcctcct gggccagaaa 2820 

gaagcaggca cabccccbbc tctgtgacac accctgtcca cgccccbggb tcttagbtcc 2880 

agccccactc ataggacacb cabagctcag gagggcbccg ccbtcaatcc cacccgcbaa 2940 

agtacttgga gcggbcbcbc ccbccctcat cagcccacca aaccaaacct agcctccaag 3000 

agtgggaaga aatbaaagca agataggcta ttaagbgcag agggagagaa aatgcctcca 3060 

acatgtgagg aagtaatgag agaaatcaba gaabtttaag gccatcatgg ccbbaabcbb 3120 

ccgcttcctc gctcactgac tcgctgcgct cggbcgbbcg gctgcggcga gcggtatcag 3180 

cbcacbcaaa ggcggtaaba cggttatcca cagaatcagg ggataacgca ggaaagaaca 3240 

bgbgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 3300 

tccataggct ccgccccccb gacgagcatc acaaaaatcg acgctcaagt cagaggbggc 33 60 

gaaacccgac aggacbataa agabaccagg cgtbtccccc bggaagcbcc ctcgtgcgct 3420 

ctcctgttcc gacccbgccg cbbaccggab accbgtccgc cbbbcbccct tcgggaagcg 3480 

tggcgctttc bcatagcbca cgctgtaggt atcbcagttc ggtgtaggtc gttcgctcca 3540 

agctgggctg bgtgcacgaa ccccccgbbc agcccgaccg ctgcgccbta tccggtaact 3600 

atcgtcttga gtccaacccg gtaagacacg actbatcgcc actggcagca gccactggta 3660 

acaggabbag cagagcgagg tabgbaggcg gbgctacaga gttcttgaag tggtggccba 3720 

actacggcta cactagaaga acagtatthg gbabctgcgc bctgctgaag ccagttaccb 3780 

tcggaaaaag agbbggtagc tcbbgabccg gcaaacaaac caccgctggt agcggtggtt 384 0 

bbbbbgbbbg caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctbtga 3900 

tcttbbcbac ggggtctgac gctcagtgga acgaaaactc acgbbaaggg abbbbggbca 3960 

bgagabbabc aaaaaggatc ttcacctaga tccttbtaaa tbaaaaabga agttttaaat 4020 

caabctaaag tatatatgag taaacttggt ctgacagbba ccaatgctta atcagtgagg 4080 

caccbabcbc agcgabcbgfc ctatbfccgtt catccabagt bgcctgactc gggggggggg 4140 

ggcgctgagg bctgccbcgt gaagaaggtg tbgctgactc ataccaggcc bgaabcgccc 4200 

cabcabccag ccagaaagbg agggagccac ggtbgabgag agctbtgbbg baggbggacc 4260 

agbbggbgab bbbgaacbbb bgcbbbgcca cggaacggtc bgcgbbgbcg ggaagabgcg 4320 

bgabcbgabc cbbcaacbca gcaaaagbbc gabtbatbca acaaagccgc cgbcccgbca 43 80 

agbcagcgba abgcbcbgcc agbgbtacaa ccaabbaacc aabbcbgabt agaaaaactc 4440 

abcgagcatc aaabgaaacb gcaabbbatb cababcagga bbabcaatac cababtbbtg 4500 

aaaaagccgb btcbgbaabg aaggagaaaa cbcaccgagg cagbbccaba ggatggcaag 4560 

abccbggtab cggbcbgcga bbccgactcg tccaacabca abacaaccba bbaabbtccc 4620 

cbcgbcaaaa abaaggbbab caagbgagaa abcaccabga gbgacgacbg aabccggbga 4680 

gaabggcaaa agctbabgca bbtctbbcca gacbbgbbca acaggccagc cabtacgcbc 4740 

gbcabcaaaa bcacbcgcab caaccaaacc gbbabbcatb cgbgabbgcg ccbgagcgag 4800 

acgaaabacg cgatcgcbgb baaaaggaca abbacaaaca ggaabcgaab gcaaccggcg 4 860 

caggaacacb gccagcgcab caacaababb bbcaccbgaa bcaggababb cbbcbaabac 4920 

cbggaabgcb gbbtbcccgg ggabcgcagb ggbgagbaac cabgcabcab caggagbacg 4980 

gabaaaabgc bbgabggbcg gaagaggcab aaabbccgbc agccagbtta gbctgaccab 5040 

cbcabcbgba acabcabbgg caacgcbacc bbbgccabgb btcagaaaca acbcbggcgc 5100 

abcgggcbbc ccatacaabc gatagabbgb cgcaccbgab bgcccgacab babcgcgagc 5160 

ccabbbabac ccatabaaab cagcabccat gbbggaabbb aatcgcggcc bcgagcaaga 5220 
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cgbbbcccgb tgaababggc tcafcaacacc ccbbgbabba ctgtttatgt aagcagacag 5280 
tbbbabbgbb cabgabgaba babbbbbabc bbgbgcaabg baacabcaga gabbbbgaga 5340 
cacaacgbgg ctttcccccc ccccccabba bbgaagcabb babcagggbb abtgbcbcab 5400 
gagcggabac ababbbgaab gbabtbagaa aaataaacaa abaggggbbc cgcgcacatb 5460 
bccccgaaaa gbgccaccbg acgbcbaaga aaccabbabb abcabgacab baaccbabaa 5520 
aaabaggcgb abcacgaggc ccbbtcgtc 5549 



<210> 83 
<211> 5549 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 83 

bcgcgcgbtb cggbgabgac ggtgaaaacc bcbgacacat gcagcbcccg gagacggbca 60 
cagcttgbcb gbaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggbg 120 
bbggcgggtg bcggggctgg cbtaactabg cggcatcaga gcagabbgba ctgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaataccgc abcagabbgg 240 
cbabbggcca bbgcabacgb bgtabccaba bcabaababg tacatbbaba bbggcbcabg 300 
bccaacabta ccgccatgbb gacabbgatb atbgacbagb tabbaabagb aabcaatbac 360 
ggggbcabta gbbcatagcc cabababgga gbbccgcgtt acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccattgacg bcaabaabga cgbabgtbcc 480 
cabagbaacg ccaataggga ctbtccabbg acgbcaabgg gbggagbabb bacggbaaac 54 0 
bgcccacbbg gcagbacabc aagbgbabca batgccaagt acgcccccta bbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbbabggg actbbccbac 660 
bbggcagbac atcbacgtab bagbcabcgc babbaccabg gbgabgcggb bbbggcagba 720 
cabcaabggg cgtggabagc ggbbtgactc acggggattb ccaagbctcc accccabbga 780 
cgbcaabggg agtbbgtbbt ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
ctccgcccca bbgacgcaaa tgggcggbag gcgbgbacgg tgggaggbcb atataagcag 900 
agcbcgtbba gbgaaccgbc agatcgcctg gagacgccat ccacgcbgbb btgaccbcca 960 
bagaagacac cgggaccgat ccagccbccg cggccgggaa cggtgcabbg gaacgcggab 1020 
bccccgtgcc aagagbgacg baagtaccgc cbabagacbc babaggcaca ccccbbbggc 1080 
bcbbabgcab gctabacbgt bbbbggcbbg gggccbabac acccccgcbb cctbatgcta 1140 
baggbgabgg tatagcbbag ccbabaggbg bgggbbabbg accabbabbg accacbcccc 1200 
babbggbgac gabacbbbcc abbacbaabc cabaacatgg cbcbbbgcca caacbabcbc 1260 
babbggcbab abgccaabac bcbgbccbtc agagacbgac acggacbctg tatbbbbaca 1320 
ggabggggbc ccatttatba btbacaaabb cacabataca acaacgccgb cccccgbgcc 1380 
cgcagbbbtb abbaaacaba gcgtgggabc tccacgcgaa bcbcgggbac gbgbbccgga 144 0 ' 
cabgggcbcb bcbccggtag cggcggagcb tccacabccg agcccbggtc ccatgcctcc 1500 
agcggctcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg. bggcggbagg gtabgbgbcb 1620 
gaaaabgagc gbggagatbg ggcbcgcacg gcbgacgcag atggaagacb baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgbb gtabbcbgat aagagtcaga ggbaacbccc 1740 
gbbgcggbgc bgbbaacggb ggagggcagb gbagbcbgag cagbactcgb bgcbgccgcg 1800 
cgcgccacca gacabaatag cbgacagacb aacagacbgb bccbbbccab gggbcbbbbc 1860 
bgcagbcacc gbcgbcgaca cgbgbgabca gababcgcgg ccgcbctaga ccaccabggg 1920 
cggcaagbgg agcaagagca gcgbgabcgg cbggcccgcc gtgcgcgagc gcabgcgccg 1980 
cgccgagccc gccgccgacg gcgbgggcgc cgbgagccgc gaccbggaga agcacggcgc 2040 
cabcaccagc agcaacaccg ccgccaacaa cgccgccbgc gccbggcbgg aggcccagga 2100 
ggaggaggag gbgggcbbcg cagbgaccgc acaggbggca cbgcgcgcaa tgaccbacaa 2160 
ggccgccgbg gacctgagcc acbtcctgaa ggagaagggc ggccbggagg gccbgabcca 2220 
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cagccagcgc cgccaggaca tcctggacct gtggatctac cacacccagg gctacttccc 2280 
cgactggcag aactacaccc ccggccccgg cgtgcgctac cccctgacct tcggctggtg 2340 
ctacaagctg gtgcccgtgg agcccgacaa ggbggaggag gccaacaagg gcgagaacac 24 00 
cagcctgctg caccccgtga gcctgcacgg catggacgac cccgagcgcg aggtgctgga 24 60 
gbggcgcbbc gacagccgcc fcggccttcca ccacgtggcc cgcgagctgc accccgagba 2520 
cttcaagaac tgctgaggat ccagabcbgc bgtgccttct agttgccagc cabcbgtbgb 2580 
ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg bccbtbccba 2640 
ataaaatgag gaaattgcat cgcabtgtcb gagbaggbgb cabbctabbc bggggggbgg 2700 
ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggabgc 2760 
ggtggg ctct atgggtaccc aggtgctgaa gaattgaccc ggttcctcct gggccagaaa 2820 
gaagcaggca catccccttc tctgtgacac accctgtcca cgcccctggt tcttagttcc 2880 
agccccacbc ataggacact catagctcag gagggctccg ccttcaatcc cacccgctaa 2940 
agbacbbgga gcggtctctc ccbcccbcat cagcccacca aaccaaacct agcctccaag 3000 
agtgggaaga aattaaagca agataggcta ttaagtgcag agggagagaa aatgcctcca 3060 
acatgtgagg aagbaatgag agaaatcata gaattttaag gccatcatgg ccttaatctt 3120 
ccgcttcctc gctcactgac tcgcbgcgcb cggtcgbbcg gcbgcggcga gcggtatcag 3180 
ctcactcaaa ggcggtaata cggttabcca cagaatcagg ggabaacgca ggaaagaaca 3240 
tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgbbg cbggcgbbbt 3300 
bccabaggcb ccgcccccct gacgagcabc acaaaaatcg acgctcaagt cagaggtggc 3360 
gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct 3420 
ctcctgttcc gacccfegccg cttaccggat acctghccgc ctttctccct hcgggaagcg 34 80 
bggcgctbbc bcatagcbca cgctgtaggt abctcagbbc ggtgtaggtc gttcgctcca 3540 
agcbgggcbg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact 3600 
abcgbcbbga gbccaacccg gtaagacacg acttatcgcc actggcagca gccactggta 3660 
acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta 3 720 
actacggcta cactagaaga acagbabbtg gbabcbgcgc tctgctgaag ccagtbaccb 3780 
tcggaaaaag agbtggbagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt 3 840 
tttttgtttg caagcagcag abbacgcgca gaaaaaaagg atctcaagaa gabccbbbga 3 900 
bcbbbbcbac ggggbcbgac gctcagtgga acgaaaactc acgttaaggg attttggtca 3960 
tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat 4020 
caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg 4 080 
caccfcatctc agcgatctgt ctatttcgtt catccatagh tgcctgactc gggggggggg 4140 
ggcgctgagg tctgcctcgt gaagaaggtg ttgctgactc ataccaggcc tgaatcgccc 4200 
catcatccag ccagaaagtg agggagccac ggttgatgag agctttgttg taggtggacc 4260 
agttggtgat tttgaacttt tgctttgcca cggaacggtc tgcgfctgtcg ggaagatgcg 4320 
tgatctgatc cttcaactca gcaaaagttc gatttattca acaaagccgc cgtcccgtca 4380 
agtcagcgta atgctctgcc agtgttacaa ccaattaacc aattctgatt agaaaaactc 4440 
atcgagcatc aaabgaaact gcaatttatt catatcagga ttatcaatac catatttttg 4500 
aaaaagccgt htctgtaatg aaggagaaaa ctcaccgagg cagttccata ggatggcaag 4560 
atcctggtat cggtctgcga ttccgactcg tccaacatca atacaaccta ttaatttccc 4620 
ctcgtcaaaa ataaggttat caagtgagaa atcaccatga gtgacgactg aatccggtga 4680 
gaatggcaaa agcttatgca tttctttcca gacttgttca acaggccagc catfcacgctc 4740 
gtcatcaaaa tcactcgcat caaccaaacc gttattcatt cgtgattgcg ccbgagcgag 4800 
acgaaatacg cgatcgctgt taaaaggaca atbacaaaca ggaabcgaat gcaaccggcg 4860 
caggaacact gccagcgcat caacaababb tbcaccbgaa bcaggabatb cbtcbaatac 4920 
ctggaabgcb gbbbtcccgg ggatcgcagb ggbgagbaac cabgcabcat caggagbacg 4 980 
gabaaaabgc bbgabggbcg gaagaggcab aaabtccgbc agccagbbta gbcbgaccab 5040 
cbcabcbgta acabcatbgg caacgcbacc bbbgccabgt bbcagaaaca acfccbggcgc 5100 
atcgggcbbc ccabacaatc gabagabtgb cgcacctgat bgcccgacab babcgcgagc 5160 
ccattbabac ccabataaat cagcatccat gbbggaatbt aatcgcggcc bcgagcaaga 5220 
cgtbbcccgb bgaababggc bcabaacacc ccbbgbatba ctgbbbabgt aagcagacag 5280 
btbbatbgbb cabgatgaba tabbttbabc bbgbgcaabg baacabcaga gabbbbgaga 5340 
cacaacgbgg cbbbcccccc ccccccabba bbgaagcabt babcagggtb abbgbcbcab 5400 
gagcggabac ababbtgaab gtabtbagaa aaabaaacaa ataggggbtc cgcgcacabb 5460 
bccccgaaaa gbgccaccbg acgbcbaaga aaccabbabb atcabgacab baaccbabaa 5520 
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<210> 84 
<211> 6609 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<40O> 84 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
cccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
999gtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttatcg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt gaaggagaag taccagcacc tgtggcgctg gggctggcgc tggggcacca 1980 
tgctgctggg catgctgatg atctgcagcg ccaccgagaa gctgtgggtg accgtgtact 2040 
acggcgtgcc cgtgtggcgc gaggccacca ccaccctgtt ctgcgccagc gacgccaagg 2100 
cctacgacac cgaggtgcac aacgtgtggg ccacccacgc ctgcgtgccc accgacccca 2160 
acccccagga ggtggtgctg ggcaacgtga ccgagaactt caacatgtgg aagaacaaca 2220 
tggtggacca gatgcacgag gacatcatca gcctgtggga cgagagcctg aagccctgcg 2280 
tgaagctgac ccccctgtgc gtgaccctga actgcaccaa cctgaacatc accaagaaca 2340 
ccaccaacct gaccagcagc agctggggca tgatggagga gggcgagatc aagaactgca 2400 
gcttctacat caccaccagc atccgcaaca aggtgaagaa ggagtacgcc ctgttcaacc 2460 
gcctggacgt ggtgcccgtg aagaacacca gcaacaccaa gtaccgcctg atcagctgca 2520 
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acaccagcgt gatcacccag gccbgcccca aggbgagcbb ccagcccatc cccatccact 2580 
acbgcgbgcc cgccggcttc gccabccbga agtgcaacaa caagaccbbc aacggcagcg 2640 
gcccctgcac caacgbgagc accgtgcagt gcacccacgg catccgcccc gtggtgagca 2700 
cccagctgct gctgaacggc agccbggccg aggaggacab cgbgabccgc agcgaggact 2760 
tcaccgacaa cgbgaagacc atcatcgtgc agctgaacga gagcgtggtg atcaactgca 2820 
cccgccccaa caacaacacc cgcgagcgcc bgagcatcgg ccccggccgc gccttctacg 2880 
cccgccgcaa catcatcggc gacatccgcc aggcccactg caacatcagc cgcgccaagt 294 0 
ggaacaacac cctgcagcag atcgtgatca agctgcgcga gaagttccgc aacaagacca 3 000 
bcgccbbcaa ccagagcagc ggcggcgacc ccgagabcgt gabgcacagc ttcaactgcg 3060 
gcggcgagtt cttctactgc aacaccgccc agctgttcaa cagcaccfcgg aacgtggccg 3120 
gcggcaccaa cggcaccgag ggcaacgaca bcabcacccb gcagtgccgc abcaagcaga 3180 
tcatcaacat gbggcagaag gtgggcaagg ccabgbacgc cccccccatc accggccaga 3240 
tccgctgcag cagcaacatc accggcctgc bgcbgacccg cgacggcggc aacagcaccg 33 00 
agaccgagac cgagatcttc cgccccggcg gcggcgacat gcgcgacaac tggcgcagcg 3360 
agctgtacaa gtacaaggtg gtgcgcatcg agcccatcgg cgtggccccc acccgcgcca 34 2 0 
agcttaccgt gcaggcccgc ctgctgctga gcggcatcgt gcagcagcag aacaacctgc 3480 
bgcgcgccat cgaggcccag cagaacatgc tgcgcctgac cgtgtggggc atcaagcagc 354 0 
bgcaggcccg cgtgctggcc ctggagcgct acctgcgcga ccagcagctc gagbgaggab 3600 
ccagatctgc tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 3660 . 
ccbbgacccb ggaaggtgcc actcccactg tcctttccta ataaaabgag gaaattgcat 3720 
cgcattgtct gagtaggtgt cattcbabbc bggggggbgg ggtggggcag gacagcaagg 3780 
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atgggtaccc 3840 
aggtgctgaa gaattgaccc ggttcctcct gggccagaaa gaagcaggca catccccttc 3900 
bcbgtgacac acccbgbcca cgccccbggb tcttagttcc agccccactc ataggacact 3960 
catagctcag gagggcfcccg ccttcaatcc cacccgctaa agtacttgga gcggtctctc 4020 
cctccctcat cagcccacca aaccaaaccb agcctccaag agtgggaaga aattaaagca 4080 
agataggcta ttaagtgcag agggagagaa aatgcctcca acabgbgagg aagtaabgag 414 0 
agaaatcata gaattttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac 4200 
tcgctgcgct cggbcgbbcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 4260 
cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 4320 
aaggccagga accgtaaaaa ggccgcgtbg cbggcgbbbb hccataggct ccgcccccct 43 80 
gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggacbataa 4440 
agabaccagg cgbbbccccc bggaagcbcc cbcgtgcgcb cbccbgttcc gacccbgccg 4500 
cbbaccggab accbgbccgc cttbcbccct bcgggaagcg bggcgcbttc tcabagcbca 4560 
cgcbgtaggb abctcagbbc ggbgtaggbc gbbcgctcca agcbgggcbg tgbgcacgaa 4620 
ccccccgbtc agcccgaccg ctgcgccbba bccggbaacb atcgbcbtga gbccaacccg 4680 
gbaagacacg acbbatcgcc actggcagca gccactggta acaggabfcag cagagcgagg 4740 
babgbaggcg gbgcbacaga gbbctbgaag bggbggccba acbacggcta cacbagaaga 480 0 
acagbabttg gbabctgcgc bcbgcbgaag ccagbbacct bcggaaaaag agbtggbagc 4860 
tcbtgabccg gcaaacaaac caccgcbggb agcggbggbb bbbbbgbbbg caagcagcag 4920 
abbacgcgca gaaaaaaagg abcbcaagaa gabccbbbga tcbbbtctac ggggbcbgac 4980 
gcbcagbgga acgaaaacbc acgbbaaggg atbbbggtca bgagatbabc aaaaaggabc 5040 
bbcaccbaga tccbttbaaa tbaaaaabga agbbbbaaat caatcbaaag babababgag 5100 
baaacbbggb cbgacagbba ccaabgcbba abcagbgagg caccbabcbc agcgabcbgb 5160 
cbabbbcgbb catccabagb bgccbgacbc gggggggggg ggcgctgagg bcbgccbcgt 5220 
gaagaaggbg tbgcbgacbc abaccaggcc bgaabcgccc cabcatccag ccagaaagtg 5280 
agggagccac ggbbgabgag agcbbbgtbg baggbggacc agbbggbgat bbbgaacbtb 5340 
bgcbbbgcca cggaacggbc bgcgbbgtcg ggaagabgcg bgabcbgabc cbbcaactca 5400 
gcaaaagbbc gabbbabbca acaaagccgc cgbcccgtca agbcagcgba atgcbcbgcc 5460 
agbgbbacaa ccaabbaacc aabbctgabb agaaaaactc atcgagcabc aaabgaaacb 5520 
gcaabbtabb cababcagga bbabcaabac cabatbbbbg aaaaagccgb btcbgbaabg 5580 
aaggagaaaa cbcaccgagg cagbbccaba ggabggcaag abccbggbab cggbcbgcga 5640 
bbccgacbcg bccaacabca abacaaccba bbaabbbccc cbcgbcaaaa abaaggbbab 5700 
caagbgagaa abcaccabga gbgacgacbg aabccggtga gaabggcaaa agcbbabgca 5760 
bbbcbbbcca gactbgbbca acaggccagc cabbacgcbc gbcabcaaaa bcacbcgcab 5820 
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caaccaaacc gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt 5880 
taaaaggaca attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat 5940 
caacaatatt ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg 6000 
ggatcgcagt ggtgagtaac catgcatcat caggagtacg gataaaabgc ttgatggtcg 6060 
gaagaggcat aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg 6120 
caacgctacc tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc 6180 
gatagattgt cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat 6240 
cagcatccat gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc 6300 
tcataacacc ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgaba 6360 
tatttttatc ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc 6420 
ccccccatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 6480 
gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 6540 
acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc 6600 
cctttcgtc 6609 



<210> 85 
<211> 6783 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 85 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagfcgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag .aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgafc ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac fcctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatfctatta tttacaaatt cacatataca acaacgccgb cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggcbct tctccggtag cggcggagct tccacatccg agccctggtc ccatgccbcc 1500 
agcggctcat ggbcgctcgg cagctccbbg cfccctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gbggagabbg ggctcgcacg gctgacgcag atggaagact baaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gbattctgat aagagtcaga ggtaactccc 1740 
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gbbgcggbgc bgbbaacggb ggagggcagt 
cgcgccacca gacabaabag ctgacagact 
tgcagtcacc gtcgtcgaca cgtgtgatca 
ccabgcgcgt gaaggagaag taccagcacc 
tgctgctggg cabgctgatg atctgcagcg 
acggcgtgcc cgtgtggcgc gaggccacca 
cctacgacac cgaggtgcac aacgtgtggg 
acccccagga ggtggtgctg ggcaacgtga 
tggtggacca gatgcacgag gacatcatca 
tgaagctgac ccccctgtgc gtgaccctga 
ccaccaacct gaccagcagc agctggggca 
gcbtcbacab caccaccagc atccgcaaca 
gcctggacgt ggtgcccgtg aagaacacca 
acaccagcgt gatcacccag gcctgcccca 
actgcgtgcc cgccggcttc gccatcctga 
gcccctgcac caacgtgagc accgbgcagb 
cccagctgct gctgaacggc agcctggccg 
tcaccgacaa cgtgaagacc abcatcgtgc 
cccgccccaa caacaacacc cgcgagcgcc 
cccgccgcaa catcatcggc gacatccgcc 
ggaacaacac cctgcagcag atcgtgatca 
tcgcctfccaa ccagagcagc ggcggcgacc 
gcggcgagtt cttctactgc aacaccgccc 
gcggcaccaa cggcaccgag ggcaacgaca 
bcabcaacab gbggcagaag gtgggcaagg 
tccgctgcag cagcaacatc accggcctgc 
agaccgagac cgagabcbbc cgccccggcg 
agctgtacaa gbacaaggtg gtgcgcatcg 
agcbbaccgt gcaggcccgc ctgctgcbga 
tgcgcgccat cgaggcccag cagaacatgc 
tgcaggcccg cgbgcbggcc ctggagcgct 
ggaacaacab gacctggatg gagtgggagc 
acgacctgct ggagaagagc cagacccagc 
tggacaagtg ggccagccfcg bggaacbggb 
ctgctgtgcc ttctagttgc cagccatctg 
ccctggaagg tgccactccc actgtccttt 
gbcbgagbag gtgtcattct attctggggg 
abbgggaaga caatagcagg catgctgggg 
tgaagaattg acccggttcc tcctgggcca 
acacaccctg tccacgcccc tggttcttag 
tcaggagggc tccgccttca atcccacccg 
tcatcagccc accaaaccaa acctagcctc 
gctattaagt gcagagggag agaaaatgcc 
catagaattt taaggccatc atggccbbaa 
cgctcggtcg bbcggcbgcg gcgagcggba 
bccacagaat caggggafcaa cgcaggaaag 
aggaaccgta aaaaggccgc gtbgctggcg 
cabcacaaaa abcgacgcbc aagbcagagg 
caggcgbbtc cccctggaag cbccctcgbg 
ggabacctgt ccgccbbbcb cccbbcggga 
aggbatctca gttcggtgba ggtcgbbcgc 
gtbcagcccg accgcbgcgc cbbatccggb 
cacgactbab cgccacbggc agcagccacb 
ggcggbgcba cagagbbcbt gaagbggbgg 
ttbggbabcb gcgcfccbgct gaagccagbb 



gbagbcbgag cagtacbcgt tgctgccgcg 1800 
aacagacbgb bcctttccat gggbcbbbbc 1860 
gababcgcgg ccgctcbaga gabatcgcca 1920 
bgbggcgcbg gggcbggcgc bggggcacca 1980 
ccaccgagaa gcbgbgggbg accgbgbacb 2040 
ccacccbgtt cbgcgccagc gacgccaagg 2100 
ccacccacgc cbgcgbgccc accgacccca 2160 
ccgagaacbb caacatgbgg aagaacaaca 2220 
gccbgbggga cgagagccbg aagcccbgcg 2280 
acbgcaccaa ccbgaacabc accaagaaca 2340 
bgabggagga gggcgagabc aagaacbgca 2400 
aggtgaagaa ggagtacgcc cbgbbcaacc 2460 
gcaacaccaa gtaccgcctg abcagcbgca 2520 
aggtgagcbb ccagcccatc cccabccacb 2580 
agbgcaacaa caagaccbtc aacggcagcg 2640 
gcacccacgg cabccgcccc gbggtgagca 2700 
aggaggacab cgbgabccgc agcgaggacb 2760 
agcbgaacga gagcgbggbg abcaacbgca 2 820 
bgagcatcgg ccccggccgc gccbbcbacg 2880 
aggcccactg caacabcagc cgcgccaagb 294 0 
agctgcgcga gaagbbccgc aacaagacca 3000 
ccgagabcgb gabgcacagc btcaacbgcg 3060 
agcbgbbcaa cagcaccbgg aacgbggccg 3120 
bcabcaccct gcagbgccgc abcaagcaga 3180 
ccabgbacgc cccccccabc accggccaga 3240 
bgcbgacccg cgacggcggc aacagcaccg 3300 
gcggcgacab gcgcgacaac tggcgcagcg 3360 
agcccabcgg cgbggccccc acccgcgcca 34 20 
gcggcabcgb gcagcagcag aacaaccbgc 34 80 
bgcgcctgac cgbgbggggc atcaagcagc 3540 
accbgcgcga ccagcagcbc gaggacabcb 3600 
gcgagabcga caacbacacc gacbacabcb 3660 
aggagaagaa cgagaaggag cbgcbggagc 3720 
bcgacatcac caacbggbga ggabccagab 3780 
bbgbbbgccc cbcccccgbg ccbbccbbga 3840 
ccbaabaaaa bgaggaaabt gcabcgcabb 3900 
9bggggtggg gcaggacagc aagggggagg 3960 
abgcggtggg cbcbabgggb acccaggbgc 4020 
gaaagaagca ggcacabccc cbbcbcbgbg 4080 
bbccagcccc actcabagga cactcabagc 4140 
cbaaagtact bggagcggbc bcbcccbccc 4200 
caagagbggg aagaaabbaa agcaagabag 4260 
bccaacabgt gaggaagbaa bgagagaaab 4320 
bcbbccgcbt ccbcgcbcac bgacbcgcbg 4380 
bcagcbcacb caaaggcggb aabacggbba 4440 
aacatgbgag caaaaggcca gcaaaaggcc 4500 
bbtbtccaba ggcbccgccc ccctgacgag 4560 
bggcgaaacc cgacaggacb abaaagabac 4620 
cgcbcbccbg tbccgacccb gccgcbbacc 4680 
agcgbggcgc tbbcbcatag cbcacgcbgb 4740 
bccaagcbgg gcbgbgbgca cgaacccccc 4800 
aacbabcgbc bbgagbccaa cccggbaaga 4860 
ggbaacagga bbagcagagc gaggbabgba 4920 
ccbaacbacg gcbacactag aagaacagba 4980 
accbtcggaa aaagagbtgg bagcbcbbga 504 0 
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tccggcaaac aaaccaccgc bggbagcggb ggtttttttg tttgcaagca gcagattacg 5100 

cgcagaaaaa aaggatctca agaagatcct bbgabcbbbb cbacggggbc bgacgcbcag 5160 

tggaacgaaa acbcacgtba agggabtttg gtcabgagab tatcaaaaag gatcttcacc 5220 

tagatccttt taaattaaaa abgaagbbbb aaatcaatct aaagtatata tgagtaaact 5280 

tggtctgaca gbbaccaabg cbbaabcagb gaggcaccta tctcagcgat cbgbcbabbb 5340 

cgbbcatcca bagbbgccbg acbcgggggg ggggggcgcb gaggbcbgcc bcgbgaagaa 5400 

ggtgbbgcbg acbcabacca ggcctgaabc gccccabcab ccagccagaa agbgagggag 5460 

ccacggbtga bgagagctbb gbbgbaggtg gaccagttgg tgabttbgaa cbbtbgcbtt 5520 

gccacggaac ggbcbgcgbb gbcgggaaga bgcgbgatcb gabcctbcaa cbcagcaaaa 5580 

gbbcgabbba bbcaacaaag ccgccgbccc gbcaagtcag cgbaabgcbc bgccagbgbb 5640 

acaaccaabb aaccaabbcb gabtagaaaa acbcabcgag cabcaaabga aacbgcaabb 5700 

babtcababc aggabbabca abaccababb bbbgaaaaag ccgbbbcbgb aabgaaggag 5760 

aaaactcacc gaggcagbbc cabaggabgg caagabccbg gbabcggbcb gcgabtccga 5820 

cbcgbccaac abcaabacaa ccbabtaatb bccccbcgbc aaaaataagg bbatcaagbg 5880 

agaaatcacc abgagbgacg actgaabccg gtgagaatgg caaaagcbba bgcabtbctb 5940 

bccagacbbg bbcaacaggc cagccabbac gcbcgbcabc aaaatcacbc gcabcaacca 6000 

aaccgbbabb cabbcgbgab bgcgccbgag cgagacgaaa bacgcgabcg cbgbbaaaag 6060 

gacaattaca aacaggaabc gaabgcaacc ggcgcaggaa cacbgccagc gcabcaacaa 6120 

babtbtcacc bgaabcagga babtcbbcta ataccbggaa tgcbgttbbc ccggggabcg 6180 

cagbggtgag taaccabgca bcatcaggag bacggataaa atgcbtgabg gtcggaagag 6240 

gcabaaabbc cgbcagccag bbtagbctga ccabcbcabc bgbaacabca btggcaacgc 63 00 

baccbbbgcc abgbbbcaga. aacaactcbg gcgcatcggg cbbcccabac aabcgabaga 6360 

bbgbcgcacc bgabbgcccg acabbabcgc gagcccabbb abacccabab aaabcagcab 6420 

ccabgbbgga abbbaabcgc ggccbcgagc aagacgbbbc ccgbbgaaba bggcbcabaa 6480 

caccccbtgb abbacbgbbb abgbaagcag acagbbbbab bgbbcabgab gabababbbt 6540 

tabcbtgbgc aabgbaacat cagagabbtb gagacacaac gbggctbbcc cccccccccc 6600 

abbabtgaag cabbbabcag ggbbabbgbc bcabgagcgg abacabatbb gaabgtabbb 6660 

agaaaaabaa acaaabaggg gbbccgcgca catbbccccg aaaagtgcca ccbgacgbct 672 0 

aagaaaccab babbabcabg acabbaaccb abaaaaabag gcgbabcacg aggcccbbtc 6780 
gbc 6783 



<210> 86 
<211> 6861 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 86 

bcgcgcgbbb cggbgabgac ggbgaaaacc bctgacacab gcagcbcccg gagacggbca 60 

cagcbtgbcb gtaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggtg 12 0 

tbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgba cbgagagbgc 180 

accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 

cbabbggcca ttgcabacgb bgbabccaba bcabaatabg bacabbbaba bbggcbcabg 300 

bccaacabba ccgccabgbb gacabbgabb atbgacbagb babbaabagb aabcaabbac 360 

ggggbcabba gbbcabagcc cabatatgga gtbccgcgbb acataacbba cggtaaabgg 420 

cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 480 

cabagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagbabb bacggtaaac 54 0 

bgcccacbbg gcagbacabc aagtgbabca batgccaagb acgcccccba tbgacgbcaa 600 

bgacggbaaa bggcccgccb ggcatbabgc ccagbacabg accbbabggg acbttccbac 660 

bbggcagbac abcbacgtat bagbcabcgc babbaccabg gbgatgcggb tbbggcagba 720 

cabcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagbcbcc accccabbga 780 
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cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggbag gcgtgtacgg tgggaggtct ababaagcag 900 
agcbcgbbba gtgaaccgbc agatcgcctg gagacgccab ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagccbccg cggccgggaa cggbgcabbg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctabagactc tabaggcaca ccccbbbggc 1080 
tcttatgcat gctatactgt tbbbggcbtg gggcctatac acccccgctt ccbtabgcta 1140 
baggtgabgg babagcbtag ccbabaggtg bgggbtabbg accattattg accactcccc 1200 
tattggtgac gatacbbtcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg babbtbbaca 1320 
ggatggggtc ccatttatta tbbacaaatb cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt abbaaacaba gcgtgggatc tccacgcgaa bcbcgggbac gtgbbccgga 1440 
catgggctct tctccggtag cggcggagcb tccacatccg agcecbggbc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagcbccbtg ctcctaacag tggaggccag acbtaggcac 1560 
agcacaatgc ccaccaccac cagbgtgccg cacaaggccg bggcggbagg gbatgbgbcb 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagact baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgtb gbabbctgab aagagbcaga ggbaactccc 174 0 
gbbgcggbgc bgbbaacggb ggagggcagb gtagtctgag cagbacbcgb bgctgccgcg 1800 
cgcgccacca gacataatag cbgacagact aacagactgb tccbttccab gggbctbbtc i860 
bgcagbcacc gbcgtcgaca cgtgbgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccabgcgcgt gaaggagaag baccagcacc bgbggcgcbg gggcbggcgc bggggcacca 1980 
bgcbgcbggg cabgcbgabg atcbgcagcg ccaccgagaa gctgtgggbg accgfcgtacb 2 04 0 
acggcgtgcc cgbgbggcgc gaggccacca ccaccctgtt cbgcgccagc gacgccaagg 2100 
ccbacgacac cgaggtgcac aacgbgtggg ccacccacgc cbgcgbgccc accgacccca 2160 
acccccagga ggbggbgcbg ggcaacgtga ccgagaacbb caacabgtgg aagaacaaca 22 2 0 
bggbggacca gabgcacgag gacabcabca gcctgbggga cgagagccbg aagcccbgcg 22 80 
bgaagcbgac cccccbgtgc gbgaccctga acbgcaccaa ccbgaacabc accaagaaca 234 0 
ccaccaaccb gaccagcagc agcbggggca tgabggagga gggcgagabc aagaacbgca 24 00 
gcbtcbacab caccaccagc abccgcaaca aggbgaagaa ggagbacgcc ctgtbcaacc 2460 
gccbggacgb ggbgcccgbg aagaacacca gcaacaccaa gtaccgcctg atcagctgca 2520 
acaccagcgb gabcacccag gccbgcccca aggtgagcbt ccagcccabc cccabccacb 2580 
acbgcgtgcc cgccggcttc gccabccbga agtgcaacaa caagacctbc aacggcagcg 2640 
gccccbgcac caacgtgagc accgtgcagt gcacccacgg cabccgcccc gbggbgagca 2700 
cccagcbgcb gcbgaacggc agcctggccg aggaggacab cgbgabccgc agcgaggacb 2760 
bcaccgacaa cgbgaagacc abcabcgbgc agctgaacga gagcgbggbg abcaacbgca 2820 
cccgccccaa caacaacacc cgcgagcgcc bgagcabcgg ccccggccgc gccbtcbacg 2880 
cccgccgcaa catcatcggc gacabccgcc aggcccacbg caacabcagc cgcgccaagb 2940 
ggaacaacac cctgcagcag atcgtgatca agcbgcgcga gaagbbccgc aacaagacca 3000 
bcgccbtcaa ccagagcagc ggcggcgacc ccgagatcgb gabgcacagc ttcaacbgcg 3060 
gcggcgagbb cbtcbacbgc aacaccgccc agcbgbbcaa cagcaccbgg aacgbggccg 3120 
gcggcaccaa cggcaccgag ggcaacgaca bcabcacccb gcagbgccgc atcaagcaga 3180 
bcabcaacab gbggcagaag gbgggcaagg ccabgbacgc cccccccatc accggccaga 3240 
tccgcbgcag cagcaacabc accggccbgc bgcbgacccg cgacggcggc aacagcaccg 3300 
agaccgagac cgagatcbbc cgccccggcg gcggcgacat gcgcgacaac bggcgcagcg 3360 
agcbgtacaa gbacaaggbg gbgcgcabcg agcccabcgg cgbggccccc acccgcgcca 3420 
agcbbaccgb gcaggcccgc cbgcbgcbga gcggcabcgb gcagcagcag aacaaccbgc 3480 
bgcgcgccab cgaggcccag cagaacatgc tgcgccbgac cgbgbggggc atcaagcagc 3540 
bgcaggcccg cgbgcbggcc cbggagcgcb accbgcgcga ccagcagcbc gaggacabcb 3600 
ggaacaacat gaccbggabg gagbgggagc gcgagatcga caacbacacc gacbacabcb 3660 
acgaccbgcb ggagaagagc cagacccagc aggagaagaa cgagaaggag cbgcbggagc 3720 
bggacaagbg ggccagccbg bggaacbggb tcgacatcac caactggcbg bggtacabcc 37 80 
gccbgtbcab cabgabcgtg ggcggccbga bcggcctgcg cabcgbgbtc gccgbgcbga 3840 
gcabctgagg abccagabcb gcbgbgcctb cbagbbgcca gccatcbgtb gbbbgccccb 3900 
cccccgbgcc bbccbbgacc cbggaaggbg ccacbcccac bgtccbbbcc baataaaabg 3960 
aggaaabtgc abcgcatbgb cbgagbaggb gtcabbcbab bcbggggggb ggggbggggc 4020 
aggacagcaa gggggaggab bgggaagaca abagcaggca bgcbggggab gcggbgggcb 4080 



- 201 - 



WO 02/32943 



PCT/US01/25721 



ctatgggbac ccaggbgcbg aagaattgac ccggttcctc ctgggccaga aagaagcagg 414 0 
cacatcccct tctcbgtgac acacccbgbc cacgcccctg gttcttagtt ccagccccac 4200 
tcafcaggaca ctcatagctc aggagggctc cgccbbcaab cccacccgct aaagbacbbg 4260 
gagcggtctc tccctccctc atcagcccac caaaccaaac ctagcctcca agagtgggaa 4320 
gaaattaaag caagataggc batbaagbgc agagggagag aaaatgcctc caacatgtga 4380 
ggaagtaatg agagaaatca tagaabbbta aggccatcab ggccbbaabc ttccgcttcc 4440 
bcgcbcactg actcgctgcg cbcggbcgbt cggctgcggc gagcggbabc agcbcactca 4500 
aaggcggbaa tacggttatc cacagaatca ggggabaacg caggaaagaa catgtgagca 4560 
aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtb bbbccatagg 462 0 
cbccgccccc cbgacgagca bcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 468 0 
acaggactab aaagabacca ggcgtfctccc cctggaagcb cccbcgtgcg cbcbccbgbb 4740 
ccgaccctgc cgcttaccgg atacctgtcc gcctbtcbcc cbbcgggaag cgtggcgcbt 4800 
b'cbcabagcb cacgcbgbag gtatctcagb bcggtgbagg tcgtbcgcbc caagctgggc 4860 
tgbgbgcacg aaccccccgb tcagcccgac cgcbgcgcct tatccggtaa cbatcgtcbt 4920 
gagtccaacc cggtaagaca cgacttatcg ccacbggcag cagccactgg baacaggabt 4980 
agcagagcga ggbatgtagg cggtgcbaca gagbtcttga agbggbggcc baactacggc 5040 
tacactagaa gaacagtatt bggbatctgc gctctgcbga agccagbtac cbbcggaaaa 5100 
agagttggba gcbccbgabc cggcaaacaa accaccgctg gfcagcggbgg btbbtbbgtb 5160 
tgcaagcagc agabbacgcg cagaaaaaaa ggatctcaag aagabcctbt gabctbttcb 5220 
acggggtctg acgctcagbg gaacgaaaac tcacgbbaag ggattttggb cabgagabta 5280 
bcaaaaagga bctbcaccta gabccttbba aattaaaaab gaagttttaa. abcaabcbaa 5340 
agbabatabg agtaaacbtg gtctgacagt taccaabgct taatcagbga ggcacctatc 5400 
bcagcgabct gbcbabbtcg btcatccata gbbgcctgac tcgggggggg ggggcgctga 5460 
ggbctgccbc gbgaagaagg bgttgcbgac tcataccagg ccbgaatcgc cccatcabcc 5520 
agccagaaag bgagggagcc acggbbgabg agagcbtbgt bgtaggbgga ccagbbggbg 5580 
attttgaact bbtgctttgc cacggaacgg bctgcgbbgb cgggaagabg cgbgabcbga 5640 
tccttcaacb cagcaaaagb tcgabtbatt caacaaagcc gccgbcccgb caagtcagcg 5700 
baatgcbcbg ccagtgbtac aaccaabbaa ccaabbcbga btagaaaaac bcabcgagca 5760 
bcaaatgaaa cbgcaabbba ttcabatcag gatbabcaat accatatbbt tgaaaaagcc 582 0 
gbbtcbgbaa bgaaggagaa aactcaccga ggcagbbcca taggabggca agabcctggt 588 0 
abcggtcbgc gatbccgacb cgtccaacat caabacaacc tabtaabbtc cccbcgtcaa 5940 
aaataaggtb atcaagbgag aaatcaccab gagbgacgac tgaatccggt gagaabggca 6000 
aaagcbbatg catbbcbbbc cagactbgbb caacaggcca gccabbacgc bcgtcabcaa 6060 
aabcacbcgc abcaaccaaa ccgbbabbca bbcgtgatbg cgccbgagcg agacgaaaba 6120 
cgcgatcgcb gtbaaaagga caabbacaaa caggaatcga abgcaaccgg cgcaggaaca 6180 
cbgccagcgc abcaacaaba btbbcaccbg aabcaggaba bbcbtctaat accbggaabg 6240 
cbgtbtbccc ggggabcgca gbggbgagba accabgcabc abcaggagba cggabaaaab 6300 
gcbtgabggb cggaagaggc abaaabtccg bcagccagbt bagbcbgacc abcbcabcbg 6360 
baacabcabb ggcaacgcta ccbbbgccab gbbbcagaaa caacbcbggc gcabcgggcb 6420 
bcccabacaa bcgabagatb gbcgcaccbg abbgcccgac abbatcgcga gcccatbbat 6480 
acccababaa abcagcabcc abgbbggaab tbaabcgcgg ccbcgagcaa gacgtbtccc 6540 
gbtgaatabg gcbcabaaca ccccbtgbab bacbgbbbab gbaagcagac agtbtbabtg 6600 
bbcabgabga bababbbbba bcbbgbgcaa bgbaacabca gagatbbbga gacacaacgb 6660 
ggctbbcccc ccccccccat batbgaagca tbbatcaggg bbattgbcbc atgagcggab 6720 
acatabttga abgbatbbag aaaaataaac aaabaggggb bccgcgcaca bttccccgaa 6780 
aagbgccacc bgacgbcbaa gaaaccabba btabcabgac abtaaccbab aaaaabaggc 6840 
gbabcacgag gcccbbbcgb c 6 861 



<210> 87 
<211> 7552 
<212> DMA 

<213> Artificial Sequence 
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<220> 

<223> plasmid pVR1012x/s contain 
<400> 87 

tcgcgcgttt cggtgatgac ggtgaaaacc 
cagcttgtct gtaagcggat gccgggagca 
ttggcgggtg tcggggctgg cthaactatg 
accatatgcg gtgtgaaata ccgcacagat 
ctahtggcca ttgcatacgt tgtatccata 
tccaacatta ccgccatgtt gacattgatt 
ggggtcatta gttcatagcc catatatgga 
cccgcctggc tgaccgccca acgacccccg 
catagtaacg ccaataggga ctttccattg 
tgcccacttg gcagtacatc aagtgtatca 
tgacggtaaa tggcccgcct ggcattatgc 
ttggcagtac atctacgtat tagtcatcgc 
catcaatggg cgtggatagc ggtttgactc 
cgtcaatggg agtttgtttt ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agctcgttta gtgaaccgtc agatcgcctg 
tagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagtgacg taagtaccgc 
tcttatgcat gctatactgt ttttggcttg 
taggtgatgg tatagcttag cctataggtg 
tattggtgac gatactttcc attactaatc 
tattggctat atgccaatac tctgtccttc 
ggatggggtc ccafcttatta tttacaaatt 
cgcagttttt attaaacata gcgtgggatc 
catgggctct tctccggtag cggcggagct 
agcggctcat ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gtbgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gtcgtcgaca cgtgtgatca 
ccatgcgcgt gaaggagaag taccagcacc 
tgctgctggg catgctgatg atctgcagcg 
acggcgtgcc cgtgtggcgc gaggccacca 
cctacgacac cgaggtgcac aacgtgtggg 
acccccagga ggtggtgctg ggcaacgtga 
tggtggacca gatgcacgag gacatcatca 
tgaagctgac ccccctgtgc gtgaccctga 
ccaccaacct gaccagcagc agctggggca 
gcttctacat caccaccagc atccgcaaca 
gcctggacgt ggtgcccgtg aagaacacca 
acaccagcgt gatcacccag gcctgcccca 
actgcgtgcc cgccggcttc gccatcctga 
gcccctgcac caacgtgagc accgtgcagt 
cccagctgct gctgaacggc agcctggccg 
tcaccgacaa cgtgaagacc atcatcgtgc 
cccgccccaa caacaacacc cgcgagcgcc 
cccgccgcaa catcatcggc gacatccgcc 
ggaacaacac cctgcagcag atcgtgatca 
tcgccttcaa ccagagcagc ggcggcgacc 



mg HIV genes 

tctgacacat gcagctcccg gagacggtca 60 
gacaagcccg tcagggcgcg tcagcgggtg 12 0 
cggcatcaga gcagattgta ctgagagtgc 180 
gcgtaaggag aaaataccgc atcagattgg 24 0 
tcataatatg tacatttata ttggctcatg 300 
attgactagt tattaatagt aatcaattac 360 
gttccgcgtt acataactta cggtaaatgg 420 
cccattgacg tcaataatga cgtatgfctcc 480 
acgtcaatgg gtggagtatt tacggtaaac 540 
tatgccaagt acgcccccta ttgacgtcaa 600 
ccagtacatg accttatggg actttcctac 660 
tattaccatg gtgatgcggt tttggcagta 720 
acggggattt ccaagtctcc accccattga 780 
tcaacgggac tttccaaaat gtcgtaacaa 840 
gcgtgtacgg tgggaggtct atataagcag 900 
gagacgccat ccacgctgtt ttgacctcca 960 
cggccgggaa cggtgcattg gaacgcggat 1020 
ctatagactc tataggcaca cccchttggc 1080 
gggcctatac acccccgctt ccttatgcta 1140 
tgggttattg accattattg accactcccc 1200 
cataacatgg ctctttgcca caactatcfcc 1260 
agagactgac acggactctg tatttttaca 1320 
cacatataca acaacgccgt cccccgtgcc 13 80 
tccacgcgaa tctcgggtac gtgttccgga 1440 
tccacatccg agccctggtc ccatgcctcc 1500 
ctcctaacag tggaggccag acttaggcac 1560 
cacaaggccg tggcggtagg gfcatgtgtct 1620 
gctgacgcag atggaagact taaggcagcg 1680 
gtattctgat aagagtcaga ggtaactccc 1740 
gbagtctgag cagtactcgt tgctgccgcg 1800 
aacagactgt tcctttccat gggtcttttc 1860 
gatatcgcgg ccgctctaga gatatcgcca 1920 
tgtggcgctg gggctggcgc tggggcacca 1980 
ccaccgagaa gctgtgggtg accgtgtact 2040 
ccaccctgtt ctgcgccagc gacgccaagg 2100 
ccacccacgc ctgcgtgccc accgacccca 2160 
ccgagaactt caacatgtgg aagaacaaca 2220 
gcctgtggga cgagagcctg aagccctgcg 2280 
actgcaccaa cctgaacatc accaagaaca 23 4 0 
tgatggagga gggcgagahc aagaactgca 24 00 
aggtgaagaa ggagtacgcc ctgttcaacc 24 60 
gcaacaccaa gtaccgcctg atcagctgca 2520 
aggtgagctb ccagcccatc cccatccact 2580 
agtgcaacaa caagaccttc aacggcagcg 2640 
gcacccacgg catccgcccc gtggtgagca 2 700 
aggaggacat cgtgatccgc agcgaggact 2760 
agctgaacga gagcgtggtg atcaactgca 2 820 
tgagcatcgg ccccggccgc gccttctacg 2880 
aggcccactg caacatcagc cgcgccaagt 2940 
agctgcgcga gaagttccgc aacaagacca 3000 
ccgagatcgt gatgcacagc ttcaactgcg 3060 
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gcggcgagtt cbbcbactgc aacaccgccc agcbgbbcaa cagcacctgg aacgtggccg 3120 
gcggcaccaa cggcaccgag ggcaacgaca bcabcacccb gcagbgccgc abcaagcaga 3180 
tcatcaacat gtggcagaag gtgggcaagg ccabgbacgc cccccccatc accggccaga 3240 
tccgctgcag cagcaacatc accggcctgc tgctgacccg cgacggcggc aacagcaccg 3300 
agaccgagac cgagatcttc cgccccggcg gcggcgacat gcgcgacaac tggcgcagcg 3360 
agctgtacaa gtacaaggtg gtgcgcatcg agcccatcgg cgtggccccc acccgcgcca 3420 
agcgccgcac cgtgcagcgc gagaagcgcg ccgtgggcat cggcgccgtg bbccbgggct 34 80 
bcctgggcgc cgccggcagc accatgggcg ccgccagcgt gaccctgacc gtgcaggccc 3540 
gcctgctgct gagcggcatc gtgcagcagc agaacaacct gctgcgcgcc atcgaggccc 3 600 
agcagaacat gcbgcgccbg accgbgtggg gcatcaagca gctgcaggcc cgcgtgctgg 3660 
cccbggagcg cbacctgcgc gaccagcagc bgatgggcab ctggggctgc agcggcaagc 3720 
ttatctgcac caccagcgtg ccctggaacg bgagcbggag caacaagagc gtggacgaca 3780 
tctggaacaa catgacctgg abggagtggg agcgcgagat cgacaactac accgactaca 3840 
bcbacgacct gctggagaag agccagaccc agcaggagaa gaacgagaag gagctgctgg 3 900 
agctggacaa gtgggccagc cbgbggaacb ggbbcgacab caccaactgg ctgtggtaca 3960 
tccgcctgtt catcatgatc gbgggcggcc tgatcggcct gcgcatcgtg bbcgccgtgc 402 0 
bgagcatcgt gaaccgcgtg cgccagggct acagcccccb gagcttccag accctgctgc 4080 
ccgccagccg cggccccgac cgccccgagg gcaccgagga ggagggcggc gagcgcgacc 414 0 
gcgaccgcag cggccccagc gtgaacggca gcctggccct gabcbgggac gacctgcgca 4200 
gcctgtgcct gbbcagcbac caccgcctgc gcgaccbgcb gctgatcgbg acccgcatcg 4260 
bggagcbgct gggccgccgc ggctgggagg ccctgaagba cbggbggaac ctgctgcagb 4320 
acbggagcca ggagctgaag aacagcgccg tgagccbgct gcagtacggc bggagctacb 43 80 
tccacgaggc cgbgcaggcc gbgtggcgca gcgccaccga gacccbggcc ggcgcctggg 4440 
gcgaccbgtg ggagaccctg cgccgcggcg gccgctggab cbbggccabc ccccgccgca 4500 
bccgccaggg ccbggagcbg acccbgcbgb gaacacgbgg gabccagabc tgcbgbgcct 4 560 
bcbagbbgcc agccabcbgb bgbbbgcccc tcccccgbgc cbbccbbgac ccbggaaggb 4620 
gccacbccca cbgtccbbbc ctaabaaaat gaggaaatbg cabcgcatbg tcbgagbagg 4680 
bgbcabtcta bbcbgggggg bggggbgggg caggacagca agggggagga ttgggaagac 474 0 
aatagcaggc abgcbgggga tgcggtgggc bcbabgggba cccaggtgcb gaagaattga 4800 
cccggbbccb cctgggccag aaagaagcag gcacabcccc btcbcbgbga cacaccctgb 4860 
ccacgccccb ggbbcbbagb tccagcccca cbcataggac actcatagcb caggagggct 4 920 
ccgccbtcaa bcccacccgc baaagbactt ggagcggtcb cbcccbcccb cabcagccca 4980 
ccaaaccaaa cctagccbcc aagagbggga agaaabbaaa gcaagabagg cbabtaagbg 5040 
cagagggaga gaaaatgcct ccaacatgtg aggaagbaab gagagaaabc abagaabbbb 5100 
aaggccabca bggccbbaab cttccgcbbc cbcgcbcacb gacbcgcbgc gcbcggtcgb 5160 
bcggcbgcgg cgagcggbab cagcbcacbc aaaggcggba abacggbbat ccacagaabc 5220 
aggggabaac gcaggaaaga acabgbgagc aaaaggccag caaaaggcca ggaaccgbaa 5280 
aaaggccgcg bbgcbggcgb bbbbccabag gcbccgcccc ccbgacgagc abcacaaaaa 5340 
bcgacgctca agbcagaggb ggcgaaaccc gacaggacba taaagabacc aggcgbbbcc 5400 
ccctggaagc bcccbcgbgc gcbcbccbgb bccgacccbg ccgctbaccg gabaccbgbc 54 60 
cgccbbbcbc ccbtcgggaa gcgbggcgcb bbcbcabagc bcacgctgba ggbabctcag 5520 
bbcggbgtag gtcgbbcgct ccaagctggg cbgbgbgcac gaaccccccg bbcagcccga 5580 
ccgcbgcgcc bbabccggba acbabcgbcb bgagbccaac ccggtaagac acgacbbabc 5640 
gccacbggca gcagccacbg gbaacaggab bagcagagcg aggbabgbag gcggbgcbac 5700 
agagtbctbg aagbggbggc cbaacbacgg cbacacbaga agaacagbab bbggbabcbg 5760 
cgcbcbgcbg aagccagbba ccbbcggaaa aagagbbggb agcbcbbgab ccggcaaaca 5820 
aaccaccgcb ggbagcggbg gbbbbbttgb bbgcaagcag cagabbacgc gcagaaaaaa 5880 
aggatcbcaa gaagabccbt bgabcbbtbc bacggggbcb gacgcbcagt ggaacgaaaa 5940 
cbcacgbbaa gggabbbbgg bcabgagabb abcaaaaagg abcbbcaccb agabccbbbb 6000 
aaabbaaaaa bgaagbbbba aabcaatcba aagbababab gagbaaacbb ggbcbgacag 6060 
tbaccaabgc bbaabcagbg aggcaccbat cbcagcgabc bgbcbabbtc gbbcabccab 6120 
agtbgcctga ctcggggggg gggggcgcbg aggbcbgccb cgtgaagaag gbgbbgcbga 6180 
cbcabaccag gccbgaabcg ccccabcabc cagccagaaa gbgagggagc cacggbbgab 624 0 
gagagcbbbg btgbaggbgg accagtbggb gabtbbgaac bbtbgcbbbg ccacggaacg 6300 
gbcbgcgtbg bcgggaagab gcgbgabcbg abccbbcaac bcagcaaaag bbcgabbbab 6360 
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tcaacaaagc cgccgtcccg tcaagtcagc gtaatgctct gccagtgtta caaccaatta 6420 
accaattctg attagaaaaa cteatcgagc atcaaatgaa actgcaattt attcatatca 6480 
ggattatcaa taccatattt ttgaaaaagc cgtttctgta atgaaggaga aaactcaccg 6540 
aggcagttcc ataggatggc aagatcctgg tatcggtctg cgattccgac tcgtccaaca 6600 
tcaatacaac ctattaattt cccctcgtca aaaataaggt tatcaagtga gaaatcacca 6660 
tgagtgacga ctgaatccgg tgagaatggc aaaagcttat gcatttcttt ccagacttgt 6720 
tcaacaggcc agccattacg ctcgtcatca aaatcactcg catcaaccaa accgttattc 6780 
attcgtgatt gcgcctgagc gagacgaaat acgcgatcgc tgttaaaagg acaattacaa 6840 
acaggaatcg aatgcaaccg gcgcaggaac actgccagcg catcaacaat attttcacct 6900 
gaatcaggat atfccttctaa tacctggaat gctgttttcc cggggatcgc agtggtgagt 6960 
aaccatgcat catcaggagt acggataaaa tgcttgatgg tcggaagagg cataaattcc 7020 
gtcagccagt ttagtctgac catctcatct gtaacatcat tggcaacgct acctttgcca 7080 
tgtttcagaa acaactctgg cgcatcgggc ttcccataca atcgatagat tgtcgcacct 7140 
gattgcccga cattatcgcg agcccattta tacccatata aatcagcatc catgbtggaa 7200 
tttaatcgcg gcctcgagca agacgtttcc cgttgaatat ggctcataac accccttgta 7260 
ttactgttta tgtaagcaga cagttttatt gttcatgatg atatattttt atcttgtgca 7320 
atgtaacatc agagattttg agacacaacg tggctttccc ccccccccca ttattgaagc 7380 
atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa 7440 
caaatagggg ttccgcgcac atttccccga aaagtgccac ctgacgtcta agaaaccatt 7500 
attatcatga cattaaccta taaaaatagg cgtatcacga ggccctttcg tc 7552 



<210> 88 
<211> 6774 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 88 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagafc gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattacgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
99atggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
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cgcagttttt attaaacata gcghgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
cafcgggctct bcbccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggcbcat ggtcgctcgg cagctccttg ctcctaacag bggaggccag acttaggcac 1560 
agcacaabgc ccaccaccac cagfcgtgccg cacaaggccg bggcggbagg gbabgbgbcb 1620 
gaaaatgagc gtggagabbg ggctcgcacg gcbgacgcag atggaagact baaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagfccaga ggtaactccc 1740 
gtbgcggtgc hgttaacggt ggagggcagb gbagbctgag cagtactcgt tgcbgccgcg 1800 
cgcgccacca gacafcaatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgbgbgatca gafcatcgcgg ccgcbcbaga gatatcgcca 1920 
ccatgcgcgb ggagggcabc cagcgcaacb ggaagcagbg gbggatcbgg ggcabccbgg 1980 
gcbbcbggab ggbgatgabc bacaacgtgc gcggcaacct gbgggtgacc gbgbacbacg 2040 
gcgtgcccgb gbggaaggag gccaagacca cccbgttcbg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgbgggcca cccacgccbg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgcbggag aacgtgaccg agaacttcaa cabgbgggag aacgacabgg 2220 
tggagcagat gcaccaggac atcabcagcc tgbgggacca gagcctgaag cccbgcgbga 2280 
agctgacccc cctgtgcgtg accctgcacb gcagcaaccg caccabcgac bacaacaacc 2340 
gcaccgacaa catgggcggc gagabcaaga acbgcagcbb caacabgacc accgaggbgc 2400 
gcgacaagcg cgagaaggtg cacgccctgb tctaccgcct ggacatcgtg cccctgaaga 24 60 
acgagagcag caacaccagc ggcgactacc gcctgabcaa ctgcaacacc agcgccatca 2520 
cccaggcctg ccccaaggbg agcbbcgacc ccatccccab ccacbacbgc gcccccgccg 2580 
gctacgccat cctgaagtgc aacaacaaga ccbtcaacgg caccggcccc tgcaacaacg 2640 
bgagcaccat ccagbgcacc cacggcacca agcccgbggt gagcacccag cbgctgcbga 2700 
acggcagccb ggccgaggag gagabcabca tccgcagcaa gaacctgacc gacaacgbga 2760 
agaccabcab cgbgcaccbg aacgagagcg tggagatcaa cbgcacccgc cccaacaaca 2820 
acacccgcaa gagcabccgc abcggccccg gccaggccbb cbacgccacc ggcgagabca 2880 
tcggcgacat ccgccaggcc cacbgcaaca bcagccgcac cgccbggaac aagacccbgc 2940 
aggaggtggg caagaagcbg gccgagcacb tccccaacaa ggccatcaag bbcgccaagc 3 000 
acagcggcgg cgacctggag atcaccaccc acagcbbcaa cbgccgcggc gagttcttcb 3060 
acbgcaacac cagcagccbg btcaacagca ccbacacccc caacagcacc gagaacabca 3120 
ccggcaccga gaacagcabc atcaccabcc ccbgccgcab caagcagatc abcaacabgb 3180 
ggcagggcgb gggccgcgcc abgbacgccc cccccatcga gggcabcctg accbgccgca 3240 
gcaacabcac cggccbgcbg ctgacccgcg acggcggcac cggcabgcac gacaccgaga 33 00 
tctbccgccb gggcggcggc gacabgcgcg acaactggcg cagcgagctg bacaagbaca 3360 
aggtggbgaa gabcgagccc cbgggcgbgg cccccaccaa ggccaagcbb accgbgcagg 342 0 
cccgccagcb gcbgagcggc abcgbgcagc agcagaacaa ccbgcbgcgc gccatcgagg 3480 
cccagcagca ccbgcbgcag cbgaccgbgb ggggcabcaa gcagcbgcag gcccgcabcc 3540 
bggccgbgga gcgctaccbg aaggaccagc agctggagca gabcbggaac cacaccacct 3600 
ggabggagtg ggaccgcgag abcaacaact acaccagccb gabccacagc cbgatcgagg 3660 
agagccagaa ccagcaggag aagaacgagc aggagcbgct ggagcbggac aagbgggcca 3720 
gccbgbggaa cbggbbcaac abcaccaacb ggcbgbggbg aggabccaga bcbgcbgbgc 3780 
cbtcbagbbg ccagccabcb gbbgbbbgcc cctcccccgb gccttccbbg acccbggaag 3840 
gbgccacfccc cacbgbccbb bccbaabaaa abgaggaaab bgcabcgcab bgbctgagba 3900 
ggbgtcabbc battcbgggg ggtggggbgg ggcaggacag caagggggag gabbgggaag 3960 
acaabagcag gcabgctggg gabgcggbgg gcbcbatggg bacccaggbg ctgaagaabb 4020 
gacccggbbc cbccbgggcc agaaagaagc aggcacatcc cctbcbcbgt gacacacccb 4080 
gtccacgccc cbggbbcbba gbbccagccc cactcabagg acacbcabag cbcaggaggg 4140 
cbccgcctbc aabcccaccc gcbaaagbac tbggagcggb cbcbccctcc ctcatcagcc 4200 
caccaaacca aaccbagcct ccaagagbgg gaagaaabba aagcaagaba ggcbabtaag 4260 
tgcagaggga gagaaaabgc cbccaacatg tgaggaagta atgagagaaa bcabagaabb 4320 
tbaaggccab cabggcctba atcbbccgcb fccctcgctca cbgacbcgcb gcgcbcggbc 43 80 
gbtcggcbgc ggcgagcggb abcagcbcac tcaaaggcgg taabacggbb abccacagaa 4440 
bcaggggata acgcaggaaa gaacabgbga gcaaaaggcc agcaaaaggc caggaaccgb 4500 
aaaaaggccg cgbbgcbggc gttbtbccab aggcbccgcc ccccbgacga gcabcacaaa 4560 
aabcgacgct caagtcagag gbggcgaaac ccgacaggac babaaagaba ccaggcgbbb 4620 
cccccbggaa gcbcccbcgb gcgcbcbccb gttccgaccc bgccgcbbac cggabaccbg 4680 



- 206 - 



WO 02/32943 PCT/US01/25721 

tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg baggbabctc 4740 
agbbcggbgb aggbcgbbcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc 4800 
gaccgctgcg ccbbabccgg baacbabcgb cttgagtcca acccggbaag acacgactta 4860 
tcgccacbgg cagcagccac tggtaacagg atbagcagag cgaggtatgt aggcggbgcb 4920 
acagagbbcb bgaagbggbg gcctaactac ggcbacacba gaagaacagt atttggtatc 4980 
tgcgctctgc tgaagccagt taccttcgga aaaagagbbg gtagctcttg atccggcaaa 5040 
caaaccaccg cbggbagcgg bggbbbbbbb gtttgcaagc agcagattac gcgcagaaaa 5100 
aaaggabcbc aagaagabcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa 5160 
aacbcacgtt aagggabbbb ggtcabgaga btabcaaaaa ggabctbcac cbagabcctb 5220 
ttaaattaaa aabgaagbbb taaatcaatc baaagbabat atgagtaaac bbggbcbgac 5280 
agttaccaab gcbbaabcag bgaggcaccb atctcagcga tcbgbcbabb bcgtbcabcc 5340 
abagbtgccb gacbcggggg gggggggcgc bgaggbcbgc cbcgbgaaga aggbgtbgcb 5400 
gacbcafcacc aggccbgaab cgccccabca bccagccaga aagbgaggga gccacggfcbg 5460 
abgagagcbt bgbbgtaggb ggaccagbbg gtgatbbbga actbttgcbt bgccacggaa 5520 
cggbctgcgb tgbcgggaag abgcgbgatc bgabccbbca actcagcaaa agttcgattb 5580 
abbcaacaaa gccgccgbcc cgtcaagbca gcgtaatgct cbgccagbgb bacaaccaab 5640 
baaccaabbc bgabtagaaa aacbcabcga gcabcaaabg aaacbgcaat bbabbcatat 5700 
caggabbabc aabaccabab bbbbgaaaaa gccgtbbcbg taabgaagga gaaaacbcac 5760 
cgaggcagbb ccabaggabg gcaagabccb ggbatcggbc bgcgabbccg acbcgtccaa 5820 
catcaabaca accbabbaab bbccccbcgb caaaaabaag gbbabcaagb gagaaabcac 5880 
cabgagbgac gacbgaabcc ggtgagaabg gcaaaagcbb abgcatbbcb bbccagactb 5940 
gbbcaacagg ccagccatba cgcbcgbcab caaaabcact cgcabcaacc aaaccgbbab 6000 
bcabbcgbga bbgcgccbga gcgagacgaa abacgcgabc gcbgbbaaaa ggacaabbac 6060 
aaacaggaab cgaatgcaac cggcgcagga acactgccag cgcabcaaca atabbbbcac 6120 
cbgaatcagg ababbcbbcb aabaccbgga abgcbgbbbt cccggggabc gcagbggbga 6180 
gbaaccabgc abcabcagga gbacggataa aabgcbbgab ggbcggaaga ggcabaaabb 6240 
ccgbcagcca gbtbagbcbg accabcbcab cbgbaacatc abbggcaacg ctaccbbbgc 6300 
cabgbbbcag aaacaacbcb ggcgcabcgg gcbbcccaba caabcgatag atbgbcgcac 6360 
cbgabtgccc gacabbabcg cgagcccatb babacccata taaabcagca bccabgbbgg 6420 
aabbbaabcg cggccbcgag caagacgtbb cccgbbgaab abggcbcaba acaccccbbg 6480 
babbacbgbb babgtaagca gacagbbbba bbgbbcabga bgabababbb bbabcbbgbg 6540 
caabgtaaca bcagagabbb tgagacacaa cgbggcbbbc cccccccccc cabtabbgaa 6600 
gcabbbabca gggtbabbgb cbcabgagcg gabacabatb bgaatgbabb bagaaaaaba 6660 
aacaaabagg ggbbccgcgc acabbbcccc gaaaagbgcc accbgacgbc baagaaacca 6720 
bbabbabcab gacatbaacc babaaaaaba ggcgtabcac gaggcccbbb cgbc 6774 



<210> 89 
<211> 6846 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 89 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagctbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 120 
bbggcgggtg tcggggcbgg cbtaacbabg cggcabcaga gcagabtgba cbgagagbgc 180 
accababgcg gtgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 24 0 
cbabbggcca bbgcabacgb bgbabccaba bcataababg bacabbbaba bbggcbcabg 300 
bccaacabba ccgccabgbb gacabtgabb abbgacbagb babbaabagb aabcaabbac 360 
ggggbcabta gbbcabagcc cabababgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabtgacg bcaabaabga cgtabgtbcc 480 
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catagtaacg ccaataggga ctttccattg 
bgcccactbg gcagbacabc aagtgtatca 
tgacggtaaa tggcccgcct ggcattatgc 
ttggcagtac atctacgtat tagtcatcgc 
catcaatggg cgtggatagc ggbbbgacbc 
cgtcaatggg agtbbgbttb ggcaccaaaa 
ctccgcccca ttgacgcaaa bgggcggfcag 
agctcgttta gbgaaccgbc agatcgcctg 
tagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagbgacg baagtaccgc 
tcttatgcat gctatactgt bbbtggcttg 
taggtgatgg babagcbbag cctataggtg 
tattggtgac gatactttcc abbactaatc 
tabbggcbat abgccaabac tctgtccttc 
ggatggggtc ccatttatta tttacaaatt 
cgcagbtbbb attaaacata gcgtgggatc 
catgggctct tctccggtag cggcggagct 
agcggctcat ggbcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaabgagc gbggagatbg ggctcgcacg 
gcagaagaag atgcaggcag cbgagbbgtt 
gttgcggtgc tgttaacggt ggagggcagfc 
cgcgccacca gacataatag cbgacagact 
bgcagtcacc gtcgtcgaca cgtgtgatca 
ccabgcgcgb ggagggcatc cagcgcaact 
gcttctggab ggtgatgatc tacaacgtgc 
gcgtgcccgt gtggaaggag gccaagacca 
acgacgccga ggtgcacaac gtgtgggcca 
cccaggagab ggtgctggag aacgtgaccg 
tggagcagat gcaccaggac atcatcagcc 
agctgacccc cctgbgcgbg accctgcact 
gcaccgacaa catgggcggc gagatcaaga 
gcgacaagcg cgagaaggtg cacgcccbgt 
acgagagcag caacaccagc ggcgacbacc 
cccaggcctg ccccaaggtg agcttcgacc 
gctacgccat cctgaagbgc aacaacaaga 
tgagcaccat ccagbgcacc cacggcacca 
acggcagcct ggccgaggag gagatcatca 
agaccatcat cgtgcacctg aacgagagcg 
acacccgcaa gagcatccgc atcggccccg 
tcggcgacat ccgccaggcc cactgcaaca 
aggaggbggg caagaagctg gccgagcact 
acagcggcgg cgaccbggag atcaccaccc 
acbgcaacac cagcagcctg ttcaacagca 
ccggcaccga gaacagcatc atcaccatcc 
ggcagggcgt gggccgcgcc atgtacgccc 
gcaacabcac cggcctgcbg cbgacccgcg 
tcttccgcct gggcggcggc gacatgcgcg 
aggtggtgaa gabcgagccc ctgggcgtgg 
cccgccagct gcbgagcggc atcgtgcagc 
cccagcagca ccbgcbgcag ctgaccgtgb 
tggccgbgga gcgctacctg aaggaccagc 
ggatggagtg ggaccgcgag atcaacaacb 
agagccagaa ccagcaggag aagaacgagc 
gccbgtggaa ctggbbcaac abcaccaact 



acgtcaatgg gtggagtatt tacggtaaac 540 
batgccaagb acgcccccta ttgacgtcaa 600 
ccagtacabg accbtabggg actthcctac 660 
tattaccatg gtgabgcggt tttggcagta 720 
acggggabtb ccaagbcbcc accccabtga 780 
bcaacgggac tbbccaaaab gtcgbaacaa 840 
gcgtgtacgg tgggaggtcb atataagcag 900 
gagacgccab ccacgcbgtb ttgaccbcca 960 
cggccgggaa cggbgcatbg gaacgcggat 1020 
ctatagactc tataggcaca ccccttbggc 1080 
gggccbatac acccccgcbt ccbbatgcta 1140 
bgggbbabtg accatbabbg accactcccc 1200 
cabaacabgg cbcbbtgcca caacbabcbc 1260 
agagactgac acggactctg tatttttaca 1320 
cacabataca acaacgccgt cccccgtgcc 1380 
tccacgcgaa bctcgggbac gtgttccgga 144 0 
tccacatccg agccctggtc ccatgcctcc 1500 
cbccbaacag bggaggccag acbtaggcac 1560 
cacaaggccg bggcggtagg gtatgbgbct 1620 
gcbgacgcag atggaagact taaggcagcg 1680 
gbabtcbgab aagagtcaga ggtaacbccc 1740 
gbagtctgag cagtacbcgb bgcbgccgcg 1800 
aacagactgt tcctbtccat gggtcttttc 1860 
gababcgcgg ccgcbcbaga gababcgcca 1920 
ggaagcagbg gbggabcbgg ggcabcctgg 1980 
gcggcaacct gbgggbgacc gbgtacbacg 2 040 
ccctgttctg cgccagcgac gccaaggccb 2100 
cccacgccbg cgtgcccacc gaccccaacc 2160 
agaacbbcaa cabgtgggag aacgacabgg 2220 
tgbgggacca gagccbgaag ccctgcgtga 2280 
gcagcaaccg caccabcgac tacaacaacc 2340 
acbgcagcbb caacabgacc accgaggbgc 2400 
bcbaccgccb ggacatcgbg ccccbgaaga 2460 
gcctgabcaa cbgcaacacc agcgccatca 2520 
ccabccccab ccacbacbgc gcccccgccg 2580 
cctbcaacgg caccggcccc tgcaacaacg 2640 
agcccgbggb gagcacccag ctgctgcbga 2700 
bccgcagcaa gaaccbgacc gacaacgbga 2760 
bggagabcaa cbgcacccgc cccaacaaca 2820 
gccaggcctt cbacgccacc ggcgagabca 2880 
bcagccgcac cgccbggaac aagacccbgc 2940 
bccccaacaa ggccatcaag btcgccaagc 3000 
acagcbbcaa ctgccgcggc gagttctbct 3060 
ccbacacccc caacagcacc gagaacabca 3120 
cctgccgcat caagcagatc abcaacabgb 3180 
cccccatcga gggcatcctg acctgccgca 3240 
acggcggcac cggcatgcac gacaccgaga 3300 
acaacbggcg cagcgagcbg bacaagbaca 3360 
cccccaccaa ggccaagctt accgbgcagg 3420 
agcagaacaa ccbgcbgcgc gccabcgagg 34 80 
ggggcabcaa gcagcbgcag gcccgcabcc. 3540 
agcbggagca gabctggaac cacaccaccb 3600 
acaccagccb gabccacagc ctgabcgagg 3660 
aggagcbgcb ggagcbggac aagtgggcca 3720 
ggctgbggta cabcaagcbg bbcabcabga 3780 
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tcgbgggcgg cctggtgggc ctgcgcatcg tgttcgccgt gctgagcatc tgaggatcca 3 840 

gatctgctgt gcctbcbagb tgccagccab ctgttgtttg cccctccccc gtgccttcct 3900 

bgacccbgga aggtgccact cccactgtcc bbbcctaaba aaabgaggaa attgcatcgc 3960 

abbgbcbgag baggtgbcab tcbabbcbgg ggggbggggb ggggcaggac agcaaggggg 4020 

aggattggga agacaatagc aggcabgcbg gggabgcggb gggcbcbabg ggbacccagg 4 080 

bgcbgaagaa bbgacccggt tccbccbggg ccagaaagaa gcaggcacat cccctbcbct 4140 

gbgacacacc cbgbccacgc cccbggtbcb bagbbccagc cccactcata ggacacbcat 4200 

agcbcaggag ggcbccgccb tcaatcccac ccgcbaaagb acbbggagcg gbctcbccct 4260 

cccbcabcag cccaccaaac caaacctagc ctccaagagb gggaagaaat taaagcaaga 4320 

baggctatba agbgcagagg gagagaaaat gccbccaaca bgbgaggaag baabgagaga 43 80 

aabcatagaa bbbbaaggcc abcabggcct taabcttccg cbbccbcgcb cacbgacbcg 4440 

ctgcgcbcgg bcgbbcggcb gcggcgagcg gbabcagcbc acbcaaaggc ggtaabacgg 4 500 

btabccacag aabcagggga baacgcagga aagaacabgb gagcaaaagg ccagcaaaag 4 560 

gccaggaacc gbaaaaaggc cgcgbbgcbg gcgtbttbcc abaggcbccg cccccctgac 4620 

gagcabcaca aaaabcgacg cbcaagtcag aggbggcgaa acccgacagg actataaaga 4680 

baccaggcgt bbcccccbgg aagctcccbc gbgcgcbctc ctgtbccgac ccbgccgctt 4 740 

accggabacc tgbccgccbb bcbccctbcg ggaagcgbgg cgcbbbcbca bagctcacgc 4 800 

bgbaggbabc tcagbbcggb gtaggtcgbt cgcbccaagc bgggcbgbgt gcacgaaccc 4 860 

cccgttcagc ccgaccgcbg cgcctbabcc ggbaacbabc gtcbbgagtc caacccggta 4 920 

agacacgact babcgccact ggcagcagcc acbggtaaca ggabtagcag agcgaggbah 4 980 

gtaggcggbg cbacagagtt cttgaagtgg tggcctaact acggcbacac bagaagaaca 5 040 

gtabtbggba bcbgcgcbcb gctgaagcca gbbacctbcg gaaaaagagt tggbagctcb 5100 

tgabccggca aacaaaccac cgcbggtagc ggbggbbbtb btgbttgcaa gcagcagabt 5160 

acgcgcagaa aaaaaggabc bcaagaagat ccbbbgabcb bbbcbacggg gbcbgacgct 5220 

cagtggaacg aaaacbcacg bbaagggabt ttggbcabga gabbabcaaa aaggatcttc 5280 

acctagatcc bbbtaaabta aaaatgaagt ttbaaatcaa bcbaaagbat atatgagtaa 5340 

actbggbcbg acagttacca abgcttaatc agbgaggcac ctabcbcagc gabctgbcta 5400 

tttcgbbcat ccatagtbgc cbgacbcggg gggggggggc gcbgaggtct gccbcgbgaa 5460 

gaaggbgbbg cbgactcata ccaggccbga atcgccccab cabccagcca gaaagbgagg 5 520 

gagccacggb bgabgagagc bbbgbbgbag gbggaccagb bggbgabbbb gaacbtbtgc 5580 

btbgccacgg aacggbcbgc gbbgbcggga agabgcgbga bcbgabccbb caacbcagca 564 0 

aaagtbcgab tbabtcaaca aagccgccgt cccgbcaagb cagcgbaabg ctctgccagb 5700 

gttacaacca atbaaccaab bctgatbaga aaaacbcabc gagcabcaaa tgaaacbgca 5760 

abbbabbcab abcaggabba bcaabaccab abbbbbgaaa aagccgtbbc bgbaabgaag 5820 

, gagaaaacbc accgaggcag bbccabagga bggcaagatc ctggtabcgg bcbgcgabtc 5880 

cgactcgbcc aacabcaaba caaccbabba abbbcccctc gtcaaaaaba aggbtabcaa 5940 

gbgagaaabc accabgagbg acgacbgaat ccggbgagaa bggcaaaagc tbabgcattt 6000 

ctttccagac ttgttcaaca ggccagccat bacgctcgbc abcaaaatca ctcgcatcaa 6060 

ccaaaccgbt abbcattcgb gatbgcgcct gagcgagacg aaatacgcga tcgctgttaa 6120 

aaggacaabb acaaacagga abcgaabgca accggcgcag gaacacbgcc agcgcabcaa 6180 

caababtbbc accbgaabca ggababbcbt cbaabaccbg gaabgcbgbb bbcccgggga 6240 

bcgcagbggt gagbaaccab gcabcabcag gagbacggab aaaabgcbbg abggbcggaa 6300 

gaggcabaaa bbccgbcagc cagbttagbc bgaccabctc atcbgtaaca bcabbggcaa 6360 

cgcbaccbtt gccatgbttc agaaacaact ctggcgcatc gggcttccca tacaatcgat 6420 

agabbgtcgc accbgatbgc ccgacatbat cgcgagccca tttataccca tabaaatcag 64 80 

cabccabgbb ggaabtbaab cgcggccbcg agcaagacgb bbcccgbbga ababggcbca 654 0 

taacaccccb bgbabtacbg bbbabgbaag cagacagtbb batbgttcab gabgatabab 6600 

bbbbabcbbg tgcaabgtaa cabcagagat tbbgagacac aacgbggcbb bccccccccc 6660 

cccabbabbg aagcabbbab cagggbbatt gbctcabgag cggabacaba tbbgaabgba 6720 

bbtagaaaaa baaacaaaba ggggtbccgc gcacabbbcc ccgaaaagbg ccacctgacg 6780 

tctaagaaac catbabbatc abgacabtaa cctabaaaaa baggcgtabc acgaggcccb 6840 
tbcgbc 6846 
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<210> 90 
<211> 6853 
<2X2> DNA 

<213> Artificial Sequence 
<220> 

<22 3> plascnid pVR1012x/s containing HIV genes 
<400> 90 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
cgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1'800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 192 0 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
ccca 9gagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 22 80 
agctgacccc ccfcgtgcgtg accctgcact gcagcaaccg caccatcgac tacaacaacc 2340 
gcaccgacaa catgggcggc gagatcaaga actgcagctt caacatgacc accgaggtgc 24 00 
gcgacaagcg cgagaaggtg cacgccctgt tctaccgcct ggacatcgtg cccctgaaga 24 60 
acgagagcag caacaccagc ggcgactacc gcctgatcaa ctgcaacacc agcgccatca 2 520 
cccaggcctg ccccaaggtg agcttcgacc ccatccccat ccactactgc gcccccgccg 2580 
gctacgccat cctgaagtgc aacaacaaga ccttcaacgg caccggcccc tgcaacaacg 2640 
tgagcaccat ccagtgcacc cacggcacca agcccgtggt gagcacccag ctgctgctga 2700 
acggcagcct ggccgaggag gagatcatca tccgcagcaa gaacctgacc gacaacgtga 2760 



- 210' - 



WO 02/32943 PCT/US01/25721 

* 

agaccatcat cgtgcacctg aacgagagcg tggagatcaa ctgcacccgc cccaacaaca 2820 
acacccgcaa gagcatccgc atcggccccg gccaggcctt ctacgccacc ggcgagatca 2880 
tcggcgacat ccgccaggcc cacbgcaaca tcagccgcac cgcctggaac aagaccctgc 2540 
sggaggtggg caagaagcbg gccgagcact tccccaacaa ggccatcaag ttcgccaagc 3000 
acagcggcgg cgacctggag atcaccaccc acagcttcaa ctgccgcggc gagttcttct 3 060 
actgcaacac cagcagcctg btcaacagca cctacacccc caacagcacc gagaacatca 3120 
ccggcaccga gaacagcabc atcaccatcc cctgccgcat caagcagatc abcaacabgb 3180 
ggcagggcgt gggccgcgcc atgtacgccc cccccatcga gggcatcctg acctgccgca 3240 
gcaacatcac cggcctgctg cbgacccgcg acggcggcac cggcatgcac gacaccgaga 3300 
tcttccgccc cgagggcggc gacabgcgcg acaactggcg cagcgagctg tacaagtaca 3360 
aggbggbgga gatcaagccc ctgggcatcg cccccaccaa ggccaagctt accgtgcagg 3420 
tgcgccagct gctgagcggc abcgtgcagc agcagagcaa cctgctgcgc gccatcgagg 34 80 
cccagcagca cabgctgcag cbgaccgfcgt ggggcatcaa gcagctgcag acccgcgtgc 3540 
tggccatcga gcgctacctg cgcgaccagc agctcgagga catctggaac aacatgacct 3600 
ggatgcagtg ggaccgcgag atcagcaact acaccaacac catctaccgc ctgctggagg 3 660 
acagccagaa ccagcaggag aagaacgagc aggacctgct ggccctggac aagtggcaga 3720 
accbgbggac cbggbbcggc atcaccaact ggctgtggta catcaagatc bbcabcaaga 3780 
tcgtgggcgg cctgatcggc ctgcgcatca bcttcgccgt gcbgagcabc tgaacacgtg 3840 
ggabccagab ctgctgtgcc fctctagttgc cagccatctg ttgtttgccc ctcccccgtg 3900 
ccttccttga ccctggaagg bgccacbccc actgbccbbb cctaataaaa tgaggaaatfc 3960 
gcabcgcabb gbcbgagbag gtgtcattct atbctggggg gtggggtggg gcaggacagc 4020 
aagggggagg attgggaaga caatagcagg cabgcbgggg atgcggtggg cbcbabgggt 4080 
acccaggbgc tgaagaabbg acccggbbcc bccbgggcca gaaagaagca ggcacatccc 4140 
cbbcbcbgbg acacacccbg tccacgcccc tggttcbbag bbccagcccc acbcabagga 4200 
cactcabagc bcaggagggc tccgccttca abcccacccg cbaaagtacb bggagcggbc 4260 
bcbcccbccc bcabcagccc accaaaccaa acctagccbc caagagbggg aagaaabbaa 4320 
agcaagabag gctabtaagt gcagagggag agaaaabgcc bccaacatgt gaggaagbaa 4380 
bgagagaaat catagaabbt taaggccabc atggccbbaa bcttccgcbb ccbcgcbcac 4440 
tgacbcgcbg cgcbcggbcg bbcggcbgcg gcgagcggba tcagctcacb caaaggcggt 4 500 
aabacggbba bccacagaab caggggabaa cgcaggaaag aacabgtgag caaaaggcca 4560 
gcaaaaggcc aggaaccgba aaaaggccgc gbbgcbggcg bbbttccaba ggcbccgccc 4620 
cccbgacgag cabcacaaaa abcgacgcbc aagbcagagg bggcgaaacc cgacaggacb 4680 
abaaagabac caggcgbbbc ccccbggaag cbcccbcgbg cgcbcbcctg bbccgacccb 4740 
gccgcbbacc ggabaccbgb ccgccbbbcb cccbtcggga agcgbggcgc btbcbcabag 4800 
cbcacgcbgb aggbabctca gttcggbgba ggbcgbbcgc bccaagctgg gcbgbgbgca 4860 
cgaacccccc gbbcagcccg accgcbgcgc cttatccggt aactatcgbc bbgagbccaa 4920 
cccggbaaga cacgacbtab cgccacbggc agcagccacb ggbaacagga tbagcagagc 4980 
gaggbabgba ggcggbgcba cagagbbcbb gaagbggbgg ccbaacbacg gcbacacbag 5040 
aagaacagba bbbggbabcb gcgcbcbgcb gaagccagbb accbbcggaa aaagagtbgg 5100 
bagcbcbbga tccggcaaac aaaccaccgc tggbagcggb ggbtbtfctbg bbbgcaagca 5160 
gcagabbacg cgcagaaaaa aaggabcbca agaagabccb bbgabcbttt cbacggggbc 5220 
tgacgcbcag bggaacgaaa acbcacgbba agggabbtbg gbcabgagab babcaaaaag 5280 
gabcbbcacc bagabccbbb baaabbaaaa abgaagtbbb aaabcaabcb aaagtababa 5340 
tgagbaaacb bggbcbgaca gbbaccaabg cbbaabcagb gaggcaccba tcbcagcgab 5400 
cbgbcbabbb cgbbcatcca bagbbgccbg actcgggggg ggggggcgcb gaggbcbgcc 5460 
tcgbgaagaa ggbgtbgcbg actcabacca ggccbgaabc gccccabcat ccagccagaa 5520 
agbgagggag ccacggbbga bgagagcbbb gbtgbaggtg gaccagttgg bgabbbbgaa 5580 
cbtbbgctbb gccacggaac ggbcbgcgbb gbcgggaaga bgcgbgabcb gabccbbcaa 5640 
cbcagcaaaa gtbcgattta bbcaacaaag ccgccgbccc gbcaagbcag cgbaabgcbc 5700 
bgccagbgbb acaaccaatt aaccaatbcb gabbagaaaa actcabcgag cabcaaabga 5760 
aacbgcaabb babtcababc aggabbabca abaccababb bbtgaaaaag ccgbbbcbgb 5820 
aabgaaggag aaaacbcacc gaggcagbbc cabaggabgg caagabcctg gbabcggbcb 58 80 
gcgabbccga cbcgbccaac atcaabacaa ccbabbaabb bccccbcgbc aaaaataagg 594 0 
bbabcaagbg agaaabcacc abgagbgacg acbgaabccg gbgagaabgg caaaagcbba 6000 
bgcatbbcbb bccagacbtg bbcaacaggc cagccabbac gcbcgbcatc aaaabcacbc 6060 



- 211 - 



WO 02/32943 



PCT/US01/25721 



gcatcaacca aaccgfctatt cattcgtgat tgcgcctgag cgagacgaaa tacgcgatcg 6120 

x ctgttaaaag gacaattaca aacaggaatc gaatgcaacc ggcgcaggaa cactgccagc 6180 

gcatcaacaa tatfcttcacc tgaatcagga tattcttcta atacctggaa tgctgttttc 6240 

ccggggatcg cagtggtgag taaccatgca tcatcaggag tacggataaa atgcttgatg 6300 

gtcggaagag gcataaattc cgtcagccag tttagtctga ccatctcatc tgtaacatca 6360 

ttggcaacgc tacctttgcc atgtttcaga aacaactctg gcgcatcggg cttcccatac 6420 

aatcgataga ttgtcgcacc tgattgcccg acattatcgc gagcccattt atacccatat 6480 

aaatcagcat ccatgttgga atttaatcgc ggcctcgagc aagacgtttc ccgttgaata 6540 

tggctcataa caccccttgt attactgttt atgtaagcag acagttttat tgttcatgat 6600 

gatatatttt tatcttgtgc aatgtaacat cagagatttt gagacacaac gtggctttcc 6660 

cccccccccc attattgaag catttatcag ggttattgtc tcatgagcgg afcacatattt 6720 

gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca 6780 

cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg 6840 

aggccctttc gtc 6853 



<210> 91 
<211> 6761 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 91 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accfctatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tafcaggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggcfccat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
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gttgcggtgc bgtbaacggb ggagggcagt gtagtctgag cagtactcgt bgcbgccgcg 1800 
cgcgccacca gacataatag cbgacagact aacagactgt tccbbbccab gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gababcgcgg ccgcbcbaga gatatcgcca 1920 
ccatgcgcgt gcgcggcatc cagaccagct ggcagaacct gbggcgcbgg ggcaccabga 1980 
tcctgggcat gctggtgatc tacagcgccg ccgagaacct gtgggtggcc gtgtactacg 2040 
gcgtgcccgt gbggaaggac gccgagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacaccga ggtgcacaac gtgtgggaga cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggaggt ggacctggag aacgtgaccg aggacttcaa cabgtggcgc aacaacatgg 2220 
tggagcagab gcacaccgac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc ccbgbgcgbg accctggact gcaacgccac cgccagcaac gtgaccaacg 2340 
agatgcgcaa cbgcagcbbc aacatcacca ccgagcbgaa ggacaagaag cagcaggtgt 24 00 
acagcctgtt cbacaagcbg gacgtggtgc agatcaacga gaagaacgag accgacaagt 2460 
accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag gtgagcttcg 2520 
agcccatccc catccactac tgcgcccccg ccggctbcgc catcctgaag tgcaaggaca 2580 
ccgagttcaa cggcaccggc cccbgcaaga acgtgagcac cgtgcagtgc acccacggca 2640 
tccgccccgt gatcagcacc cagctgctgc tgaacggcag cctggccgag gagggcatcc 2700 
agabccgcag cgagaacatc accaacaacg ccaagaccat catcgtgcag ctggacaagg 2760 
ccgtgaagat caactgcacc cgccccaaca acaacacccg caagggcgtg cgcatcggcc 282 0 
ccggccaggc cttchacgcc accggcggca tcatcggcga catccgccag gcccactgcc 2 8 80 
acgtgagccg cgccaagtgg aacgacaccc tgcgcggcgt ggccaagaag ctgcgcgagc 2940 
acttcaagaa caagaccatc atcttcgaga agagcagcgg cggcgacatc gagatcacca 3000 
cccacagctt catctgcggc ggcgagbtcb tctactgcaa caccagcggc cbgtbcaaca 3 060 
gcacctggga gagcaacagc accgagagca acaacaccac cagcaacgac accatcaccc 3120 
tgacctgccg catcaagcag atcatcaaca bgbggcagaa ggtgggccag gccatgtacc 3180 
ccccccccat ccagggcgtg atccgctgcg agagcaacat caccggcctg ctgctgaccc 324 0 
gcgacggcgg caacaacagc accaacgaga tchtccgccc cggcggcggc aacabgcgcg 3300 
acaactggcg cagcgagctg tacaagtaca aggtggtgaa gatcgagccc ctgggcgtgg 3360 
cccccagccg cgccaagcbb accgcccagg cccgccagct gctgagcggc atcgtgcagc 342 0 
agcagagcaa cctgctgcgc gccatcgagg cccagcagca catgctgaag ctgaccgtgt 3480 
ggggcatcaa gcagctgcag gcccgcgtgc tggccgtgga gcgctacctg aaggaccagc 3540 
agcbcgagat ctgggacaac abgaccbggc tgcagtggga caaggagabc agcaaqbaca 3600 
cccagatcat ctacaacctg atcgaggaga gccagaacca gcaggagaag aacgagcagg 3660 
acctgctggc cctggacaag tgggccagcc tgtggaactg gttcgacatc agccgctggc 3720 
tgbggtgaac acgtgggatc cagabcbgct gbgccbbcta gbbgccagcc atctgttgtt 3780 
tgcccctccc ccgtgccbtc cbbgacccbg gaaggtgcca ctcccacbgt cctbtcctaa 3840 
taaaatgagg aaattgcatc gcattgbcbg agtaggbgbc abtcbabbct ggggggbggg 3900 
gbggggcagg acagcaaggg ggaggattgg gaagacaaba gcaggcabgc tggggabgcg 3960 
gtgggctcta bgggbaccca ggtgctgaag aabtgacccg gbbccbcctg ggccagaaag 4020 
aagcaggcac atccccbbct ctgtgacaca ccctghccac gccccbggtb cttagttcca 4080 
gccccacbca baggacacbc abagctcagg agggctccgc ctbcaatccc acccgcbaaa 4140 
gtacbbggag cggbcbcbcc cbcccbcabc agcccaccaa accaaaccta gccbccaaga 4200 
gtgggaagaa abbaaagcaa gabaggctab taagtgcaga gggagagaaa abgccbccaa 42 60 
cabgbgagga agtaatgaga gaaabcatag aabbbbaagg ccabgatbba aggccabcab 4320 
ggccbtaabc bbccgcbbcc tcgcbcacbg acbcgcbgcg ctcggbcgbb cggcbgcggc 4380 
gagcggbabc agctcactca aaggcggtaa bacggbbabc cacagaatca ggggabaacg 4440 
caggaaagaa cabgbgagca aaaggccagc aaaaggccag gaaccgbaaa aaggccgcgb 4500 
tgctggcgbb btbccatagg ctccgccccc cbgacgagca bcacaaaaat cgacgcbcaa 4560 
gbcagaggbg gcgaaacccg acaggacbat aaagatacca ggcgbbbccc ccbggaagct 4620 
ccctcgbgcg ctcbcctgbb ccgacccbgc cgctbaccgg abaccbgbcc gccbbtcbcc 4680 
ctbcgggaag cgbggcgcbb bcbcabagcb cacgctgtag gtabcbcagb bcggbgbagg 474 0 
tcgtbcgctc caagctgggc bgtgbgcacg aaccccccgb tcagcccgac cgcbgcgccb 4800 
tatccggbaa cbabcgtcbb gagbccaacc cggbaagaca cgacbbabcg ccacbggcag 4 860 
cagccacbgg baacaggabb agcagagcga ggbabgbagg cggtgcbaca gagbtctbga 4920 
agtggtggcc baactacggc tacactagaa gaacagbabb bggbabcbgc gcbcbgcbga 4980 
agccagtbac cfcbcggaaaa agagtbggba gcbcbbgabc cggcaaacaa accaccgcbg 5040 
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gtagcggtgg tttttttgtt bgcaagcagc 
aagabccttb gabctttbcb acggggtctg 
ggathttggt catgagatta tcaaaaagga 
gaagttttaa atcaatctaa agtatatatg 
taatcagtga ggcacctatc tcagcgatct 
tcgggggggg ggggcgctga ggtctgcctc 
cctgaabcgc cccatcabcc agccagaaag 
bgbaggbgga ccagttggtg attttgaact 
cgggaagatg cgbgabcbga tccttcaact 
gccgbcccgt caagtcagcg taatgctctg 
ttagaaaaac tcatcgagca tcaaabgaaa 
accatatttb tgaaaaagcc gtttctgtaa 
taggatggca agabccbggt atcggtctgc 
tattaatttc ccctcgtcaa aaataaggbt 
tgaatccggt. gagaatggca aaagcttatg 
gccattacgc tcgtcatcaa aatcactcgc 
cgccbgagcg agacgaaata cgcgatcgct 
atgcaaccgg cgcaggaaca ctgccagcgc 
ttcttctaat accbggaabg ctgttttccc 
atcaggagfca cggataaaat gcbtgabggb 
tagtctgacc atctcatctg taacatcatt 
caactctggc gcatcgggct tcccatacaa 
atbatcgcga gcccabbbab acccatataa 
cctcgagcaa gacgtttccc gtbgaafcatg 
gtaagcagac agbtbtabtg btcatgatga 
gagattttga gacacaacgt ggctttcccc 
ttattgtctc atgagcggat acatatttga 
tccgcgcaca tttccccgaa aagtgccacc 
attaacchat aaaaataggc gtatcacgag 



agattacgcg cagaaaaaaa ggatctcaag 5100 
acgctcagtg gaacgaaaac hcacgttaag 5160 
tcbtcaccba gatccttbta aatbaaaaat 5220 
agtaaacbtg gtctgacagt taccaatgct 5280 
gtctabtbcg ttcatccata gbtgcctgac 5340 
gtgaagaagg tgtbgctgac tcataccagg 5400 
tgagggagcc acggttgatg agagctbtgt 5460 
ttbgcbbbgc cacggaacgg fcctgcgtbgb 5520 
cagcaaaagt bcgabbtatt caacaaagcc 5580 
ccagtgbbac aaccaattaa ccaatbctga 5640 
cfcgcaabbba bbcatabcag gabbabcaat 5700 
tgaaggagaa aacbcaccga ggcagttcca 5760 
gabtccgacb cgtccaacat caatacaacc 5820 
abcaagbgag aaatcaccat gagbgacgac 5880 
catbtcbtbc cagactbgtt caacaggcca 5940 
atcaaccaaa ccgttabtca bbcgbgabbg 6000 
gttaaaagga caatbacaaa caggaatcga 6060 
abcaacaata ttbtcaccbg aatcaggaba 6120 
ggggatcgca gtggtgagta accabgcatc 6180 
cggaagaggc ataaattccg bcagccagbfc 624 0 
ggcaacgcba ccbbtgccat gttbcagaaa 6300 
bcgatagatb ^bcgcacctg abbgcccgac 6360 
abcagcatcc atgbbggaat bbaatcgcgg 6420 
gcbcataaca ccccttgtab bactgttbat 6480 
tatatttbta bcbtgbgcaa tgtaacabca 6540 
ccccccccat tattgaagca bbtatcaggg 6600 
abgbabbtag aaaaabaaac aaataggggt 6660 
fcgacgbcbaa gaaaccabta bbatcatgac 6720 
gcccttbcgt c 6761 



<210> 92 
<211> 6836 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVRioi2x/s containing HIV genes 
<400> 92 

tcgcgcgttb cggtgabgac ggbgaaaacc bcbgacacat gcagctcccg gagacggtca 60 
cagcttgbcb gbaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
fctggcgggtg tcggggctgg ctbaactatg cggcatcaga gcagattgta ctgagagbgc 180 
accatafcgcg gtgbgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt bgtatccata bcataatatg tacabbtaba ttggcbcabg 300 
bccaacatta ccgccabgbt gacattgabt abbgactagt tattaatagt aabcaabbac 360 
ggggtcabta gtfccabagcc cabababgga gtbccgcgbt acataactta cggtaaatgg 420 
cccgccbggc tgaccgccca acgacccccg cccatbgacg tcaataatga cgbabgttcc 480 
catagtaacg ccaabaggga cbtbccabtg acgbcaatgg gtggagbabt tacggtaaac 540 
bgcccacbtg gcagtacabc aagbgtatca babgccaagb acgcccccba btgacgtcaa 600 
tgacggtaaa tggcccgcct ggcatbatgc ccagtacatg accbbatggg acbtbccbac 660 
bbggcagbac atcbacgbab tagtcatcgc babbaccatg gtgabgcggt btbggcagta 720 
catcaatggg cgbggabagc ggttbgacbc acggggattt ccaagtctcc accccabbga 780 
cgbcaatggg agtbbgbtbb ggcaccaaaa tcaacgggac tfctccaaaat gtcgbaacaa 840 
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ctccgcccca ttgacgcaaa tgggcggtag gcgbgbacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agabcgccbg gagacgccab ccacgctgtb bbgaccbcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcabtg gaacgcggat 1020 
tccccgtgcc aagagfcgacg taagtaccgc ctabagacbc tataggcaca cccctttggc 1080 
tcttatgcat gcbatacbgb ttbbggcbbg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg babagcbbag cctataggtg tgggbbabbg accattattg accactcccc 1200 
babbggbgac gabacbbbcc attactaatc cataacatgg cbcbbbgcca caactatctc 1260 
tatbggctat abgccaatac tcbgbccttc agagactgac acggacbcbg babbbbbaca 1320 
99atggggtc ccabbbabba bbbacaaabt cacabataca acaacgccgb cccccgbgcc 1380 
cgcagbbbtb abbaaacaba gcgbgggabc tccacgcgaa tcbcgggbac gbgbbccgga 1440 
cabgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggfcc ccatgccbcc 1500 
agcggcbcab ggbcgctcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbatgtgbcb 1620 
gaaaatgagc gtggagatbg ggcbcgcacg gcbgacgcag atggaagacb baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgtb gbatbctgab aagagtcaga ggbaacbccc 1740 
gtbgcggbgc bgbbaacggt ggagggcagb gbagtcbgag cagbacbcgb bgcbgccgcg 18 00 
cgcgccacca gacabaatag cbgacagacb aacagacbgb tccbbtccab gggbctbbbc 1860 
bgcagtcacc gbcgbcgaca cgbgbgabca gatabcgcgg ccgcbctaga gabatcgcca 1920 
ccatgcgcgt gcgcggcabc cagaccagct ggcagaaccb gbggcgcbgg ggcaccabga 1980 
tcctgggcat gcbggbgatc tacagcgccg ccgagaaccb gbgggtggcc gtgbacbacg 2040 
gcgtgcccgt gbggaaggac gccgagacca ccctgbbcbg cgccagcgac gccaaggccb 2100 
acgacaccga ggbgcacaac gtgbgggaga cccacgccbg cgbgcccacc gaccccaacc 2160 
. cccaggaggb ggaccbggag aacgbgaccg aggacttcaa cabgtggcgc aacaacabgg 2220 
tggagcagab gcacaccgac atcatcagcc tgbgggacca gagcctgaag cccbgcgbga 2280 
agctgacccc ccbgtgcgbg accctggact gcaacgccac cgccagcaac gbgaccaacg 2340 
agatgcgcaa cbgcagctbc aacabcacca ccgagcbgaa ggacaagaag cagcaggbgb 2400 
acagccbgbb cbacaagcbg gacgbggbgc agatcaacga gaagaacgag accgacaagt 2460 
accgccbgab caacbgcaac accagcgcca tcacccaggc cbgccccaag gbgagctbcg 2520 
agcccabccc catccacbac bgcgcccccg ccggcbbcgc cabccbgaag bgcaaggaca 2580 
ccgagbbcaa cggcaccggc cccbgcaaga acgbgagcac cgbgcagtgc acccacggca 2640 
tccgccccgb gabcagcacc cagcbgcbgc bgaacggcag ccbggccgag gagggcabcc 2700 
agatccgcag cgagaacabc accaacaacg ccaagaccab cabcgbgcag cbggacaagg 2 760 
ccgbgaagab caacbgcacc cgccccaaca acaacacccg caagggcgbg cgcabcggcc 2820 
ccggccaggc cbbcbacgcc accggcggca bcatcggcga cabccgccag gcccacbgcc 2880 
acgtgagccg cgccaagbgg aacgacaccc bgcgcggcgb ggccaagaag cbgcgcgagc 2940 
acbtcaagaa caagaccabc abcbtcgaga agagcagcgg cggcgacabc gagabcacca 3000 
cccacagcbb cabcbgcggc ggcgagtbcb tcbacbgcaa caccagcggc cbgtbcaaca 3060 
gcacctggga gagcaacagc accgagagca acaacaccac cagcaacgac accabcaccc 3120 
bgacctgccg cabcaagcag atcabcaaca tgbggcagaa ggbgggccag gccabgbacc 3180 
ccccccccab ccagggcgtg abccgcbgcg agagcaacat caccggccbg ctgcbgaccc 324 0 
gcgacggcgg caacaacagc accaacgaga bcbbccgccc cggcggcggc aacabgcgcg 3300 
acaacbggcg cagcgagcbg bacaagbaca aggbggtgaa gabcgagccc cbgggcgtgg 33 60 
cccccagccg cgccaagcbb accgcccagg cccgccagcb gcbgagcggc abcgbgcagc 3420 
agcagagcaa cctgcbgcgc gccabcgagg cccagcagca cabgcbgaag cbgaccgbgb 34 80 
ggggcabcaa gcagcbgcag gcccgcgbgc bggccgbgga gcgcbaccbg aaggaccagc 354 0 
agctcgagab cbgggacaac atgaccbggc bgcagbggga caaggagabc agcaactaca 3 600 
cccagabcab cbacaaccbg abcgaggaga gccagaacca gcaggagaag aacgagcagg 3660 
accbgcbggc cctggacaag tgggccagcc bgtggaacbg gbtcgacabc agccgcbggc 3720 
tgbggtacab caagabctbc abcabgabcg bgggcggccb gatcggccbg cgcabcgbgb 3780 
tcgccgtgcb gagcgbgabc bgaacacgbg ggatccagab cbgcbgbgcc bbcbagbbgc 3840 
cagccabctg bbgbbbgccc ctcccccgbg ccbtccbbga cccbggaagg tgccactccc 3900 
acbgtccbbb cctaabaaaa bgaggaaabb gcabcgcatt gbcbgagbag gbgtcabbcb 3960 
atbcbggggg gtggggbggg gcaggacagc aagggggagg abtgggaaga caatagcagg 4020 
cabgctgggg atgcggbggg cbcbabgggb acccaggbgc bgaagaabtg acccggbbcc 40B0 
tcctgggcca gaaagaagca ggcacabccc ctbcbctgtg acacaccctg bccacgcccc 4140 



215 - 



WO 02/32943 PCT/US01/25721 

bggbbcbbag ttccagcccc actcatagga cactcatagc tcaggagggc tccgccttca 4200 

atcccacccg ctaaagtact tggagcggtc tctccctccc tcatcagccc accaaaccaa 4260 

acctagcctc caagagtggg aagaaattaa agcaagatag gctattaagt gcagagggag 432 0 

agaaaatgcc tccaacatgt gaggaagtaa bgagagaaat catagaattt taaggccatg 4380 

abtbaaggcc atcatggcct fcaatcttccg cbbccbcgcb cactgacfccg ctgcgctcgg 444 0 

bcgbbcggcb gcggcgagcg gtabcagctc actcaaaggc ggtaatacgg ttatccacag 4500 

aatcagggga taacgcagga aagaacabgt gagcaaaagg ccagcaaaag gccaggaacc 4560 

gtaaaaaggc cgcgbbgcbg gcgtttttcc abaggcbccg cccccctgac gagcatcaca 462 0 

aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga baccaggcgb 4680 

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 474 0 

tgtccgcctt tctcccttcg ggaagcgbgg cgcbbbctca tagcbcacgc bgtaggtabc 4800 

bcagbbcggt gbaggtcgbb cgcbccaagc tgggcbgtgt gcacgaaccc cccgbbcagc 4860 

ccgaccgcbg cgccbbabcc ggbaacbatc gbcbbgagbc caacccggba agacacgact 4520 

babcgccact ggcagcagcc acbggtaaca ggabtagcag agcgaggtat gtaggcggbg 4980 

ctacagagbb cbbgaagbgg bggccbaacb acggcbacac bagaagaaca gtatbbggba 5040 

bcbgcgcbcb gcbgaagcca gbbaccbbcg gaaaaagagb bggbagctct bgatccggca 5100 

aacaaaccac cgcbggbagc ggbggbbbtb bbgbbbgcaa gcagcagabt acgcgcagaa 5160 

aaaaaggabc bcaagaagat ccbbbgabcb bbbcbacggg gbcbgacgcb cagtggaacg 522 0 

aaaacbcacg bbaagggabt tbggbcabga gatbabcaaa aaggatcbbc accbagabcc 5280 

bbbbaaabba aaaabgaagb bbbaaabcaa bcbaaagtab ababgagbaa acbbggbcbg 5340 

acagtbacca abgcbbaabc agbgaggcac cbabcbcagc gabcbgbcba bbbcgbbcab 5400 

ccabagbbgc cbgacbcggg gggggggggc gctgaggbcb gccbcgbgaa gaaggbgbbg 5460 

cbgacbcaba ccaggccbga abcgccccab catccagcca gaaagtgagg gagccacggt 5520 

bgabgagagc bbbgbbgbag gbggaccagb bggbgatbbb gaacbtbbgc bbbgccacgg 5580 

aacggbcbgc gbtgbcggga agabgcgbga bcbgabccbb caacbcagca aaagbbcgat 5640 

m 

bbabbcaaca aagccgccgb cccgbcaagb cagcgbaabg cbcbgccagb gbbacaacca 5700 

abbaaccaab bcbgabbaga aaaacbcabc gagcabcaaa bgaaacbgca abbbabtcab 5760 

abcaggabba bcaabaccat abbbbbgaaa aagccgtbbc bgbaatgaag gagaaaacbc 5820 

accgaggcag bbccabagga tggcaagatc cbggbabcgg bctgcgabbc cgacbcgbcc 5880 

aacabcaaba caaccbabba abbbcccctc gbcaaaaaba aggbbabcaa gbgagaaabc 5940 

accatgagbg acgacbgaab ccggbgagaa bggcaaaagc bbabgcabbb cbbbccagac 6000 

bbgbbcaaca ggccagccab tacgcbcgbc abcaaaabca cbcgcabcaa ccaaaccgbb 6060 

abbcabbcgt gatbgcgcct gagcgagacg aaabacgcga bcgcbgbbaa aaggacaabt 6120 

acaaacagga abcgaabgca accggcgcag gaacactgcc agcgcabcaa caababtbtc 6180 

accbgaabca ggababbcbb cbaabaccbg gaabgcbgbb bbcccgggga bcgcagtggt 624 0 

gagbaaccat gcabcabcag gagbacggab aaaabgcbbg abggbcggaa gaggcataaa 63 00 

bbccgbcagc cagbbbagbc bgaccabcbc abcbgbaaca bcabbggcaa cgcbaccbbt 6360 

gccabgbbbc agaaacaacb cbggcgcatc gggcbtccca bacaatcgab agabbgtcgc 6420 

accbgabbgc ccgacabbat cgcgagccca bbtabaccca babaaabcag cabccatgtt 6480 

ggaabbbaab cgcggccbcg agcaagacgb bbcccgbbga ababggcbca baacacccct 6540 

tgbattacbg bbbabgbaag cagacagbbb babbgbbcab gabgatabab bbbbabcbbg 6600 

bgcaabgbaa cabcagagat tbbgagacac aacgbggcbb bccccccccc cccabbabbg 6660 

aagcabbbab cagggbbabb gbcbcabgag cggabacaba bbbgaabgba bbbagaaaaa 6720 

baaacaaaba ggggbbccgc gcacabbbcc ccgaaaagbg ccacctgacg bcbaagaaac 6780 

cabbabbabc abgacabbaa ccbabaaaaa baggcgbabc acgaggcccb bbcgbc 6836 
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<400> 93 

tcgcgcgttt cggtgatgac gghgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcbtgbcb gtaagcggat gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 120 
ttggcgggtg bcggggcbgg cttaactatg cggcabcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg bacabbbaba ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcafcagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccatbgacg bcaabaabga cgbabgbbcc 4 80 
catagtaacg ccaabaggga ctttccattg acgtcaatgg gtggagtatt bacggbaaac 54 0 
tgcccacbbg gcagtacatc aagtgtatca batgccaagb acgcccccba btgacgtcaa 600 
bgacggtaaa bggcccgcct ggcabbatgc ccagbacatg accbtatggg acbbtcctac 660 
bbggcagbac abctacgtab bagtcabcgc fcatbaccatg gtgabgcggb bbbggcagba 720 
cabcaabggg cgbggatagc ggbtbgacbc acggggattt ccaagbcbcc accccatbga 780 
cgtcaatggg agbtbgbbtb ggcaccaaaa bcaacgggac btbccaaaab gtcgtaacaa 840 
ctccgcccca bbgacgcaaa tgggcggbag gcgbgbacgg bgggaggbcb atataagcag 900 
agctcgbtba gbgaaccgtc agabcgccbg gagacgccab ccacgctgbb tbgacctcca 960 
tagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcabtg gaacgcggab 1020 
tccccgbgcc aagagbgacg taagbaccgc ctatagactc tabaggcaca cccctbtggc 1080 
tcbbatgcat gcbabacbgb ttbtggcbbg gggcctabac acccccgcbt cctbabgcba 1140 
taggbgabgg tabagcbbag cctataggbg bgggttabtg accabbabbg accacbcccc 1200 
tabtggbgac gatactbbcc atbactaatc cabaacatgg cbcbbbgcca caacbatcbc 1260 
babtggcbab abgccaatac bcbgbccbbc agagacbgac acggacbctg babbbbbaca 1320 
ggabggggtc ccatbbabba bbbacaaabb cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbbbbb abbaaacaba gcgtgggabc bccacgcgaa bcbcgggbac gbgbbccgga 1440 
cabgggcbcb tcbccggtag cggcggagcb bccacabccg agcccbggtc ccatgccbcc 1500 
agcggctcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbtaggcac 1560 
agcacaabgc ccaccaccac cagtgbgccg cacaaggccg bggcggbagg gbabgbgbcb 162 0 
gaaaabgagc gbggagabbg ggctcgcacg gcbgacgcag atggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gtabtcbgab aagagbcaga ggbaacbccc 1740 
gbbgcggbgc tgbbaacggt ggagggcagb gbagtcbgag cagtacbcgb bgcbgccgcg 18 00 
cgcgccacca gacataatag cbgacagacb aacagacbgt bccttbccab gggbcbbbbc 1860 
bgcagbcacc gtcgbcgaca cgbgbgabca gabatcgcgg ccgcbcbaga gababcgcca 1920 
ccatgcgcgb gaaggagacc cagabgaacb ggcctaacct gbggaagbgg ggcabccbga 1980 
bcctgggccb ggbgabcabc bgcagcgcca gcgacaacct gbgggbgacc gtgbacbacg 2040 
gcgbgcccgb gbggcgcgac gccgacacca cccbgbbcbg cgccagcgac gccaaggccc 2100 
acgagaccga ggbgcacaac gbgbgggcca cccacgccbg cgbgcccacc gaccccaacc 2160 
cccaggaggt ggaccbggag aacabcaccg agaacbbcaa cabgbggaag aacaacabgg 2220 
bggagcagab gcaggaggac gbgabcagcc bgbgggacca gagccbgaag cccbgcgbga 2280 
agcbgacccc ccbgbgcgbg acccbgaacb gcatcaacgc caaccbgacc aacgbgaaca 2340 
acabcabcca cggccccaac abcabcggca acabcaccga cgaggbgcgc aacbgcagcb 2400 
bcaacabgac caccgagatc cgcgacaaga agcagaaggb gcacgcccbg btcbacaagc 2460 
bggacctggb gcagatcgag gacaagaaca gcagcgagba ccgccbgabc aactgcaaca 2520 
ccagcgbgab caagcaggcc bgccccaaga bcagcbtcga ccccabcccc abccacbacb 2580 
gcacccccgc cggcbacgcc abccbgaagt gcaacgacaa gaacbbcaac ggcaccggcc 2640 
ccbgcaccaa cgbgagcagc gbgcagbgca cccacggcab caagcccgbg gbgagcaccc 2700 
agctgcbgcb gaacggcagc ctggccgagg aggagatcab cabccgcagc gagaaccbga 2760 
ccaacaacgc caagaccabc atcgbgcacc bgaacaagag cgbggagabc aactgcaccc 2820 
gccccagcaa caacacccgc accagcacca ccabcggccc cggccaggbg bbctaccgca 2880 
ccggcgacab caccggcaac abccgcaagg ccbacbgcga gabcaacggc accaagbgga 294 0 
acgaggcccb gaagcaggbg accaagaagc bgaaggagca cbbcaacaac aagaccabca 3000 
bcbtccagcc ccccagcggc ggcgaccbgg agabcaccab gcaccacbbc aacbgccgcg 3 060 
gcgagbbcbb cbacbgcaac accacccgcc bgbtcaacaa caccbgcabc ggcaacgaga 3120 
ccabcggcgg cbgcaacgac accabcabcc bgcccbgcaa gatcaagcag abcabcaaca 3180 
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bgbggcaggg cgccggcctg gccatgtacg ccccccccat cagcggccgc atcaactgcg 3240 
bgagcaacat caccggcatc ctgctgaccc gcgacggcgg cgccaacaac accagcaacg 3300 
agatcttccg ccccggcggc ggcaacabca aggacaactg gcgcagcgag ctgtacaagt 3360 
acaaggtggt gcagatcgag ccccbgggca bcgcccccac ccgcgccaag cttaccgtgc 3420 
aggcccgcca gctggbgagc ggcatcgtgc agcagcagag caaccbgcbg cgcgccatcg 34 80 
aggcccagca gcacctgctg cagctgaccg tgtggggcat caagcagctg caggcccgcg 3540 
bgcbggccgb ggagcgcbac ctgaaggacc agaagtbccb cgaggagabc tggagcaaca 3600 
tgacctggat cgagtgggag cgcgagatca gcaactacac caaccagabc tacgagatcc 3660 
tgaccgagag ccagaaccag caggaccgca acgagaagga cctgctggag ctggacaagt 3720 
gggcgagcct gtggagctgg ttcgacatca ccaactggct gtggtgaaca cgtgggatcc 3780 
agatctgctg tgccttctag tbgccagcca tctgttgttt gcccctcccc cgtgccttcc 3840 
ttgaccctgg aaggtgccac tcccactgtc cbbbccbaab aaaatgagga aattgcatcg 3900 
cattgtctga gtaggtgtca btcbabtcbg gggggtgggg tggggcagga cagcaagggg 3960 
gaggattggg aagacaatag caggcatgct ggggatgcgg tgggctctat gggtacccag 4020 
gbgcbgaaga attgacccgg bbccbccbgg gccagaaaga agcaggcaca tccccttctc 4080 
tgtgacacac ccbgbccacg cccctggttc ttagttccag ccccactcat aggacactca 4140 
tagctcagga gggctccgcc ttcaatccca cccgctaaag tacttggagc ggbcbcbccc 4200 
tccctcatca gcccaccaaa ccaaaccbag cctccaagag tgggaagaaa bbaaagcaag 4260 
ataggcbatb aagtgcagag ggagagaaaa tgcctccaac atgtgaggaa gtaatgagag 4320 
aaabcabaga abbbbaaggc catgatttaa ggccatcatg gccttaatct tccgcttcct 4380 
cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gcbcacbcaa 4440 
aggcggbaab acggbbabcc acagaabcag gggabaacgc aggaaagaac atgtgagcaa 4500 
aaggccagca aaaggccagg aaccgtaaaa aggccgcgbb gcbggcgtbb btccabaggc 4560 
bccgcccccc tgacgagcab cacaaaaabc gacgcbcaag bcagaggtgg cgaaacccga 4 62 0 
caggacbaba aagataccag gcgtbbcccc cbggaagctc cctcgbgcgc bcbccbgbbc 4680 
cgacccbgcc gcbbaccgga baccbgbccg ccbbbctccc bbcgggaagc gbggcgctbb 4740 
ctcabagcbc acgcbgbagg tabcbcagbb cggbgbaggb cgbbcgcbcc aagcbgggcb 4800 
gtgtgcacga accccccgbt cagcccgacc gcbgcgccbt atccggbaac babcgbcbtg 4860 
agbccaaccc ggbaagacac gacbbatcgc cacbggcagc agccacbggt aacaggabta 4920 
gcagagcgag gbabgbaggc ggbgcbacag agbbcbbgaa gbggbggccb aacbacggcb 4980 
acacbagaag aacagbatbb ggbabcbgcg ctcbgcbgaa gccagtbacc tbcggaaaaa 5040 
gagbbggbag ctcbbgabcc ggcaaacaaa ccaccgctgg tagcggtggb bbbtbbgbbb 5100 
gcaagcagca gabtacgcgc agaaaaaaag gabcbcaaga agabccbbbg abcbbbtcba 5160 
cggggbcbga cgcbcagtgg aacgaaaacb cacgbtaagg gabbbbggbc atgagatbat 5220 
caaaaaggat cbbcacctag abccbbbbaa atbaaaaabg aagbbtbaaa bcaabctaaa 5280 
gbabatabga gbaaacbbgg bcbgacagbb accaabgcbt aabcagbgag gcacctabcb 5340 
cagcgabctg bcbabbbcgb bcatccabag btgccbgact cggggggggg gggcgcbgag 5400 
gbcbgccbcg bgaagaaggb gtbgctgacb cabaccaggc ctgaabcgcc ccabcabcca 5460 
gccagaaagt gagggagcca cggbbgabga gagctbbgbt gbaggtggac cagbtggbga 5520 
ttbbgaacbb btgcbbbgcc acggaacggb cbgcgbbgbc gggaagabgc gbgabcbgab 5580 
cctbcaacbc agcaaaagbb cgabbbabbc aacaaagccg ccgtcccgtc aagbcagcgb 5640 
aabgcbcbgc cagbgbbaca accaabtaac caabtcbgab bagaaaaacb catcgagcab 5700 
caaabgaaac bgcaabbbab bcababcagg abbabcaaba ccatabbbtb gaaaaagccg 5760 
bbbcbgbaab gaaggagaaa acbcaccgag gcagbbccab aggabggcaa gatccbggba 5820 
bcggbcbgcg abbccgacbc gbccaacabc aabacaaccb abbaabbbcc cctcgbcaaa 5880 
aabaaggbba bcaagbgaga aabcaccabg agbgacgacb gaabccggbg agaabggcaa 5940 
aagcbbabgc abbbcbbbcc agactbgbbc aacaggccag ccabbacgcb cgbcabcaaa 6000 
abcacbcgca bcaaccaaac cgbbabbcab bcgtgabtgc gccbgagcga gacgaaabac 6060 
gcgabcgcbg bbaaaaggac aabbacaaac aggaabcgaa bgcaaccggc gcaggaacac 6120 
tgccagcgca bcaacaabab bbbcaccbga abcaggatab bcbbcbaaba ccbggaabgc 6180 
tgtbbbcccg gggabcgcag bggbgagbaa ccabgcatca bcaggagbac ggabaaaabg 6240 
cbtgabggbc ggaagaggca baaatbccgb cagccagtbb agbcbgacca bctcabcbgb 6300 
aacatcabbg gcaacgcbac cbtbgccabg bbtcagaaac aacbcbggcg cabcgggcbb 6360 
cccatacaab cgabagabbg bcgcaccbga bbgcccgaca bbatcgcgag cccabbbaba 6420 
cccababaaa bcagcabcca bgtbggaabb baabcgcggc cbcgagcaag acgbbbcccg 6480 
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ttgaatatgg ctcataacac cccttgtatt actgtttatg taagcagaca gttttattgt 6540 
tcatgatgat atatttttat cbtgtgcaat gtaacatcag agattttgag acacaacgtg 6600 
gctttccccc cccccccatt attgaagcat ttatcagggt tattgtctca tgagcggata 6660 
catatttgaa hgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa 6720 
agtgccacct gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg 6780 
tatcacgagg ccctttcgtc 6800 



<210> 94 
<211> 6872 
<212> DNA 

<213> Artificial Sequence 

■ 

<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 94 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 42 0 

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 

tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 

tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 

taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 

tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 

ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 144 0 

catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 

gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 

tgcagtcacc gtcgtcgaca cgtgtgatca gatafccgcgg ccgctctaga gatatcgcca 1920 

ccatgcgcgt gaaggagacc cagatgaact ggcctaacct gtggaagtgg ggcatcctga 1980 

tcctgggcct ggtgatcatc tgcagcgcca gcgacaacct gtgggtgacc gtgtactacg 204 0 

gcgtgcccgt gtggcgcgac gccgacacca ccctgttctg cgccagcgac gccaaggccc 2100 

acgagaccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 

cccaggaggt ggacctggag aacatcaccg agaacttcaa catgtggaag aacaacatgg 2220 
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tggagcagat gcaggaggac gtgatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc cctgtgcgtg accctgaact gcabcaacgc caacctgacc aacgtgaaca 2340 
acatcabcca cggccccaac atcahcggca acatcaccga cgaggtgcgc aactgcagct 2400 
tcaacatgac caccgagatc cgcgacaaga agcagaaggt gcacgccctg ttctacaagc 2460 
tggacctggt gcagatcgag gacaagaaca gcagcgagta ccgcctgatc aactgcaaca 2520 
ccagcgtgat caagcaggcc tgccccaaga tcagcbtcga ccccatcccc atccactact 2580 
gcacccccgc cggcbacgcc abccbgaagt gcaacgacaa gaactbcaac ggcaccggcc 2640 
cctgcaccaa cgtgagcagc gtgcagtgca cccacggcat caagcccgtg gtgagcaccc 2700 
agctgctgct gaacggcagc ctggccgagg aggagatcat catccgcagc gagaacctga 2760 
ccaacaacgc caagaccatc atcgtgcacc tgaacaagag cgtggagatc aactgcaccc 2820 
gccccagcaa caacacccgc accagcacca ccatcggccc cggccaggtg ttctaccgca 2880 
ccggcgacat caccggcaac atccgcaagg cctactgcga gatcaacggc accaagtgga 2940 
acgaggcccb gaagcaggtg accaagaagc. tgaaggagca cttcaacaac aagaccabca 3 000 
tctbccagcc ccccagcggc ggcgaccbgg agatcaccat gcaccacttc aactgccgcg 3060 
gcgagttcbb ctactgcaac accacccgcc tgttcaacaa cacctgcatc ggcaacgaga 3120 
ccatcggcgg ctgcaacgac accatcatcc tgccctgcaa gatcaagcag atcatcaaca 3180 
tgtggcaggg cgccggcctg gccatgtacg ccccccccat cagcggccgc atcaactgcg 3240 
tgagcaacat caccggcatc ctgctgaccc gcgacggcgg cgccaacaac accagcaacg 3300 
agatcttccg ccccggcggc ggcaacatca aggacaactg gcgcagcgag ctgtacaagt 3360 
acaaggtggt gcagatcgag cccctgggca tcgcccccac ccgcgccaag ctbaccgtgc 342 0 
aggcccgcca gctggtgagc ggcatcgtgc agcagcagag caacctgctg cgcgccatcg 34 80 
aggcccagca gcacctgctg cagctgaccg tgtggggcat caagcagctg caggcccgcg 3 54 0 
tgctggccgt ggagcgctac ctgaaggacc agaagttcct cgaggagatc tggagcaaca 3600 
tgacctggat cgagtgggag cgcgagatca gcaacbacac caaccagabc bacgagabcc 3 660 
bgaccgagag ccagaaccag caggaccgca acgagaagga cctgcbggag cbggacaagb 3 720 
gggcgagccb gtggagctgg tbcgacabca ccaacbggcb gbggbacatc aagatcbbca 3780 
bcabgabcgb gggcggccbg atcggccbgc gcatcabcbb cgccgbgcbg agcatcbgaa 3 84 0 
cacgbgggab ccagabcbgc bgbgcctbcb agbtgccagc cabcbgtbgb bbgcccctcc 3900 
cccgbgccbb ccbbgacccb ggaaggtgcc acbcccacbg tcctbbccba ataaaatgag 3960 
gaaabbgcab cgca;bbgbcb gagbaggbgb cattcbabbc bggggggbgg ggtggggcag 4020 
gacagcaagg gggaggabtg ggaagacaat agcaggcabg cbggggatgc ggbgggctct 4080 
abgggbaccc aggtgcbgaa gaatbgaccc ggttcctccb gggccagaaa gaagcaggca 4140 
catccccbbc bcbgbgacac acccbgtcca cgccccbggb bcttagttcc agccccacbc 4200 
abaggacact cabagctcag gagggctccg ccbbcaabcc cacccgcbaa agtacbtgga 4260 
gcggtcbcbc ccbcccbcab cagcccacca aaccaaaccb agcctccaag agtgggaaga 4320 
aabbaaagca agabaggcta ttaagbgcag agggagagaa aabgccbcca acabgbgagg 4380 
aagbaabgag agaaatcaba gaatbbtaag gccabgabbb aaggccabca bggccbbaab 4440 
cttccgcbbc cbcgctcact gacbcgcbgc gcbcggtcgb tcggcbgcgg cgagcggbab 4500 
cagctcacbc aaaggcggba abacggtbab ccacagaabc aggggabaac gcaggaaaga 4560 
acabgbgagc aaaaggccag caaaaggcca ggaaccgbaa aaaggccgcg bbgctggcgb 4620 
btbbccabag gctccgcccc ccbgacgagc abcacaaaaa tcgacgcbca agbcagaggt 4680 
ggcgaaaccc gacaggacba taaagatacc aggcgbbtcc cccbggaagc tccctcgbgc 4740 
gctcbccbgb bccgaccctg ccgcbbaccg gabaccbgbc cgccbbbcbc cctbcgggaa 4800 
gcgbggcgcb bbcbcatagc bcacgcbgba ggbabctcag bbcggbgbag gtcgbbcgct 4860 
ccaagcbggg cbgtgbgcac gaaccccccg bbcagcccga ccgcbgcgcc bbabccggba 4920 
acbabcgbcb bgagbccaac ccggbaagac acgactbabc gccacbggca gcagccacbg 4 980 
gbaacaggab bagcagagcg aggbabgtag gcggbgcbac agagbbcbtg aagbggtggc 5040 
cbaactacgg cbacacbaga agaacagbab bbggbabcbg cgctcbgcbg aagccagbta 5100 
ccbbcggaaa aagagbtggb agctcbbgab ccggcaaaca aaccaccgcb ggbagcggtg 5160 
gbbbbbtbgb btgcaagcag cagabbacgc gcagaaaaaa aggatcbcaa gaagabcctb 5220 
bgabcbbbtc tacggggbcb gacgctcagb ggaacgaaaa cbcacgbtaa gggabtbbgg 5280 
tcabgagatb abcaaaaagg abctbcaccb agabccbbtb aaatbaaaaa bgaagbtbta 5340 
aatcaatcta aagtatabat gagtaaacbb ggbcbgacag bta.ccaabgc bbaatcagbg 5400 
aggcaccbat cbcagcgabc tgbctabttc gbbcatccat agbtgccbga cbcggggggg 5460 
99999 c 9 ct 9 aggbcbgcct cgbgaagaag gtgbbgcbga cbcataccag gcctgaabcg 5520 
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ccccatcabc cagccagaaa gtgagggagc cac'ggbbgab gagagcbbbg bbgtaggbgg 5580 

accagttggt gattttgaac bbbbgcbtbg ccacggaacg gtctgcgttg tcgggaagat 5640 

gcgtgatctg atccttcaac tcagcaaaag ttcgatttat tcaacaaagc cgccgtcccg 5700 

tcaagtcagc gbaabgctcb gccagtgtta caaccaatta accaattcbg abbagaaaaa 5760 

cbcabcgagc atcaaatgaa actgcaattb abtcatatca ggattatcaa taccatattt 5820 

bbgaaaaagc cgtttctgta atgaaggaga aaactcaccg aggcagttcc ataggatggc 5880 

aagatcctgg fcatcggtctg cgabbccgac tcgtccaaca tcaatacaac cbabbaabbb 5940 

ccccbcgbca aaaataaggb tatcaagtga gaaabcacca bgagtgacga cbgaabccgg 6000 

bgagaabggc aaaagcttab gcabbtctbb ccagacttgt bcaacaggcc agccabbacg 6060 
cbcgbcabca aaabcactcg cabcaaccaa accgbbatbc abbcgbgatt gcgccbgagc. 6120 * 

gagacgaaab acgcgabcgc bgbbaaaagg acaatbacaa acaggaabcg aabgcaaccg 6180 

gcgcaggaac acbgccagcg cabcaacaab atbbbcacct gaabcaggab abbcbbctaa 6240 

baccbggaab gcbgbbbbcc cggggabcgc agbggbgagt aaccabgcab cabcaggagb 63 00 

acggabaaaa bgcbbgabgg bcggaagagg cabaaabbcc gbcagccagb tbagbcbgac 6360 

cabcbcabcb gbaacabcab bggcaacgcb accbbbgcca bgbtbcagaa acaacbcbgg 6420 

cgcabcgggc bbcccabaca abcgatagab bgbcgcaccb gabtgcccga catbabcgcg 64 BO 

agcccabbba bacccababa aabcagcabc cabgbbggaa bbbaabcgcg gccbcgagca 6540 

agacgbbbcc cgbbgaabab ggctcabaac accccbbgba bbacbgbbba bgbaagcaga 6600 

cagttbtabb gbbcatgabg abatabbbbt abcbtgbgca abgtaacabc agagabbbbg 6660 

agacacaacg bggcbtbccc ccccccccca tbabtgaagc atbbabcagg gtbabbgbcb 6720 

cabgagcgga bacabattbg aabgbabbba gaaaaabaaa caaabagggg btccgcgcac 67 80 

abttccccga aaagbgccac cbgacgbcba agaaaccafcb atbabcabga cabbaaccta 6840 

baaaaabagg cgbabcacga ggccctbbcg be 6872 



<210> 95 
<2X1> 6757 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/6 conbaining HIV genes 
<400> 95 

begegegbbb eggbgabgae ggbgaaaacc bctgacacab gcagcbcccg gagaeggbca 60 

cagctbgbcb gbaageggab geegggagea gacaagcccg beagggegeg beagegggbg 120 

bbggcgggbg beggggebgg ctbaacbatg eggcabcaga gcagabbgba cbgagagbgc 180 

accababgcg gbgbgaaaba ccgcacagab gegbaaggag aaaatacege abcagabbgg 24 0 

cbabbggcca bbgcabacgb tgbabecata bcabaababg bacatbbaba bbggcbcabg 300 

becaacabba ccgccabgtt gacabbgabb abbgacbagb babbaabagb aabcaabbac 360 

ggggbcabba gbbcabagee catababgga gbtccgcgbb acabaaebba eggbaaabgg 420 

cccgccbggc bgaccgccca acgacccccg cccattgacg bcaabaabga cgtabgbbcc 4 80 

cabagbaacg ccaabaggga ebbbecabbg aegbcaabgg gtggagbatt tacggbaaac 540 

bgcccacbbg gcagbacabc aagbgtabca babgecaagb acgcccccta tbgacgbcaa 600 

bgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbbabggg acbbbccbac 660 

bbggcagbac atebaegbab bagbcabege babbaccabg gbgabgcggt bbbggcagba 720 

catcaabggg cgtggatagc ggtbtgaebe aeggggabbb ccaagbcbcc accccabbga 780 

cgtcaabggg agbbbgbbbb ggcaccaaaa beaaegggae bbtccaaaab gbcgbaacaa 840 

cbccgcccca btgacgcaaa bgggeggbag gcgbgbacgg bgggaggbcb ababaagcag 900 

agebegbbba gtgaaccgbc agabegecbg gagacgecab ccacgcbgbb bbgaccbcca 960 

bagaagacac egggaccgab ccagcctccg eggcegggaa eggbgcabbg gaacgeggab 1020 

bccccgbgcc aagagbgacg baagbacege cbabagaebe bataggcaca ccccbbbggc 1080 

bebbabgcab gcbabacbgb bbttggcbbg gggecbabac acccccgcbb ccbtabgeba 1140 

baggbgabgg babagebbag cctabaggbg tgggbbabtg accabbabbg accacbcccc 1200 
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batbggtgac gatacbbbcc abtacbaatc cabaacabgg cbctbbgcca caacbatcbc 1260 
babbggcbab atgccaatac bcbgbccbbc agagactgac acggactctg tatttttaca 1320 
ggabggggbc ccabtbabba bbbacaaabb cacatataca acaacgccgb cccccgtgcc 1380 
cgcagbbtbb attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
cabgggcbcb bcbccggbag cggcggagct bccacabccg agccctggtc ccatgcctcc 1500 
agcggctcat gghcgctcgg cagctccttg cbccbaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagbgbgccg cacaaggccg tggcggtagg gbabgbgbct 1620 
gaaaatgagc gtggagattg ggcbcgcacg gcbgacgcag abggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagtbgtt gtattctgat. aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagbacbcgb tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgbgbgatca gabatcgcgg ccgcbcbaga gabatcgcca 1920 
ccatgcgbgb bcgbggbatc ccgcgbaacb ggccgcaghg gbggabgbgg ggtabccbgg 1980 
gtbbctggab gatcatcabc bgccgbgbtg bbggtaacat gbgggbbacc gbbbacbacg 2040 
gbgbbccggb btggaccgac gcbaaaacca cccbgbbcbg cgcbbccgac accaaagccb 2100 
acgaccgbga agbtcacaac gtbtgggcta cccacgcbbg cgttccgacc gacccgaacc 2160 
cgcaggaaafc cgbtcbggaa aacgbbaccg aaaacbbcaa cabgbggaaa aacgacabgg 2220 
btgaccagab gcacgaagac atcabcbccc bgbgggacca gbcccbgaaa ccgtgcgbba 22 80 
aacbgacccc gctgbgcgbt accctgcacb gcaccaacgc baccbbcaaa aacaacgtba 2340 
ccaacgacat gaacaaagaa abccgbaact gctccbbcaa caccaccacc gaaatccgbg 2400 
acaaaaaaca gcagggbbac gctcbgbtcb accgtccgga cabcgbbctg ctgaaagaaa 2460 
accgbaacaa ctccaacaac bccgaabaca bccbgabcaa cbgcaacgcb tccaccabca 2520 
cccaggctbg cccgaaagtt aactbcgacc cgabcccgat ccacbactgc gctccggcbg 2580 
gbbacgcbat cctgaaabgc aacaacaaaa ccbbcbccgg baaaggtccg tgcaacaacg 2640 
bbbccaccgb bcagbgcacc catggbabca aaccggbbgt tbccacccag cbgcbgcbga 2700 
acggbbcccb ggcbgaaaaa gaaabcabca bccgtbccga aaaccbgacc gacaacgbba 2760 
aaaccabcab cgbbcaccbg aacaaabccg bbgaaabcgt tbgcacccgb ccgaacaaca 2820 
acacccgbaa abccabgcgt abcggtccgg gbcagaccbb cbacgcbacc ggtgacabca 2880 
bcggbgacab ccgtcaggct bactgcaaca bctccggbbc caaabggaac gaaacccbga 2940 
aacgbgbbaa agaaaaactg caggaaaacb acaacaacaa caaaaccatc aaabbcgcbc 3000 
cgbccbccgg bggtgaccbg gaaabcacca cccacbccbt caacbgccgb ggtgaatbcb 3060 
bcbactgcaa caccacccgb ctgbbcaaca acaacgcbac cgaagacgaa accabcaccc 312 0 
bgccgbgccg babcaaacag abcabcaaca bgbggcaggg bgbbggbcgb gctabgbacg 3180 
cbccgccgab cgcbggbaac atcacctgca aatccaacat caccggbctg ctgcbggbbc 3240 
gtgacggbgg bgaagacaac aaaaccgaag aaabctbccg bccgggbggb ggtaacabga 3300 
aagacaacbg gcgbbccgaa ctgbacaaab acaaagtbat cgaacbgaaa ccgcbgggba 3360 
bcgcbccgac cggbgcbaag cbbaccgbbc aggctcgbca gcbgcbgbcc bccabcgbbc 3420 
agcagcagbc caaccbgcbg cgbgcbabcg aagcbcagca gcacabgctg cagctgaccg 3480 
bbbggggbab caaacagcbg cagacccgbg bbcbggcbab cgaacgtbac cbgaaagacc 3540 
agcagcbcga gabcbggaac aacabgacct ggabggaabg ggaccgbgaa abcbccaacb 3 600 
acaccgacac cabctaccgb cbgcbggaag actcccagac ccagcaggaa aaaaacgaaa 3660 
aagaccbgcb ggcbcbggac bccbggaaaa accbgbggbc cbggtbcgac abcbccaacb 3720 
ggcbgbggbg aggabccaga bcbgcbgbgc cbbcbagbbg ccagccabcb gtbgbbbgcc 3780 
ccbcccccgb gccbbccbbg acccbggaag gbgccacbcc cacbgbccbb bccbaabaaa 3 840 
abgaggaaab bgcabcgcab bgbcbgagba ggtgbcabbc babbcbgggg ggbggggbgg 3 900 
ggcaggacag caagggggag gabbgggaag acaabagcag gcabgctggg gabgcggbgg 3 960 
gcbcbabggg bacccaggbg cbgaagaabb gacccggbbc cbccbgggcc agaaagaagc 4 020 
aggcacabcc cctbcbctgb gacacacccb gbccacgccc ctggbbcbba gbbccagccc 4080 
cacbcabagg acactcabag cbcaggaggg cbccgccbbc aabcccaccc gcbaaagbac 4140 
bbggagcggb cbcbcccbcc cbcabcagcc caccaaacca aaccbagccb ccaagagbgg 42 00 
gaagaaabta aagcaagaba ggcbatbaag bgcagaggga gagaaaabgc cbccaacabg 4 260 
bgaggaagta abgagagaaa bcabagaabb bbaaggccab gabbbaaggc cabcabggcc 4320 
tbaabctbcc gcbbccbcgc bcactgacbc gcbgcgctcg gtcgbbcggc bgcggcgagc 4380 
ggbabcagcb cacbcaaagg cggbaatacg gbbabccaca gaabcagggg abaacgcagg 4440 
aaagaacabg bgagcaaaag gccagcaaaa ggccaggaac cgbaaaaagg ccgcgbbgcb 45 00 
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ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagbca 4560 
gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg gaagcbcccb 4620 
cgbgcgcbcb cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc 4680 
gggaagcgtg gcgctttctc atagctcacg ctgbaggtat ctcagttcgg tgtaggtcgt 4740 
tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgcb gcgccttatc 4800 
cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc 4860 
cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg 4920 
gtggcctaac tacggctaca ctagaagaac agbatbbggb abctgcgcbc bgctgaagcc 4980 
agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag 5040 
cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga 5100 
tccttbgabc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat 5160 
tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag 5220 
tbbbaaabca atctaaagta tatatgagta aacttggbct gacagttacc aatgcttaat 5280 
cagtgaggca cctatctcag cgatctgtct atbbcgttca tccatagttg cctgactcgg 5340 
9999999999 cgctgaggtc tgcctcgtga agaaggtgtt gctgactcat accaggcctg 54 00 
aatcgcccca tcatccagcc agaaagtgag ggagccacgg ttgatgagag ctttgttgta 5460 
ggtggaccag ttggtgattt tgaacttttg ctttgccacg gaacggtcfcg cgttgtcggg 5520 
aagatgcgtg atctgatcct tcaactcagc aaaagttcga tttattcaac aaagccgccg 5580 
tcccgtcaag tcagcgtaat gctctgccag tgttacaacc aattaaccaa ttctgattag 5640 
aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt atcaatacca 5700 
tatttttgaa aaagccgttt ctgtaatgaa ggagaaaact caccgaggca gttccatagg 5760 
abggcaagat cctggtatcg gtctgcgatt ccgactcgtc caacabcaab acaaccbabb 5820 
aatttcccct cgtcaaaaat aaggttatca agtgagaaat caccatgagt gacgactgaa 5880 
tccggtgaga atggcaaaag cttatgcatt tctttccaga cttgttcaac aggccagcca 5940 
ttacgctcgt catcaaaatc actcgcatca accaaaccgt tattcattcg tgattgcgcc 6000 
tgagcgagac gaaatacgcg atcgctgtta aaaggacaat tacaaacagg aatcgaatgc 6060 
aaccggcgca ggaacacfcgc cagcgcafcca acaatatttt cacctgaatc aggatattct 612 0 
tctaataccfc ggaatgctgt tttcccgggg atcgcagtgg tgagbaacca bgcabcabca 6180 
ggagtacgga taaaatgctt gatggtcgga agaggcabaa atbccgbcag ccagbbbagb 624 0 
ctgaccabcb cabctgbaac abcabbggca acgcbaccbb tgccatgtbb cagaaacaac 6300 
bcbggcgcab cgggcbbccc abacaabcga tagabtgbcg caccbgabbg cccgacatba 6360 
bcgcgagccc abtbabaccc ababaaabca gcabccabgb bggaabbbaa bcgcggccbc 6420 
gagcaagacg ttbcccgbbg aatabggctc ataacacccc ttgbabbacb gbtbabgbaa 6480 
gcagacagbt bbabtgttca bgatgababa bbbbbafccbb gbgcaabgta acabcagaga 654 0 
bbbbgagaca caacgbggcb btcccccccc ccccabtabb gaagcatbba tcagggtbab 6600 
bgbctcatga gcggatacab attbgaabgb abbtagaaaa abaaacaaab aggggtbccg 6660 
cgcacabbbc cccgaaaagb gccacctgac gtcbaagaaa ccatbatbab cabgacabba 6720 
accbataaaa abaggcgtab cacgaggccc bbbcgtc 6757 



<210> 96 
<211> 6829 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVRl012x/s containing HIV genes 
<400> 96 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggbca 60 
cagcttgtcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 12 0 
btggcgggbg tcggggctgg ctbaacbatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gbgtgaaaba ccgcacagat gcgbaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcabaababg tacabbbaba btggcbcabg 300 
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tccaacatta ccgccatgbb gacabtgabt atbgacbagt tabbaabagb aatcaattac 360 

ggggbcatba gttcatagcc cabababgga gbbccgcgbb acabaacbba cggtaaabgg 420 

cccgcctggc bgaccgccca acgacccccg cccattgacg bcaabaatga cgtatgttcc 480 

catagtaacg ccaabaggga ctttccattg acgtcaatgg gbggagbabb bacggtaaac 540 

tgcccacttg gcagbacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 

tgacggtaaa tggcccgcct ggcattabgc ccagbacatg accbbabggg actttcctac 660 

ttggcagtac atctacgtat tagtcatcgc babtaccabg gtgatgcggt tttggcagta 720 

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagbctcc accccattga 780 * 

cgbcaabggg agbbbgbbbb ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 

ctccgcccca ttgacgcaaa tgggcggtag gcgbgtacgg bgggaggbct atataagcag 900 

agctcgttta gtgaaccgtc agatcgcctg ga'gacgccab ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggbgcabbg gaacgcggat 1020 

tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 

tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 

taggtgatgg babagcttag cctataggtg tgggttattg accattattg accactcccc 1200 

tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

tattggctat abgccaabac bcbgtccbbc agagacbgac acggactctg batbbbbaca 1320 

ggatggggtc ccatttatta bbfcacaaatt cacatataca acaacgccgt cccccgtgcc 13 80 

cgcagttttt attaaacata gcgtgggatc tccacgcgaa bcbcgggbac gtgttccgga 1440 

catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggcbcab ggtcgctcgg cagctccttg cbccbaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtabgbgtcb 1620 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag cbgagttgbb gbabbcbgat aagagtcaga ggtaactccc 174 0 

gtbgcggbgc bgtbaacggt ggagggcagt gtagtctgag cagtactcgt bgcbgccgcg 1800 

cgcgccacca gacabaabag cbgacagacb aacagacbgb bccbbtccat gggbcbbbbc 1860 

bgcagbcacc gbcgbcgaca cgbgbgabca gababcgcgg ccgcbcbaga gababcgcca 1920 

ccabgcgtgb bcgbggbatc ccgcgfcaacb ggccgcagbg gbggabgbgg ggtatccbgg 1980 

gbtbcbggab gabcabcatc bgccgfcgbbg tbggtaacab gbgggtbacc gtbbacbacg 2040 

gbgtbccggt bbggaccgac gcbaaaacca cccbgbbcbg cgcbbccgac accaaagccb 2100 

acgaccgbga agbtcacaac gbbtgggcba cccacgctbg cgbtccgacc gacccgaacc 2160 

cgcaggaaab cgbhctggaa aacgbbaccg aaaactbcaa cabgbggaaa aacgacabgg 2220 

bbgaccagab gcacgaagac abcabcbccc bgbgggacca gbcccbgaaa ccgbgcgbba 2280 

aacbgacccc gcbgbgcgtt acccbgcacb gcaccaacgc baccbbcaaa aacaacgtba 234 0 

ccaacgacab gaacaaagaa abccgtaact gcbccttcaa caccaccacc gaaatccgbg 2400 

acaaaaaaca gcagggbbac gcbcbgtbcb accgtccgga cabcgbbcbg ctgaaagaaa 2460 

accgbaacaa cbccaacaac bccgaabaca bccbgabcaa cbgcaacgcb bccaccabca 2520 

cccaggcbbg cccgaaagbt aacbbcgacc cgabcccgab ccacbacbgc gcbccggcbg 2580 

gbbacgcbab ccbgaaabgc aacaacaaaa cctbctccgg baaaggtccg bgcaacaacg 2640 

bbbccaccgb tcagtgcacc cabggtabca aaccggbtgb bbccacccag cbgcbgcbga 2700 

acggbtcccb ggctgaaaaa gaaatcabca bccgbtccga aaaccbgacc gacaacgbba 2760 

aaaccabcab cgbbcacctg aacaaatccg tbgaaatcgb btgcacccgt ccgaacaaca 2820 

acacccgbaa atccatgcgb atcggtccgg gtcagaccbb cbacgcbacc ggbgacabca 2880 

tcggbgacat ccgtcaggct tactgcaaca bcbccggtbc caaabggaac gaaacccbga 2940 

aacgbgbbaa agaaaaacbg caggaaaacb acaacaacaa caaaaccabc aaabbcgcbc 3000 

cgbccbccgg tggtgaccbg gaaatcacca cccactcctb caactgccgb ggtgaabbct 3 060 

bcbactgcaa caccacccgt ctgbbcaaca acaacgctac cgaagacgaa accabcaccc 3120 

bgccgbgccg babcaaacag atcabcaaca bgtggcaggg bgbbggtcgb gcbabgbacg 3180 

ctccgccgab cgcbggtaac atcacctgca aatccaacab caccggbcbg cbgcbggbbc 3240 

gbgacggbgg bgaagacaac aaaaccgaag aaabcbbccg bccgggbggb ggbaacabga 3 3 00 

aagacaacbg gcgbbccgaa ctgbacaaab acaaagbbab cgaacbgaaa ccgctgggba 3360 

bcgcbccgac cggtgctaag ctbaccgbtc aggcbcgtca gctgcbgbcc tccabcgbtc 3420 

agcagcagbc caacctgctg cgtgcbatcg aagcbcagca gcacabgcbg cagcbgaccg 3480 

bttggggbab caaacagctg cagacccgtg bbcbggctab cgaacgttac cbgaaagacc 354 0 

agcagcbcga gabcbggaac aacabgaccb ggabggaabg ggaccgtgaa abcbccaact 3600 
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acaccgacac catctaccgt ctgctggaag actcccagac ccagcaggaa aaaaacgaaa 3 660 
aagaccbgcb ggcbcfcggac tcctggaaaa acctgtggtc ctggttcgac atctccaact 3720 
ggctgtggta catcaaaatc ttcatcatga tcgttggtgg fcctgatcggt ctgcgtatca 3780 
tcttcgctgt tcbgbccatc tgaggatcca gatctgctgt gccbbcbagb tgccagccat 3 840 
ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc 3 900 
tttcctaata aaatgaggaa attgcatcgc abtgtctgag taggtgtcat tctattctgg 3 960 
gg99tggggt ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgctg 402 0 
Qggatgcggt 999 ctctat 9 99 taccca 99 tgctgaagaa bbgacccggb tcctcctggg 4080 
ccagaaagaa gcaggcacat ccccbbcbct gtgacacacc ctgtccacgc ccchggttct 414 0 
tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct tcaatcccac 4200 
ccgcbaaagt acttggagcg gbctctcccb ccctcabcag cccaccaaac caaaccbagc 4260 
ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg gagagaaaat 4320 
gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc atgatbbaag 4380 
gccatcatgg ccttaatctt ccgcttcctc gctcacfcgac tcgctgcgct cggtcgttcg 4440 
gctgcggcga gcggtabcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg 4500 
ggabaacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa 4560 
ggccgcgbbg ctggcgbbbb bccabaggcb ccgcccccct gacgagcatc acaaaaatcg 4620 
acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 4680 
tggaagcbcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat accbgbccgc 4740 
cbtbctcccb bcgggaagcg tggcgctttc bcatagctca cgctgbaggt atctcagbtc 4800 
ggbgtaggtc gttcgcbcca agctgggcbg bgtgcacgaa ccccccgbbc agcccgaccg 4 860 
cbgcgccbta bccggtaact abcgbcttga gbccaacccg gbaagacacg actbabcgcc 4920 
actggcagca gccacbggba acaggabbag cagagcgagg batgtaggcg gtgcbacaga 4 980 
gtbcttgaag tggbggccta actacggcba cacbagaaga acagbabbbg gtatcbgcgc 504 0 
bctgcbgaag ccagtbaccb bcggaaaaag agfctggbagc bcbbgabccg gcaaacaaac 5100 
caccgctggt agcggbggbt tbtbtgtbbg caagcagcag abtacgcgca gaaaaaaagg 5160 
abcbcaagaa gatcctbbga bcttbbctac ggggtcbgac gcbcagtgga acgaaaacbc 5220 
acgbtaaggg atbbtggtca bgagabbabc aaaaaggatc bbcaccbaga tccttbbaaa 5280 
bbaaaaabga agbbbbaaat caabcbaaag babatabgag baaacbbggt cbgacagtba 5340 
ccaatgcbba abcagbgagg caccbabcbc agcgabcbgb cbabtbcgbb cabccabagb 5400 
bgccbgacbc gggggggggg ggcgcbgagg bctgccbcgb gaagaaggbg bbgcbgactc 54 60 
abaccaggcc tgaabcgccc cabcatccag ccagaaagbg agggagccac ggbtgabgag 5520 
agcbttgbtg baggbggacc agbbggtgat tbtgaacttb bgcbbbgcca cggaacggtc 558 0 
bgcgbtgbcg ggaagabgcg bgabcbgatc cbbcaacbca gcaaaagbbc gabbbabbca 5640 
acaaagccgc cgbcccgbca agtcagcgba abgcbcbgcc agbgbbacaa ccaabbaacc 5700 
aabbctgabb agaaaaacbc abcgagcatc aaabgaaacb gcaabbbabb cababcagga 5760 
tbabcaabac cabatbbbbg aaaaagccgt bbcbgbaatg aaggagaaaa ctcaccgagg 5820 
cagbbccaba ggabggcaag abccbggtab cggbctgcga bbccgacbcg bccaacabca 5880 
abacaaccta btaabtbccc cbcgbcaaaa abaaggbbat baagbgagaa abcaccabga 5 940 
gbgacgacbg aabccggbga gaabggcaaa agcbbabgca bbbcbbbcca gactbgbbca 6000 
acaggccagc cabbacgcbc gbcabcaaaa bcacbcgcab caaccaaacc gbbabbcabb 6060 
cgbgabbgcg ccbgagcgag acgaaabacg cgabcgctgb baaaaggaca abbacaaaca 6120 
ggaatcgaat gcaaccggcg caggaacacb gccagcgcat caacaababb btcaccbgaa 618 0 
tcaggatatb ctbcbaabac cbggaatgcb gbtbbcccgg ggabcgcagb ggbgagbaac 6240 
cabgcabcab caggagbacg gabaaaabgc bbgabggtcg gaagaggcab aaattccgbc 6300 
agccagbbba gtcbgaccab cbcabcbgba acabcabbgg caacgcbacc bbbgccabgb 6360 
bbcagaaaca acbcbggcgc abcgggcbbc ccabacaabc gatagabbgb cgcaccbgab 6420 
bgcccgacab babcgcgagc ccabbbabac ccababaaab cagcabccab gtbggaabbt 64 8 0 
aabcgcggcc bcgagcaaga cgbbbcccgb tgaababggc bcabaacacc ccbbgtabba 6540 
cbgtbbabgb aagcagacag bbbbatbgbb catgabgata batbbbbabc btgbgcaabg 6600 
baacatcaga gabbbtgaga cacaacgbgg cbtbcccccc ccccccatba bbgaagcabb 6660 
babcagggbb abbgbcbcab gagcggabac ababbbgaab gtabbbagaa aaabaaacaa 6720 
abaggggbbc cgcgcacabb bccccgaaaa gbgccacctg acgbcbaaga aaccabbabb 6780 
afccabgacab baaccbabaa aaabaggcgb atcacgaggc ccbbbcgtc 6829 
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<210> 97 
<211> 6752 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVRl012x/s containing HIV genes 
<400> 97 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 204 0 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc cctgtgcgtg accctgcact gcagcaaccg atcgaagaac tgcagcttca 2340 
acatgaccac cgaggtgcgc gacaagcgcg agaaggtgca cgccctgttc taccgcctgg 2400 
acatcgtgcc cctgaagaac gagagcagca acaccagcgg cgactaccgc ctgatcaact 2460 
gcaacaccag cgccatcacc caggcctgcc ccaaggtgag cttcgacccc atccccatcc 2520 
actactgcgc ccccgccggc tacgccatcc tgaagtgcaa caacaagacc ttcaacggca 2580 
ccggcccctg caacaacgtg agcaccatcc agtgcaccca cggcaccaag cccgtggtga 2640 
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gcacccagcb gcbgcbgaac ggcagccfcgg 
accbgaccga caacgtgaag accatcabcg 
gcacccgccc caacaacaac acccgcaaga 
acgccaccgg cgagatcatc ggcgacatcc 
cctggaacaa gaccctgcag gaggtgggca 
ccatcaagtt cgccaagcac agcggcggcg 
gccgcggcga gttcttctac tgcaacacca 
acagcaccga gaacatcacc ggcaccgaga 
agcagabcab caacabgbgg cagggcgbgg 
gcabccbgac ctgccgcagc aacatcaccg 
gcatgcacga caccgagabc. ttccgccccg 
gcgagcbgba caagtacaag gbggbggaga 
ccaagcttac cgtgcaggtg cgccagctgc 
tgctgcgcgc catcgaggcc cagcagcaca 
agctgcagac ccgcgtgctg gccatcgagc 
tctggaacaa cabgaccbgg abgcagbggg 
tctaccgcct gctggaggac agccagaacc 
ccctggacaa gtggcagaac ctgtggacct 
cacgtgggat ccagatctgc bgbgccbbct 
cccgtgcctt ccttgaccct ggaaggtgcc 
gaaattgcat cgcattgtct gagtaggtgt 
gacagcaagg gggaggattg ggaagacaat 
abgggbaccc aggtgctgaa gaattgaccc 
catcccctbc bcbgbgacac accctgtcca 
ataggacact cafcagctcag gagggctccg 
gcggtctctc cctccctcat cagcccacca 
aattaaagca agataggcta ttaagtgcag 
aagtaatgag agaaatcata gaattttaag 
cbtccgcttc ctcgctcact gactcgctgc 
cagctcacbc aaaggcggta atacggbbab 
acatgtgagc aaaaggccag caaaaggcca 
tttbccatag gctccgcccc ccbgacgagc 
ggcgaaaccc gacaggacta taaagatacc 
gctctcctgt tccgaccctg ccgcttaccg 
gcgtggcgct bbcbcabagc tcacgctgta 
ccaagctggg cbgbgbgcac gaaccccccg 
acbabcgbcb tgagtccaac ccggtaagac 
gtaacaggat tagcagagcg aggtatgbag 
cbaacbacgg cbacacbaga agaacagbab 
cctbcggaaa aagagbbggb agctcbbgab 
gbtbtbbbgb bbgcaagcag cagabbacgc 
bgabcbbbbc tacggggbcb gacgcbcagb 
bcabgagabb abcaaaaagg atcbtcaccb 
aabcaatcba aagbababab gagbaaacbb 
aggcaccbab cbcagcgabc bgbcbabbbc 
gggggcgcbg aggbcbgccb cgbgaagaag 
ccccabcabc cagccagaaa gtgagggagc 
accagbbggb gabbbtgaac bbbbgcbbtg 
gcgbgabcbg abcctbcaaq bcagcaaaag 
bcaagbcagc gbaabgcbcb gccagbgbba 
cbcatcgagc abcaaatgaa acbgcaabbb 
bbgaaaaagc cgbbbcbgta abgaaggaga 
aagatccbgg babcggtcbg cgabtccgac 
cccctcgtca aaaabaaggb tabcaagbga 
bgagaabggc aaaagcbbab gcabbbcbbb 



cc 9 a 99agga gabcabcabc cgcagcaaga 2700 
bgcaccbgaa cgagagcgbg gagabcaacb 2760 
gcatccgcab cggccccggc caggccbtcb 2820 
gccaggccca cbgcaacabc agccgcaccg 2880 
agaagcbggc cgagcacbbc cccaacaagg 2940 
accbggagab caccacccac agcbbcaacb 3000 
gcagccbgbb caacagcacc bacaccccca 3060 
acagcabcab caccatcccc tgccgcatca 3120 
gccgcgccat gbacgccccc cccabcgagg 3180 
gccbgcbgcb gacccgcgac ggcggcaccg 3240 
agggcggcga cabgcgcgac aacbggcgca 33 00 
tcaagccccb gggcabcgcc cccaccaagg 3360 
bgagcggcab cgbgcagcag cagagcaacc 3420 
bgcbgcagcb gaccgtgbgg ggcatcaagc 3480 
gcbacctgcg cgaccagcag cbcgaggaca 354 0 
accgcgagab cagcaacbac accaacacca 3600 
agcaggagaa gaacgagcag gaccbgcbgg 3660 
ggbbcggcab caccaacbgg cbgbggbgaa 3720 
agbbgccagc cabcbgtbgb bbgccccbcc 3780 
actcccactg bcctttccba ataaaabgag 3840 
cabbcbatbc bggggggbgg ggbggggcag 3900 
agcaggcabg cbggggabgc ggtgggcbcb 3960 
ggbbccbccb gggccagaaa gaagcaggca 4 020 
cgccccbggb bctbagbbcc agccccacbc 4080 
ccbbcaabcc cacccgcbaa agbacbbgga 4140 
aaccaaaccb agccbccaag agbgggaaga 4200 
agggagagaa aabgccbcca acabgbgagg 4260 
gccabgabbb aaggccabca bggccbbaab 4320 
gcbcggbcgt bcggcbgcgg cgagcggbab 4380 
ccacagaabc aggggabaac gcaggaaaga 4440 
ggaaccgtaa aaaggccgcg bbgcbggcgb 4500 
abcacaaaaa bcgacgcbca agbcagaggfc 4560 
aggcgbbbcc cccbggaagc bcccbcgbgc 4620 
gabaccbgbc cgccbbbcbc ccbbcgggaa 4680 
ggbabcbcag bbcggbgbag gbcgbbcgcb 4740 
bbcagcccga ccgcbgcgcc bbabccggba 4800 
acgacbbabc gccacbggca gcagccacbg 4860 
gcggbgcbac agagbbcbbg aagbggbggc 4920 
bbggbabcbg cgcbctgcbg aagccagbba 4980 
ccggcaaaca aaccaccgcb ggbagcggbg 5040 
gcagaaaaaa aggabcbcaa gaagabccbb 5100 
ggaacgaaaa cbcacgtbaa gggabbbbgg 5160 
agabccbbbb aaabbaaaaa tgaagbbbba 5220 
ggtcbgacag bbaccaatgc bbaatcagbg 5280 
gbbcatccab agbbgccbga cbcggggggg 534 0 
gbgbbgcbga cbcabaccag gccbgaabcg 5400 
cacggbbgab gagagctbbg bbgbaggbgg 5460 
ccacggaacg gbcbgcgbbg bcgggaagab 5520 
bbcgabtbab bcaacaaagc cgccgbcccg 5580 
caaccaabba accaabbcbg abtagaaaaa 5640 
abtcababca ggabbabcaa baccababbb 5700 
aaacbcaccg aggcagbbcc abaggabggc 5760 
bcgbccaaca bcaabacaac cbabbaatbb 5820 
gaaabcacca bgagbgacga cbgaabccgg 5880 
ccagacbbgb bcaacaggcc agccabbacg 5940 
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cbcgtcatca aaatcacbcg catcaaccaa accgbtattc abtcgbgatb gcgcctgagc 6000 

gagacgaaat acgcgatcgc tgttaaaagg acaattacaa acaggaatcg aatgcaaccg 6060 

gcgcaggaac actgccagcg cabcaacaat atbbbcaccb gaatcaggat attcttctaa 6120 

baccbggaab gctgttbbcc cggggatcgc agbggtgagt aaccabgcat catcaggagb 6180 

acggataaaa tgctbgabgg tcggaagagg cabaaatbcc gtcagccagb btagtcbgac 624 0 

catctcafcct gtaacatcat tggcaacgct accbttgcca tgtttcagaa acaactctgg 6300 

cgcatcgggc ttcccataca abcgatagat tgtcgcacct gattgcccga cattatcgcg 6360 

agcccattta tacccatata aatcagcatc catgttggaa tttaatcgcg gccbcgagca 6420 

agacgcbbcc cgttgaatat ggctcataac accccttgta ttactgttta tgtaagcaga 6480 

cagbtbtabt gttcatgatg atababttbb atcttgtgca abgtaacatc agagabtbtg 6540 

agacacaacg tggcbbbccc ccccccccca tbattgaagc abbtatcagg gtbatbgbcb 6600 

catgagcgga tacababttg aabgbattba gaaaaabaaa caaatagggg btccgcgcac 6660 

abtbccccga aaagbgccac ctgacgtcba agaaaccatb abtatcabga cabtaaccba 6720 

baaaaatagg cgtabcacga ggcccbbtcg tc 6752 



<210> 98 
<211> 6623 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 98 

bcgcgcgttt cggbgabgac ggbgaaaacc tcbgacacat gcagctcccg gagacggtca 60 
cagcttgtcb gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggbg 12 0 
ttggcgggtg bcggggctgg cbtaactatg cggcatcaga gcagattgta cbgagagtgc 180 
accatabgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc abcagattgg 240 
ctattggcca ttgcatacgt tgtabccata tcataatabg tacabtbaba tbggctcatg 3 00 
tccaacafcba ccgccabgtt gacattgabt attgactagt tatbaatagb aabcaatbac 360 
ggggtcatba gtbcatagcc cababatgga gttccgcgbt acabaacbta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataabga "cgtatgttcc 480 
catagbaacg ccaataggga ctbtccattg acgtcaatgg gbggagtatb tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tabgccaagt acgcccccta ttgacgtcaa 600 
tgacggbaaa bggcccgccb ggcattatgc ccagtacabg accbtatggg actttcctac 660 
ttggcagtac atctacgbat tagbcabcgc tabtaccatg gbgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggbttgactc acggggattt ccaagtcbcc accccabtga 780 
cgtcaatggg agttcgbttt ggcaccaaaa bcaacgggac btbccaaaat gbcgtaacaa 84 0 
ctccgcccca btgacgcaaa bgggcggtag gcghgtacgg tgggaggbct abataagcag 900 
agctcgttta gbgaaccgtc agatcgccbg gagacgccab ccacgctgtt bbgacctcca 960 
bagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcabbg gaacgcggat 1020 
bccccgtgcc aagagtgacg baagtaccgc ctabagactc bataggcaca cccctttggc 1080 
tcttatgcab gctatacbgt ttbbggcttg gggcctabac acccccgcbt ccttatgcta 1140 
taggtgafcgg tatagcttag ccbataggtg tgggttattg accattatbg accactcccc 1200 
tattggtgac gatacbttcc attacbaatc cafcaacatgg cbcbttgcca caactabctc 1260 
battggctat atgccaatac tctgbcctbc agagacbgac acggacfcctg tattttbaca 1320 
ggatggggbc ccattbabta bbtacaaatb cacababaca acaacgccgt cccccgbgcc 1380 
cgcagttttt attaaacata gcgbgggatc tccacgcgaa bctcgggbac gtgbtccgga 1440 
catgggcbct bctccggtag cggcggagct tccacabccg agccctggbc ccatgccbcc 1500 
agcggctcab ggtcgcbcgg cagcbcctfcg ctcctaacag bggaggccag actbaggcac 1560 
agcacaatgc ccaccaccac cagbgbgccg cacaaggccg tggcggtagg gtabgtgtct 1620 
gaaaatgagc gtggagatbg ggcbcgcacg gcbgacgcag abggaagact taaggcagcg 1680 
gcagaagaag abgcaggcag ctgagbtgbb gbabtctgat aagagbcaga ggtaacbccc 1740 
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gbbgcggbgc bgbbaacggb ggagggcagt gtagtctgag cagbacbcgt bgctgccgcg 1800 
cgcgccacca gacabaabag ctgacagact aacagactgt bccbbtccab gggtcttttc 1860 
tgcagtcacc gbcgbcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 192 0 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcbbctggab ggtgatgatc hacaacgbgc gcggcaacct gtgggbgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggbgcacaac gbgbgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacabgg 2220 
bggagcagat gcaccaggac abcatcagcc bgbgggacca gagccbgaag cccbgcgbga 2280 
agcbgacccc ccbgbgcgbg accctgcacb gcagcaaccg abcgabcaac tgcaacacca 2340 
gcgccabcac ccaggccbgc cccaaggbga gcbtcgaccc cabccccabc cactacbgcg 2400 
cccccgccgg cbacgccabc cbgaagbgca acaacaagac cbbcaacggc accggccccb 2460 
gcaacaacgb gagcaccabc cagbgcaccc acggcaccaa gcccgtggbg agcacccagc 2520 
bgcbgcbgaa cggcagccbg gccgaggagg agabcabcat ccgcagcaag aaccbgaccg 2580 
acaacgbgaa gaccabcabc gbgcaccbga acgagagcgb ggagatcaac bgcacccgcc 2640 
ccaacaacaa cacccgcaag agcabccgca bcggccccgg ccaggccbbc bacgccaccg 2700 
gcgagabcat cggcgacabc cgccaggccc acbgcaacat cagccgcacc gccbggaaca 2760 
agacccbgca ggaggbgggc aagaagcbgg ccgagcacbt ccccaacaag gccabcaagb 2820 
bcgccaagca cagcggcggc gaccbggaga bcaccaccca cagcbtcaac bgccgcggcg 2880 
agbbcbbcba cbgcaacacc agcagccbgb bcaacagcac cbacaccccc aacagcaccg 2940 
agaacabcac cggcaccgag aacagcabca bcaccabccc cbgccgcabc aagcagabca 3 000 
bcaacabgbg gcagggcgbg ggccgcgcca bgbacgcccc ccccatcgag ggcabccbga 3 060 
ccbgccgcag caacabcacc ggccbgctgc bgacccgcga cggcggcacc ggcabgcacg 312 0 
acaccgagab cbbccgcccc gagggcggcg acabgcgcga caacbggcgc agcgagcbgt 3180 
acaagtacaa ggbggbggag abcaagcccc bgggcabcgc ccccaccaag gccaagcbba 3240 
ccgbgcaggb gcgccagcbg cbgagcggca bcgbgcagca gcagagcaac cbgcbgcgcg 3300 
ccabcgaggc ccagcagcac abgcbgcagc bgaccgbgbg gggcabcaag cagcbgcaga 3360 
cccgcgbgcb ggccabcgag cgcbacctgc gcgaccagca gcbcgaggac atcbggaaca 3420 
acabgaccbg gatgcagtgg gaccgcgaga bcagcaacba caccaacacc abcbaccgcc 3480 
bgcbggagga cagccagaac cagcaggaga agaacgagca ggacctgcbg gcccbggaca 3540 
agbggcagaa cctgbggacc bggbbcggca bcaccaacbg gcbgbggbga acacgbggga 3600 
bccagabcbg cbgbgccbbc bagbbgccag ccabcbgtbg bbbgccccbc ccccgbgccb 3660 
bcctbgaccc bggaaggbgc cacbcccacb gtccbbbccb aabaaaabga ggaaabbgca 3720 
bcgcabbgbc bgagbaggbg tcabbcbabb cbggggggbg gggbggggca ggacagcaag 3780 
ggggaggatb gggaagacaa tagcaggcab gcbggggabg cggbgggcbc babgggtacc 3840 
caggbgcbga agaabbgacc cggbbccbcc bgggccagaa agaagcaggc acabccccbt 3900 
cbctgbgaca cacccbgbcc acgccccbgg bbcbbagbbc cagccccacb cabaggacac 3960 
bcabagcbca ggagggctcc gccbbcaabc ccacccgcba aagbacbbgg agcggbcbcb 4020 
cccbcccbca bcagcccacc aaaccaaacc bagccbccaa gagbgggaag aaabbaaagc 4080 
aagabaggcb abbaagtgca gagggagaga aaabgccbcc aacabgbgag gaagtaabga 4140 
gagaaabcab agaabbbtaa ggccatgabb baaggccabc abggcctbaa bctbccgcbb 4200 
ccbcgcbcac bgacbcgcbg cgcbcggbcg bbcggcbgcg gcgagcggba bcagcbcacb 4260 
caaaggcggb aabacggbba bccacagaab caggggabaa cgcaggaaag aacatgbgag 4320 
caaaaggcca gcaaaaggcc aggaaccgba aaaaggccgc gbbgcbggcg bbbbbccaba 4380 
ggcbccgccc cccbgacgag catcacaaaa abcgacgctc aagbcagagg bggcgaaacc 444 0 
cgacaggacb abaaagabac caggcgbbbc ccccbggaag cbcccbcgbg cgctcbccbg 4500 
bbccgacccb gccgctbacc ggabaccbgb ccgcctbbcb cccbbcggga agcgbggcgc 4560 
bbbcbcabag cbcacgcbgb aggtabcbca gbbcggbgba ggbcgbbcgc bccaagcbgg 4620 
gcbgbgbgca cgaacccccc gbbcagcccg accgcbgcgc cbtabccggb aacbabcgbc 4680 
bbgagbccaa cccggbaaga cacgacbbab cgccacbggc agcagccacb ggbaacagga 474 0 
btagcagagc gaggbabgba ggcggbgcba cagagtbcbb gaagbggbgg ccbaacbacg 4800 
gcbacacbag aagaacagba tbbggbabcb gcgcbcbgcb gaagccagbb accbbcggaa 4860 
aaagagbbgg bagcbcbbga bccggcaaac aaaccaccgc bggbagcggb ggbbbbbtbg 4920 
bbbgcaagca gcagabbacg cgcagaaaaa aaggabcbca agaagabccb bbgabcbbbb 4980 
cbacggggbc bgacgcbcag bggaacgaaa acbcacgbba agggattbbg gbcabgagab 5040. 
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batcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatcb 5100 
aaagtabata tgagfcaaact tggtctgaca gttaccaatg ctbaabcagb gaggcaccta 5160 
tctcagcgab cbgbcbabbb cgbbcatcca bagbtgccbg actcgggggg ggggggcgcb 5220 
gaggbcbgcc bcgbgaagaa ggbgbtgctg actcabacca ggccbgaabc gccccabcab 5280 
ccagccagaa agbgagggag ccacggbbga bgagagcbbt gtbgbaggbg gaccagbbgg 5340 
bgatbbbgaa cbbbbgcbbb gccacggaac ggtcfcgcgbb gtcgggaaga bgcgtgabcb 5400 
gabccbbcaa cbcagcaaaa gbbcgabtta bbcaacaaag ccgccgtccc gbcaagtcag 5460 
cgbaabgcbc bgccagtgbb acaaccaatb aaccaatbcb gabtagaaaa acbcabcgag 5520 
cabcaaabga aacbgcaabfc tatbcababc aggabbabca ataccatatb bbtgaaaaag 5580 
ccgbbbc'bgb aabgaaggag aaaacbcacc gaggcagbtc cabaggabgg caagabccbg 5640 
gbabcggbcb gcgabbccga cbcgbccaac abcaabacaa ccbabbaabb tcccctcgtc 5700 
aaaaabaagg bbabcaagbg agaaabcacc abgagbgacg acbgaatccg gbgagaatgg 5760 
caaaagcbba bgcabbbcbb tccagacttg bbcaacaggc cagccatbac gcbcgtcabc 5820 
aaaatcacbc gcabcaacca aaccgbbatb cabbcgbgat bgcgcctgag cgagacgaaa 5880 
tacgcgabcg cbgbbaaaag gacaabbaca aacaggaabc gaatgcaacc ggcgcaggaa 5940 
cacbgccagc gcabcaacaa tatbbbcacc bgaabcagga tabbcttcba abacctggaa 6000 
tgcbgbbbbc ccggggabcg cagbggbgag taaccabgca tcatcaggag tacggabaaa 6060 
abgcbbgabg gbcggaagag gcabaaabtc cgbcagccag btbagbctga ccabctcabc 6120 
bgbaacabca bbggcaacgc baccbbtgcc abgbbbcaga aacaacbctg gcgcatcggg 6180 
cbbcccabac aabcgabaga tbgbcgcacc tgabbgcccg acabbabcgc gagcccabbt 6240 
atacccabat aaabcagcab ccabgbbgga abbbaabcgc ggccbcgagc aagacgbbbc 6300 
ccgbbgaaba bggcbcabaa caccccbtgb abtacbgbbb atgbaagcag acagbbtbat 6360 
bgbbcabgab gabababbbb babcbbgtgc aabgbaacab cagagabbbt gagacacaac 6420 
gbggcbbbcc cccccccccc atbabbgaag cabbbabcag ggbbabbgbc bcabgagcgg 6480 
abacababbb gaabgbabbb agaaaaabaa acaaabaggg gbbccgcgca cabbbccccg 6540 
aaaagbgcca ccbgacgbcb aagaaaccab babbabcatg acabbaaccb abaaaaabag 6600 
gcgtabcacg aggcccbbbc gbc 6623 



<210> 99 
<211> 6542 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 

■ 

<400> 99 

bcgcgcgtbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggtg bcggggcbgg cbbaactabg cggcabcaga gcagabbgba cbgagagbgc 180 
accababgcg gbgbgaaata ccgcacagab gcgbaaggag aaaataccgc abcagatbgg 24 0 
cbatbggcca bbgcatacgb tgbatccaba bcabaatabg bacabbbaba tbggcbcabg 300 
bccaacabba ccgccabgtb gacabbgabb abbgacbagb babbaabagb aabcaabbac 360 
ggggbcabba gbbcabagcc cabababgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 4 80 
cabagbaacg ccaabaggga cbbtccabbg acgbcaabgg gbggagbabb tacggbaaac 54 0 
bgcccacbbg gcagbacatc aagbgtabca babgccaagb acgcccccba tbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbbabggg acbbbccbac 660 
bbggcagbac abcbacgbab bagbcabcgc babbaccabg gbgabgcggb tbbggcagba 720 
cabcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagbcbcc accccabbga 7 80 
cgbcaabggg agbbbgbbtb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 84 0 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb ababaagcag 900 
agcbcgbtba gbgaaccgtc agabcgccbg gagacgccab ccacgcbgbb tbgaccbcca 960 
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tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca ccccttfcggc 1080 
fccttatgcab gcbatactgb ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
battggbgac gatactttcc attactaatc cabaacatgg ctctttgcca caactatctc 1260 
babbggcbat atgccaatac bcbgbccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa bcbcgggtac gbgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggbagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtatfcctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataahag ctgacagach aacagactgt tcctttccat gggtctbttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc bacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2 04 0 
gcgtgcccgt gtggaaggag gccaagacca ccctgttcbg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgcbggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagccfcgaag ccctgcgbga 22 80 
agctgacccc cctgtgcgtg accctgcact gcagcaaccg atcgatcaac tgcaacacca 2340 
gcgccatcac ccaggcctgc cccaaggtga gcttcgaccc catccccatc cactactgcg 2400 
cccccgccgg ctacgccatc ctgaagtgca acaacaagac cttcaacggc accggcccct 24 60 
gcaacaacgt gagcaccatc cagtgcaccc acggcaccaa gcccgtggtg agcacccagc 2520 
tgctgctgaa cggcagcctg gccgaggagg agatcatcat ccgcagcaag aacctgaccg 2580 
acaacgtgaa gaccatcatc gtgcacctga acgagagcgt ggagatcaac tgcacccgcc 264 0 
gatcggccca ctgcaacatc agccgcaccg cctggaacaa gaccctgcag gaggtgggca 2700 
agaagctggc cgagcacttc cccaacaagg ccatcaagtt cgccaagcac agcggcggcg 2760 
acctggagat caccacccac agcttcaact gccgcggcga gbtcttctac tgcaacacca 2 820 
gcagcctgtt caacagcacc tacaccccca acagcaccga gaacatcacc ggcaccgaga 2880 
acagcatcat caccatcccc tgccgcatca agcagatcat caacatgtgg cagggcgtgg 2940 
gccgcgccat gtacgccccc cccatcgagg gcatcctgac ctgccgcagc aacatcaccg 3000 
gcctgctgct gacccgcgac ggcggcaccg gcatgcacga caccgagatc ttccgccccg 3060 
agggcggcga catgcgcgac aacfcggcgca gcgagcbgfca caagtacaag gtggtggaga 3120 
tcaagcccct gggcatcgcc cccaccaagg ccaagcttac cgtgcaggtg cgccagctgc 3180 
tgagcggcat cgtgcagcag cagagcaacc tgctgcgcgc catcgaggcc cagcagcaca 3240 
fcgctgcagct gaccgtgtgg ggcatcaagc agctgcagac ccgcgtgctg gccatcgagc 3300 
gctacctgcg cgaccagcag ctcgaggaca tctggaacaa catgacctgg atgcagtggg 3360 
accgcgagat cagcaactac accaacacca tctaccgcct gctggaggac agccagaacc 3420 
agcaggagaa gaacgagcag gacctgctgg ccctggacaa gtggcagaac ctgtggacct 34 80 
ggttcggcat caccaactgg ctgtggtgaa cacgtgggat ccagatctgc tgtgccttct 3540 
agttgccagc catctgttgt tbgccccbcc cccgtgccbt ccbbgaccct ggaaggtgcc 3600 
acbcccactg tcctttccta ataaaatgag gaaabbgcat cgcattgbct gagtaggtgt 3660 
cattctattc tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat 3720 
agcaggcatg cbggggatgc ggtgggcbct atgggfcaccc aggbgctgaa gaabtgaccc 3780 
ggbtccbcct gggccagaaa gaagcaggca catcccctbc tcbgtgacac accctgtcca 3840 
cgccccbggb fccttagtbcc agccccactc ataggacact catagctcag gagggcbccg 3900 
ccttcaatcc cacccgctaa agbacttgga gcggbctctc cctccctcat cagcccacca 3960 
aaccaaacct agcctccaag agtgggaaga aabbaaagca agataggcta btaagtgcag 4020 
agggagagaa aatgccbcca acabgbgagg aagtaatgag agaaatcata gaattttaag 4080 
gccatgattfc aaggccatca tggccbtaab cttccgctbc ctcgctcact gactcgcbgc 4140 
gctcggtcgt fccggctgcgg cgagcggbat cagctcactc aaaggcggta atacggttat 4200 
ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 4260 
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ggaaccgfcaa aaaggccgcg ttgctggcgt bbbtccabag gctccgcccc cctgacgagc 4320 
abcacaaaaa tcgacgctca agbcagaggt ggcgaaaccc gacaggacta baaagabacc 4380 
aggcgtbbcc ccctggaagc tccctcgtgc gcbcbccbgb tccgaccctg ccgcttaccg 4440 
gatacctgtc cgcctbfcctc ccttcgggaa gcgtggcgct ttctcatagc bcacgcbgba 4500 
ggbabctcag tbcggbgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 4560 
ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 4620 
acgacttatc gccactggca gcagccacbg gtaacaggafc tagcagagcg aggtatgtag 4680 
gcggtgctac agagttctbg aagtggtggc ctaactacgg ctacactaga agaacagtat 4740 
ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcbbgat 4800 
ccggcaaaca aaccaccgct ggbagcggbg gbbtbbtbgb ttgcaagcag cagabbacgc 4860 
gcagaaaaaa aggatctcaa gaagabccbb bgabcttttc bacggggtct gacgcbcagb 4920 
ggaacgaaaa cbcacgbbaa gggattbtgg bcatgagabt atcaaaaagg atcttcacct 4980 
agatcctttt aaattaaaaa bgaagttbba aatcaatcta aagtatatat gagbaaacbb 5040 
ggtctgacag bbaccaabgc bbaatcagbg aggcacctat ctcagcgatc tgtctatttc 5100 
gttcatccat agttgcctga cbcggggggg gggggcgctg aggtctgcct cgtgaagaag 5160 
gtgttgctga ctcataccag gcctgaatcg ccccatcatc cagccagaaa gtgagggagc 5220 
cacggbbgab gagagctbtg ttgtaggtgg accagbbggt gattbtgaac ttttgctttg 5280 
ccacggaacg gtctgcgttg tcgggaagab gcgbgabctg atccbbcaac bcagcaaaag 5340 
ttcgabbtat bcaacaaagc cgccgtcccg tcaagtcagc gbaabgctcb gccagtgtba 5400 
caaccaatta accaatbcbg abbagaaaaa ctcatcgagc abcaaatgaa acbgcaatbt 5460 
abtcabatca ggabbatcaa taccatabtb tbgaaaaagc cgbbtctgta atgaaggaga 5520 
aaactcaccg aggcagtbcc abaggabggc aagatccbgg babcggbcbg cgattccgac 5580 
bcgbccaaca tcaabacaac ctabbaabbb ccccbcgbca aaaataaggt batcaagtga 5640 
gaaatcacca bgagtgacga ctgaabccgg tgagaabggc aaaagcttat gcattbcttt 5700 
ccagacbbgt bcaacaggcc agccabtacg cbcgtcatca aaatcactcg catcaaccaa 5760 
accgbbabbc abbcgbgabb gcgccbgagc gagacgaaab acgcgatcgc bgbbaaaagg 5820 
acaatbacaa acaggaatcg aabgcaaccg gcgcaggaac acbgccagcg catcaacaat 5880 
abbttcaccb gaabcaggab abbcbbctaa bacctggaab gctgttttcc cggggabcgc 5940 
agtggtgagb aaccabgcat catcaggagt acggataaaa bgcbtgatgg tcggaagagg 6000 
cabaaatbcc gbcagccagt bbagbctgac cabcbcatct gbaacabcat bggcaacgct 6060 
accbttgcca tgttbcagaa acaactcbgg cgcatcgggc btcccataca abcgatagab 6120 
bgbcgcaccb gatbgcccga cabbabcgcg agcccabtba bacccataba aabcagcabc 6180 
catgttggaa ttbaabcgcg gccbcgagca agacgtbbcc cgbbgaabab ggcbcataac 6240 
accccbtgta btacbgtbta tgtaagcaga cagtttbatt gtbcatgabg atatabbtbb 6300 
atcbtgbgca atgbaacabc agagabttbg agacacaacg bggcbtbccc ccccccccca 6360 
bbattgaagc abbbabcagg gbbabbgbcb cabgagcgga bacatabbbg aabgbabbba 6420 
gaaaaabaaa caaabagggg ttccgcgcac abbbccccga aaagtgccac cbgacgbcba 6480 
agaaaccatb atbabcabga cabbaaccba baaaaabagg cgbabcacga ggcccbbbcg 6540 
tc 6542 



<210> 100 
<211> 6473 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 100 

tcgcgcgbbt cggtgabgac ggtgaaaacc bctgacacab gcagcbcccg gagacggbca 60 
cagcbbgbct gtaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggtg bcggggcbgg ctbaacbabg cggcabcaga gcagabtgba ctgagagbgc 180 
accabatgcg gtgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
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ctattggcca btgcabacgb bgbabccaba bcabaabafcg tacathtata bbggctcatg 300 

tccaacatta ccgccatgtt gacattgatt abbgacbagb babbaabagb aatcaattac 360 

ggggbcatba gttcatagcc cabababgga gbbccgcgbb acabaacbba cggtaaatgg 420 

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt bacggbaaac 540 

bgcccactbg gcagbacabc aagtgbatca tatgccaagh acgcccccta ttgacgtcaa 600 

tgacggtaaa bggcccgccb ggcattatgc ccagtacahg accttatggg acbbbccbac 660 

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt bbbggcagba 720 

catcaatggg cgbggabagc ggtttgactc acggggabbb ccaagtctcc accccattga 780 

cgtcaatggg agbbbgbbbb ggcaccaaaa tcaacgggac bbbccaaaat gbcgbaacaa 840 

ctccgcccca ttgacgcaaa tgggcggtag gcgfcgtacgg tgggaggtct atataagcag 900 

agcbcgbtba gtgaaccgtc agabcgccbg gagacgccat ccacgcbgbt bbgaccbcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 

tccccgtgcc aagagbgacg taagtaccgc chatagactc tataggcaca ccccbtbggc 1080 

tcttatgcat gcbabacbgb ttttggcttg gggcctatac acccccgctt ccbbatgcba 1140 

baggbgabgg tatagcttag cctataggtg bgggbbabbg accabbabbg accactcccc 1200 

babbggbgac gabacbttcc attactaatc cataacatgg ctctbtgcca caacbabcbc 1260 

tatbggctab abgccaabac bctgbcctbc agagacbgac acggacbctg babbbbtaca 1320 

ggabggggbc ccabbbabba tbbacaaatt cacababaca acaacgccgt cccccgbgcc 1380 

cgcagbbtbb abbaaacata gcgbgggabc bccacgcgaa bcbcgggbac gbgbbccgga 1440 

cabgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 

agcggcbcab ggbcgcbcgg cagcbccbbg ctccbaacag bggaggccag acbbaggcac 1560 

agcacaatgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgtgbcb 1620 

gaaaabgagc gbggagabbg ggctcgcacg gctgacgcag abggaagact taaggcagcg 1680 

gcagaagaag abgcaggcag cbgagbbgbt gbabbcbgab aagagbcaga ggtaacbccc 1740 

gbbgcggbgc tgbtaacggb ggagggcagb gbagbcbgag cagbacbcgb bgcbgccgcg 1800 

cgcgccacca gacabaabag cbgacagact aacagacbgt bccbbbccab gggbcbbbbc 1860 

bgcagbcacc gbcgbcgaca cgtgbgabca gababcgcgg ccgcbcbaga gababcgcca 1920 

ccabgcgcgb ggagggcabc cagcgcaact ggaagcagtg gbggabcbgg ggcabccbgg 1980 

gcbbcbggab ggtgabgatc bacaacgbgc gcggcaaccb gbgggbgacc gbgbacbacg 2040 

gcgbgcccgb gbggaaggag gccaagacca cccbgbbcbg cgccagcgac gccaaggccb 2100 

acgacgccga ggbgcacaac gbgbgggcca cccacgccbg cgbgcccacc gaccccaacc 2160 

cccaggagab ggbgcbggag aacgbgaccg agaacbbcaa cabgbgggag aacgacabgg 2220 

tggagcagab gcaccaggac abcabcagcc tgtgggacca gagccbgaag cccbgcgbga 2280 

agcbgacccc ccbgtgcgbg acccbgcacb gcagcaaccg abcgabcaac bgcaacacca 2340 

gcgccabcac ccaggccbgc cccaaggbga gctbcgaccc cabccccabc cacbacbgcg 2400 

cccccgccgg cbacgccabc cbgaagbgca acaacaagac cbbcaacggc accggccccb 2460 

gcaacaacgb gagcaccabc cagbgcaccc acggcaccaa gcccgbggbg agcacccagc 2520 

bgcbgcbgaa cggcagccbg gccgaggagg agabcabcab ccgcagcaag aacctgaccg 2580 

acaacgbgaa gaccabcabc gbgcaccbga acgagagcgb ggagabcaac bgcacccgcc 2640 

gabcggccca cbgcaacabc agccgcaccg ccbggaacaa gacccbgcag gaggbgggca 2700 

agaagcbggc cgagcacbtc cccaacaagg ccabcaagbb cgccaagcac agcggcggcg 2760 

accbggagat caccacccac agcbbcaacb gccgcggcga gbbcbbcbac bgcaacaccc 2820 

gabcgabccc cbgccgcatc aagcagabca bcaacabgbg gcagggcgbg ggccgcgcca 2880 

bgbacgcccc ccccatcgag ggcabccbga cctgccgcag caacabcacc ggccbgcbgc 2940 

bgacccgcga cggcggcacc ggcabgcacg acaccgagab cbtccgcccc gagggcggcg 3 000 

acabgcgcga caacbggcgc agcgagcbgb acaagbacaa ggbggbggag abcaagcccc 3060 

tgggcabcgc ccccaccaag gccaagctba ccgbgcaggt gcgccagcbg cbgagcggca 3120 

tcgbgcagca gcagagcaac cbgcbgcgcg ccabcgaggc ccagcagcac abgcbgcagc 3180 

bgaccgbgbg gggcatcaag cagcbgcaga cccgcgbgcb ggccabcgag cgcbacctgc 3240 

gcgaccagca gcbcgaggac abctggaaca acabgaccbg gabgcagbgg gaccgcgaga 33 0 0 

bcagcaacba caccaacacc abcbaccgcc bgcbggagga cagccagaac cagcaggaga 3360 

agaacgagca ggacctgctg gcccbggaca agtggcagaa ccbgbggacc tggbbcggca 3 420 

tcaccaacbg gcbgbggbga acacgbggga bccagabcbg cbgbgccbbc bagbbgccag 3480 

ccabcbgbbg tbbgcccctc ccccgbgccb bccbbgaccc bggaaggbgc cacbcccacb 3540 
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gbccfcbbccb aataaaatga ggaaattgca bcgcabbgbc bgagbaggbg bcabbcbabb 3600 
cbggggggbg gggbggggca ggacagcaag ggggaggabb gggaagacaa tagcaggcab 3660 
gctggggatg cggbgggcbc tatgggtacc caggtgctga agaattgacc cggbbccbcc 3720 
tgggccagaa agaagcaggc acabccccbb ctctgtgaca cacccbgbcc acgccccbgg 3780 
ttcttagttc cagccccact cataggacac tcatagctca ggagggctcc gccbbcaabc 3840 
ccacccgcta aagtacttgg agcggtctct ccctcccbca tcagcccacc aaaccaaacc 3900 
tagcctccaa gagtgggaag aaatbaaagc aagataggct attaagtgca gagggagaga 3960 
aaatgcctcc aacatgtgag gaagtaatga gagaaabcab agaabbbbaa ggccatgatt 4020 
baaggccabc atggccbbaa bcbbccgcbb ccbcgcbcac bgacbcgcbg cgcbcggbcg 40B0 
tbcggcbgcg gcgagcggba bcagcbcacb caaaggcggb aabacggbba bccacagaab 4140 
caggggabaa cgcaggaaag aacatgbgag caaaaggcca gcaaaaggcc aggaaccgba 4200 
aaaaggccgc gbbgcbggcg bbbttccata ggctccgccc cccbgacgag cabcacaaaa 4260 
abcgacgcbc aagtcagagg bggcgaaacc cgacaggacb abaaagabac caggcgbbbc 4320 
ccccbggaag cbcccbcgbg cgcbcbccbg bbccgacccb gccgcbbacc ggataccbgb 4380 
ccgccbbbct ccctbcggga agcgbggcgc bbbcbcabag cbcacgcbgt aggbabcbca 4440 
gbbcggbgta ggbcgbbcgc bccaagcbgg gcbgbgbgca cgaacccccc gbbcagcccg 4500 
accgcbgcgc cbbabccggb aacbabcgbc bbgagbccaa cccggbaaga cacgacbtab 4560 
cgccacbggc agcagccacb ggbaacagga bbagcagagc gaggtabgba ggcggbgcba 4620 
cagagbbcbb gaagbggbgg cctaacbacg gcbacacbag aagaacagba bbbggbabct 4680 
gcgcbcbgcb gaagccagbb accbbcggaa aaagagbbgg bagcbcbbga bccggcaaac 4740 
aaaccaccgc bggbagcggb ggbbbbbbbg bbbgcaagca gcagabbacg cgcagaaaaa 4800 
aaggabcbca agaagabccb bbgabcbbbb cbacggggbc bgacgcbcag bggaacgaaa 4860 
actcacgbta agggabbbbg gbcabgagab babcaaaaag gatcbbcacc bagabccbbb 4920 
baaabbaaaa abgaagbbbb aaabcaabcb aaagbababa bgagbaaacb bggbcbgaca 4980 
gbbaccaabg cbbaabcagb gaggcaccba bcbcagcgab cbgbcbabbt cgbbcatcca 5040 
bagbbgccbg acbcgggggg ggggggcgcb gaggbcbgcc bcgbgaagaa ggbgbbgcbg 5100 
acbcabacca ggccbgaabc gccccabcab ccagccagaa agbgagggag ccacggbbga 5160 
bgagagcbbb gbbgbaggbg gaccagbbgg bgabbbbgaa cbbbbgcttb gccacggaac 5220 
ggbcbgcgbb gbcgggaaga bgcgbgabcb gabccbbcaa cbcagcaaaa gbbcgatbba 5280 
bbcaacaaag ccgccgbccc gbcaagbcag cgbaabgcbc tgccagbgbb acaaccaabb 5340 
aaccaabbcb gabbagaaaa acbcabcgag cabcaaabga aacbgcaabb babbcababc 5400 
aggabbabca abaccababb bbbgaaaaag ccgbbbcbgb aabgaaggag aaaacbcacc 5460 
gaggcagbbc cabaggabgg caagabccbg gbabcggbcb gcgabbccga cbcgbccaac 5520 
abcaabacaa ccbabbaabb bccccbcgbc aaaaabaagg bbabcaagbg agaaabcacc 5580 
abgagbgacg acbgaatccg gbgagaabgg caaaagcbba bgcabtbcbb bccagacbbg 5640 
bbcaacaggc cagccabbac gcbcgbcabc aaaabcacbc gcabcaacca aaccgbbatb 5700 
cabbcgbgab bgcgccbgag cgagacgaaa bacgcgabcg cbgbbaaaag gacaabbaca 5760 
aacaggaabc gaabgcaacc ggcgcaggaa cacbgccagc gcabcaacaa babbbbcacc 5820 
bgaabcagga babbcbbcba abacctggaa bgcbgbbbbc ccggggabcg cagbggbgag 5880 
baaccabgca bcabcaggag bacggabaaa abgcbbgabg gbcggaagag gcabaaabbc 5940 
cgbcagccag bbbagbcbga ccabcbcatc bgtaacabca bbggcaacgc baccbbbgcc 6000 
abgbbbcaga aacaacbcbg gcgcabcggg cbbcccabac aabcgabaga bbgbcgcacc 6060 
bgabbgcccg acabbabcgc gagcccabtb abacccabab aaabcagcab ccabgbbgga 6120 
abbbaatcgc ggcctcgagc aagacgbbbc ccgbbgaaba bggcbcabaa caccccbbgb 6180 
abbacbgbbt abgbaagcag acagbbbbab bgtbcabgab gabababbbb babcbbgbgc 6240 
aabgbaacab cagagabbbt gagacacaac gbggcbbbcc cccccccccc abbabbgaag 6300 
cabbbabcag ggbbabbgbc bcabgagcgg abacabatbb gaabgbabbb agaaaaabaa 6360 
acaaabaggg gbbccgcgca cabbbccccg aaaagbgcca ccbgacgbcb aagaaaccab 6420 
babbabcabg acatbaaccb abaaaaabag gcgbabcacg aggcccbbbc gbc 6473 



<210> 101 
<211> 6554 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 101 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 1B0 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
99»tggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggcfccat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc cctgtgcgtg accctgcact gcagcaaccg atcgatcaac tgcaacacca 2340 
gcgccatcac ccaggcctgc cccaaggtga gcttcgaccc catccccatc cactactgcg 2400 
cccccgccgg ctacgccatc ctgaagtgca acaacaagac cttcaacggc accggcccct 2460 
gcaacaacgt gagcaccatc cagtgcaccc acggcaccaa gcccgtggtg agcacccagc 2520 
tgctgctgaa cggcagcctg gccgaggagg agatcatcat ccgcagcaag aacctgaccg 2580 
acaacgtgaa gaccatcatc gtgcacctga acgagagcgt ggagatcaac tgcacccgcc 2640 
ccaacaacaa cacccgcaag agcatccgca tcggccccgg ccaggccttc tacgccaccg 2700 
gcgagatcat cggcgacatc cgccaggccc actgcaacat cagccgcacc gcctggaaca 2760 
agaccctgca ggaggtgggc aagaagctgg ccgagcactt ccccaacaag gccatcaagt 2820 
tcgccaagca cagcggcggc gacctggaga tcaccaccca cagcttcaac tgccgcggcg 2 880 
agttcttcta ctgcaacacc cgatcgatcc cctgccgcat caagcagatc atcaacatgb 2940 



WO 02/32943 . PCT/US01/25721 

ggcagggcgt gggccgcgcc atgtacgccc cccccatcga gggcabccbg acctgccgca 3 000 

gcaacatcac cggccbgcbg ctgacccgcg acggcggcac cggcatgcac gacaccgaga 3060 

bcbbccgccc cgagggcggc gacatgcgcg acaactggcg cagcgagcbg tacaagtaca 3120 

aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccaagctt accgtgcagg 3180 

tgcgccagct gctgagcggc atcgtgcagc agcagagcaa cctgctgcgc gccatcgagg 3240 

cccagcagca catgctgcag ctgaccgtgt ggggcabcaa gcagctgcag acccgcgtgc 3 3 00 

tggccatcga gcgctacctg cgcgaccagc agctcgagga catctggaac aacabgaccb 3360 

ggatgcagtg ggaccgcgag abcagcaacb acaccaacac catctaccgc ctgctggagg 3420 

acagccagaa ccagcaggag aagaacgagc aggacctgct ggccctggac aagtggcaga 3480 

acctgbggac cbggbbcggc abcaccaacb ggctgtggtg aacacgtggg atccagatct 354 0 

gctgtgcctt ctagttgcca gccatctgtt gbbbgccccb cccccgtgcc ttccttgacc 3600 

ctggaaggtg ccacbcccac tgtcctttcc baabaaaabg aggaaattgc atcgcattgt 3660 

ctgagtaggt gtcabbcbab tctggggggt ggggbggggc aggacagcaa gggggaggat 3 720 

tgggaagaca atagcaggca bgcbggggab gcggtgggct ctatgggtac ccaggtgctg 3 780 

aagaattgac ccggttcctc ctgggccaga aagaagcagg cacatcccct tctctgtgac 3 84 0 

acacccbgbc cacgccccbg gttcttagtt ccagccccac bcabaggaca ctcatagctc 3 900 

aggagggctc cgccbbcaab cccacccgct aaagtacttg gagcggtctc tccctccctc 3 960 

atcagcccac caaaccaaac ctagcctcca agagtgggaa gaaattaaag caagataggc 4 02 0 

tattaagtgc agagggagag aaaabgccbc caacatgtga ggaagbaabg agagaaatca 4 080 

tagaatttta aggccatgat ttaaggccat cafcggcctta atcttccgct tcctcgctca 414 0 

ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 4 200 

baatacggbb atccacagaa tcaggggata acgcaggaaa gaacabgbga gcaaaaggcc 4260 

agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gttttbccat aggctccgcc 4320 

cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 4380 

babaaagaba ccaggcgbbb ccccctggaa gctccctcgt gcgctctcct gttccgaccc 4440 

tgccgcttac cggatacctg tccgccbbbc bcccbbcggg aagcgtggcg cbbbctcata 4500 ' 

gctcacgctg taggbatctc agthcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 4560 

acgaaccccc cgbbcagccc gaccgctgcg ccttatccgg taactatcgt cbbgagbcca 4 620 

acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 4680 

cgaggbabgb aggcggbgcb acagagttct bgaagbggbg gcctaactac ggctacacta 4740 

gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 4 800 

gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 4 860 

agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt: tctacggggt 4920 

ctgacgcfcca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 4980 

ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 5 040 

atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga 5100 

tctgtctatt tcgttcatcc atagttgcct gactcggggg gggggggcgc tgaggtctgc 5160 

ctcgtgaaga aggtgttgct gactcatacc aggcctgaat cgccccabca bccagccaga 5220 

aagbgaggga gccacggbbg abgagagcbb bgbbgbaggb ggaccagbbg gbgabbbbga 528 0 

acbbfetgcbt bgccacggaa cggbctgcgt bgtcgggaag abgcgbgatc bgabccbbca 5340 

acbcagcaaa agbbcgabbt abbcaacaaa gccgccgbcc cgtcaagbca gcgtaatgcb 5400 

cbgccagbgb bacaaccaab taaccaabbc bgabbagaaa aacbcabcga gcabcaaabg 5460 

aaacbgcaab bbabbcabab caggabbatc. aafcaccatab tbbbgaaaaa gccgbbbcbg 5520 

baabgaagga gaaaactcac cgaggcagbb ccabaggabg gcaagabccb ggbabcggbc 5580 

bgcgabbccg acbcgtccaa cabcaabaca accbabtaab tbccccbcgb caaaaabaag 5640 

gbbabcaagb gagaaabcac cabgagbgac gacbgaabcc ggtgagaabg gcaaaagcbb 5700 

atgcabbbcb tbccagacbb gbbcaacagg ccagccabba cgcbcgbcab caaaabcacb 5760 

cgcabcaacc aaaccgtbab tcabtcgbga bcgcgccbga gcgagacgaa abacgcgatc 5820 

gcbgbbaaaa ggacaabbac aaacaggaab cgaabgcaac cggcgcagga acacbgccag 58 80 

cgcabcaaca ababbbbcac cbgaabcagg ababbcbbcb aataccbgga abgcbgbbbb 5940 

cccggggabc gcagbggbga gbaaccabgc abcabcagga gbacggabaa aabgcbbgab 6000 

ggbcggaaga ggcabaaabb ccgbcagcca gbtbagbcbg accabcbcab cbgbaacabc 6060 

atbggcaacg cbaccbtbgc cabgbbbcag aaacaacbcb ggcgcabcgg gcbtcccata 6120 

caatcgabag abbgbcgcac cbgabbgccc gacabtabcg cgagcccabb babacccaba 6180 

baaabcagca bccabgtbgg aabbbaabcg cggccbcgag caagacgtbb cccgbbgaab 624 0 
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atggctcata acaccccttg tattactgtt tatgtaagca gacagtttta ttgttcatga 6300 

tgatatattt ttatcttgtg caatgtaaca tcagagattt tgagacacaa cgtggctttc 6360 

cccccccccc cattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt 6420 

tgaatgtatt tagaaaaata aacaaatagg ggbfcccgcgc acatfctcccc gaaaagtgcc 6480 

acctgacgtc baagaaacca tbattatcat gacattaacc tataaaaata ggcgtatcac 6540 

gaggcccttt cgtc S5S4 



<210> 102 
<211> 6671 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 102 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 18 00 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
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tggagcagab gcaccaggac abcabcagcc 
agctgacccc cctgtgcgtg accctgcact 
acatgaccac cgaggtgcgc gacaagcgcg 
acatcgtgcc ccbgaagaac gagagcagca 
gcaacaccag cgccatcacc caggcctgcc 
actacbgcgc ccccgccggc tacgccatcc 
ccggcccctg caacaacgtg agcaccabcc 
gcacccagct gchgctgaac ggcagcctgg 
acctgaccga caacgtgaag accatcatcg 
gcacccgccg atcggcccac tgcaacatca 
aggtgggcaa gaagcbggcc gagcactbcc 
gcggcggcga ccbggagatc accacccaca 
gcaacaccag cagcctgttc aacagcacct 
gcaccgagaa cagcatcatc accatcccct 
agggcgtggg ccgcgccatg tacgcccccc 
acatcaccgg cctgctgctg acccgcgacg 
tccgccccga gggcggcgac atgcgcgaca 
tggtggagat caagcccctg ggcatcgccc 
gccagcbgcb gagcggcatc gtgcagcagc 
agcagcacat gctgcagctg accgtgtggg 
ccatcgagcg ctacctgcgc gaccagcagc 
tgcagtggga ccgcgagatc agcaacbaca 
gccagaacca gcaggagaag aacgagcagg 
tgbggaccbg gttcggcatc accaactggc 
gtgccttcta gttgccagcc atcbgbtgbt 
gaaggtgcca ctcccactgt ccbbtccbaa 
agtaggtgtc attcbattct ggggggtggg 
gaagacaata gcaggcatgc tggggatgcg 
aabbgacccg gttcctcctg ggccagaaag 
ccctgtccac gcccctggtt cbbagtbcca 
agggctccgc cttcaatccc acccgctaaa 
agcccaccaa accaaaccba gcctccaaga 
taagtgcaga gggagagaaa atgcctccaa 
aabbbbaagg ccatgabbta aggccatcab 
actcgcbgcg ctcggbcgtt cggctgcggc 
tacggbtatc cacagaatca ggggataacg 
aaaaggccag gaaccgbaaa aaggccgcgb 
ctgacgagca tcacaaaaab cgacgctcaa 
aaagatacca ggcgttbccc cctggaagcb 
cgcbbaccgg abacctgtcc gcctttctcc 
cacgcbgtag gtatcbcagb tcggbgbagg 
aaccccccgt bcagcccgac cgctgcgcct 
cggtaagaca cgacbbatcg ccacbggcag 
ggtatgtagg cggtgctaca gagtbctbga 
gaacagtatt tggtatctgc gctctgctga 
gctcbbgabc cggcaaacaa accaccgctg 
agatbacgcg cagaaaaaaa ggabcbcaag 
acgcbcagtg gaacgaaaac tcacgbbaag 
tcttcaccta gabccbbtba aabtaaaaab 
agbaaacfctg gtcbgacagt taccaabgct 
gtctabbbcg ttcabccaba gbtgcctgac 
gbgaagaagg bgbbgcbgac bcabaccagg 
tgagggagcc acggbbgatg agagcttbgb 
ttbgcbttgc cacggaacgg bctgcgbbgb 
cagcaaaagb bcgatbbabb caacaaagcc 



tgtgggacca gagccbgaag cccbgcgtga 2280 
gcagcaaccg atcgaagaac tgcagcbtca 2340 
agaaggtgca cgccctgtbc taccgcctgg 2400 
acaccagcgg cgacbaccgc cbgabcaacb 2460 
ccaaggtgag cbbcgacccc afcccccatcc 2520 
bgaagbgcaa caacaagacc bbcaacggca 2580 
agbgcaccca cggcaccaag cccgtggtga 2640 
ccgaggagga gabcatcatc cgcagcaaga 2700 
bgcaccbgaa cgagagcgbg gagabcaacb 2760 
gccgcaccgc cbggaacaag accctgcagg 2820 
ccaacaaggc cabcaagbbc gccaagcaca 2880 
gcttcaactg ccgcggcgag btctbcbacb 2940 
acacccccaa cagcaccgag aacatcaccg 3000 
gccgcabcaa gcagatcabc aacatgtggc 3060 
ccabcgaggg catcctgacc tgccgcagca 3120 
gcggcaccgg cabgcacgac accgagatct 3180 
acbggcgcag cgagctgtac aagbacaagg 3240 
ccaccaaggc caagcttacc gtgcaggbgc 3300 
agagcaacct gcbgcgcgcc atcgaggccc 3360 
gcabcaagca gcbgcagacc cgcgbgctgg 3420 
bcgaggacab cbggaacaac atgaccbgga 34 80 
ccaacaccab cbaccgccbg cbggaggaca 3540 
accbgcbggc cctggacaag tggcagaacc 3 600 
tgtggbgaac acgtgggatc cagabcbgcb 3660 
bgcccctccc ccgtgccttc cbtgaccctg 3720 
baaaabgagg aaabbgcatc gcabtgtcbg 3780 
gbggggcagg acagcaaggg ggaggatbgg 3840 
gbgggcbcba bgggbaccca ggbgcbgaag 3900 
aagcaggcac abcccctbcb cbgbgacaca 3960 
gccccactca taggacacbc abagcbcagg 4020 
gbacbtggag cggtctcbcc ctcccbcabc 4080 
gbgggaagaa abbaaagcaa gataggcbat 4140 
cabgbgagga agbaabgaga gaaabcabag 4 200 
ggccbbaabc bbccgctbcc tcgctcacbg 4 260 
gagcggbabc agctcacbca aaggcggbaa 4320 
caggaaagaa catgtgagca aaaggccagc 4380 
bgcbggcgbb bbtccabagg cbccgccccc 4440 
gbcagaggbg gcgaaacccg acaggacbab 4500 
cccbcgbgcg cbctccbgbt ccgacccbgc 4560 
cbtcgggaag cgbggcgcbt tcbcabagcb 4620 
tcgbbcgcbc caagcbgggc tgtgtgcacg 4680 
tabccggbaa cbabcgtcbt gagbccaacc 4740 
cagccacbgg taacaggabt agcagagcga 4 800 
agbggtggcc taacbacggc tacacbagaa 4860 
agccagbbac cbtcggaaaa agagbbggba 4920 
gbagcggbgg tbbbbbbgbt bgcaagcagc 4980 
aagabccbtb gabctbbtcb acggggbcbg 5040 
ggabbbtggb cabgagabba tcaaaaagga 5100 
gaagbbbtaa abcaatcbaa agbatababg 5160 
baabcagbga ggcaccbabc tcagcgabcb 5220 
tcgggggggg ggggcgctga ggbcbgccbc 52 80 
cctgaabcgc cccabcatcc agccagaaag 5340 
bgbaggbgga ccagbtggbg abtbbgaacb 5400 
cgggaagabg cgbgatctga bccbbcaacb 5460 
gccgbcccgb caagbcagcg baabgcbcbg 5520 
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ccagtgttac aaccaattaa ccaattcbga btagaaaaac tcatcgagca tcaaatgaaa 5580 
ctgcaattta ttcatatcag gabbabcaab accababbtt bgaaaaagcc gtttctgtaa 5640 
tgaaggagaa aactcaccga ggcagttcca taggatggca agatcctggt atcggtctgc 5700 
gattccgact cgtccaacat caatacaacc babbaabbtc cccbcgbcaa aaataaggtt 5760 
atcaagtgag aaatcaccat gagtgacgac tgaatccggt gagaatggca aaagcttatg 5820 
cabbbcbbbc cagacttgtt caacaggcca gccattacgc bcgbcabcaa aatcactcgc 5880 
atcaaccaaa ccgbbabbca ttcgtgattg cgcctgagcg agacgaaata cgcgabcgcb 5940 
gttaaaagga caattacaaa caggaatcga atgcaaccgg cgcaggaaca ctgccagcgc 6000 
atcaacaata ttttcacctg aatcaggata ttcttctaat accbggaabg cbgtbtbccc 6060 
Ogggatcgca gtggtgagta accatgcatc atcaggagta cggataaaat gcbtgabggb 6120 
cggaagaggc ataaabbccg bcagccagbb tagtctgacc atctcatctg taacatcatt 6180 
ggcaacgcta cctttgccat gtttcagaaa caacbctggc gcatcgggcb tcccatacaa 6240 
bcgabagabb gtcgcacctg attgcccgac attatcgcga gcccabttab acccatataa 6300 
abcagcatec atgbtggaab bbaabcgcgg cctcgagcaa gacgbbtccc gbtgaabatg 6360 
gcbcabaaca ccccbbgbab bacbgtbbab gbaagcagac agbbbbatbg btcabgabga 6420 
bababbbbba bcbbgbgcaa bgbaacabca gagatbbbga gacacaacgt ggcbbbcccc 64 80 
ccccccccab babbgaagca bttabcaggg bbabbgbcbc abgagcggab acatabbbga 6540 
atgbabbbag aaaaabaaac aaabaggggb bccgcgcaca bbtccccgaa aagbgccacc 6600 
bgacgbcbaa gaaaccabba bbabcabgac abbaacctab aaaaabaggc gtabcacgag 6660 
gcccbbbcgb c 6671 



<210> 103 
<211> 6602 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/9 conbaining HIV genes 
<400> 103 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gtaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 12 0 
tbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgba cbgagagbgc 180 
accabatgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
cbabtggcca bbgcabacgb bgbabccaba bcabaababg bacabbbata bbggctcabg 300 
bccaacabba ccgccabgbb gacabbgabb abbgacbagb batbaabagb aabcaabbac 360 
ggggtcabba gbbcabagcc cabababgga gbbccgcgbb . acabaacbta cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 480 
cabagbaacg ccaataggga cbbbccabbg acgbcaatgg gbggagbatb bacggbaaac 540 
bgcccacbbg gcagbacabc aagbgbabca babgccaagb acgcccccta bbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbtabggg acbbbccbac 660 
bbggcagbac abcbacgbab bagbcabcgc babbaccabg gbgabgcggb bbbggcagba 720 
catcaabggg cgbggabagc ggbbbgactc acggggabtb ccaagbcbcc accccabbga 780 
cgtcaatggg agbbbgbbbb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
ctccgcccca ttgacgcaaa bgggcggbag gcgtgbacgg bgggaggbcb ababaagcag 900 
agcbcgbbba gbgaaccgbc agabcgccbg gagacgccab ccacgctgbb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggab 1020 
bccccgbgcc aagagbgacg baagbaccgc cbabagacbc tabaggcaca cccctbbggc 1080 
bctbabgcab gcbabacbgb btbbggcbbg gggccbabac acccccgcbb ccbbabgcba 1140 
baggbgabgg tabagcbtag ccbabaggbg bgggbtabbg accabbabbg accacbcccc 1200 
babbggbgac gabacbbbcc abbacbaatc cabaacabgg cbcbbbgcca caacbabcbc 1260 
babbggcbat abgccaatac bcbgbccbtc agagacbgac acggactcbg babbbbbaca 1320 
ggabggggbc ccabbbabba bbbacaaabb cacababaca acaacgccgb cccccgbgcc 1380 
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cgcagbtbbt attaaacata gcgtgggatc 
catgggcbct tctccggtag cggcggagct 
agcggctcat ggtcgctcgg cagctccbtg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtfc 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagbcacc gtcgbcgaca cgtgtgatca 
ccatgcgcgt ggagggcatc cagcgcaact 
gcttctggat ggtgatgatc tacaacgtgc 
gcgtgcccgt gbggaaggag gccaagacca 
acgacgccga ggtgcacaac gbgtgggcca 
cccaggagat ggtgctggag aacgtgaccg 
tggagcagat gcaccaggac atcatcagcc 
agctgacccc cctgtgcgtg accctgcact 
acatgaccac cgaggtgcgc gacaagcgcg 
acatcgtgcc cctgaagaac gagagcagca 
gcaacaccag cgccatcacc caggcctgcc 
actactgcgc ccccgccggc tacgccatcc 
ccggcccctg caacaacgtg agcaccatcc 
gcacccagct gctgctgaac ggcagcctgg 
acchgaccga caacgtgaag accatcatcg 
gcacccgccg atcggcccac bgcaacatca 
aggbgggcaa gaagctggcc gagcacttcc 
gcggcggcga cctggagatc accacccaca 
gcaacacccg atcgatcccc tgccgcatca 
gccgcgccat gtacgccccc cccatcgagg 
gcctgcbgcb gacccgcgac ggcggcaccg 
agggcggcga catgcgcgac aactggcgca 
tcaagcccct gggcatcgcc cccaccaagg 
tgagcggcat cgtgcagcag cagagcaacc 
bgcbgcagcb gaccgtgtgg ggcatcaagc 
gctacctgcg cgaccagcag ctcgaggaca 
accgcgagat cagcaactac accaacacca 
agcaggagaa gaacgagcag gacctgctgg 
ggttcggcat caccaactgg ctgtggtgaa 
agttgccagc cabcbgtbgb ttgcccctcc 
actcccactg tcctttccta ataaaatgag 
cattcbabbc tggggggtgg ggtggggcag 
agcaggcatg ctggggatgc ggtgggctct 
ggttcctcct gggccagaaa gaagcaggca 
cgccccbggt tcbbagfctcc agccccacbc 
ccttcaatcc cacccgctaa agtacttgga 
aaccaaacct agcctccaag agtgggaaga 
agggagagaa aatgcctcca acatgtgagg 
gccatgattt aaggccatca bggccbbaat 
gctcggtcgt tcggctgcgg cgagcggtat 
ccacagaatc aggggataac gcaggaaaga 
ggaaccgtaa aaaggccgcg ttgctggcgfc 
atcacaaaaa tcgacgctca agbcagaggb 
aggcgtttcc ccctggaagc tccctcgtgc 
gatacctgtc cgcctttctc cctbcgggaa 
ggtatctcag ttcggbgtag gtcgtbcgct 
ttcagcccga ccgctgcgcc ttatccggta 



tccacgcgaa tctcgggtac gtgttccgga 144 0 
tccacatccg agccctggtc ccatgcctcc 1500 
ctcctaacag bggaggccag acfctaggcac 1560 
cacaaggccg tggcggtagg gtatgtgtct 1620 
gcbgacgcag atggaagact taaggcagcg 1680 
gtattctgab aagagtcaga ggtaactccc 1740 
gtagtctgag cagtactcgt tgctgccgcg 1800 
aacagacbgb tcctttccat gggtcttctc 1860 
gatatcgcgg ccgctctaga gatatcgcca 1920 
ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcggcaacct gtgggtgacc gtgtactacg 2040 
ccctgttchg cgccagcgac gccaaggcct 2100 
cccacgcctg cgtgcccacc gaccccaacc 2160 
agaacttcaa catgtgggag aacgacatgg 2220 
fcgtgggacca gagcctgaag ccctgcgtga 2280 
gcagcaaccg atcgaagaac tgcagcttca 234 0 
agaaggtgca cgccctgttc taccgcctgg 2400 
acaccagcgg cgactaccgc ctgatcaact 2460 
ccaaggtgag ctbcgacccc atccccatcc 2520 
tgaagtgcaa caacaagacc ttcaacggca 2580 
agtgcaccca cggcaccaag cccgtggtga 264 0 
ccgaggagga gatcatcatc cgcagcaaga 2700 
tgcacctgaa cgagagcgtg gagatcaact 2760 
gccgcaccgc ctggaacaag accctgcagg 2820 
ccaacaaggc catcaagttc gccaagcaca 2880 
gcttcaactg ccgcggcgag ttcttctact 2940 
agcagatcat caacatgtgg cagggcgtgg 3000 
gcatcctgac ctgccgcagc aacatcaccg 3060 
gcatgcacga caccgagatc ttccgccccg 3120 
gcgagctgta caagtacaag gtggtggaga 3180 
ccaagcttac cgtgcaggtg cgccagctgc 3240 
tgctgcgcgc,catcgaggcc cagcagcaca 3300 
agctgcagac ccgcgtgctg gccatcgagc 3360 
tctggaacaa catgacctgg atgcagtggg 3420 
tctaccgcct gctggaggac agccagaacc 3480 
ccctggacaa gtggcagaac ctgfcggacct 3540 
cacgtgggab ccagatcbgc tgbgccbtct 3600 
cccgbgccbb cctbgaccct ggaaggtgcc 3660 
gaaattgcat cgcattgbct gagtaggtgt 3720 
gacagcaagg gggaggattg ggaagacaat 3780 
abgggtaccc aggtgcbgaa gaattgaccc 3840 
cabccccfctc tctgbgacac accctgtcca 3900 
abaggacacb cabagcbcag gagggcbccg 3960 
gcggtctctc cctccctcat cagcccacca 4020 
aabtaaagca agabaggcba tbaagtgcag 4080 
aagtaatgag agaaatcata gaattttaag 4140 
cbtccgcbbc ctcgcbcacb gactcgcbgc 4200 
cagcbcactc aaaggcggta abacggbtat 426.0 
acatgtgagc aaaaggccag caaaaggcca 4320 
bbbtccatag gctccgcccc ccbgacgagc 4380 
ggcgaaaccc gacaggacba baaagabacc 4440 
gctctccbgb bccgacccbg ccgctbaccg 4500 
gcgtggcgct btcbcatagc bcacgctgta 4560 
ccaagctggg cbgtgtgca.c gaaccccccg 4 620 
actatcgbct bgagtccaac ccggbaagac 4680 
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acgacttatc gccactggca gcagccactg gbaacaggab bagcagagcg aggtatgtag 4740 

gcggtgctac agagttcttg aagtggtggc ctaactacgg cbacactaga agaacagtat 4800 

bbggbabcbg cgcbcbgcbg aagccagbba ccttcggaaa aagagbtggb agctcttgat 4860 

ccggcaaaca aaccaccgcb ggbagcggbg gbbbbbbbgb btgcaagcag cagabbacgc 4920 

gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgcbcagb 4980 

ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg abcbbcaccb 5040 

agabccbttt aaattaaaaa tgaagtttta aatcaatcta aagtabatab gagfcaaactt 5100 

ggtctgacag ttaccaatgc bbaabcagbg aggcacctab .cbcagcgabc bgbctabbbc 5160 

gttcatccat agbbgccbga ctcggggggg gggggcgctg aggtctgcct cgbgaagaag 5220 

gtgttgctga cbcataccag gcctgaatcg ccccatcatc cagccagaaa gtgagggagc 5280 

cacggttgat gagagcttbg ttgtaggtgg accagbbggb gabbbbgaac bbbbgcbbbg 5340 

ccacggaacg gbcbgcgbbg tcgggaagat gcgtgatctg atcctbcaac bcagcaaaag 5400 

bbcgabbtat tcaacaaagc cgccgtcccg bcaagtcagc gbaabgctcb gccagbgbba 5460 

caaccaatba accaabtcbg abbagaaaaa cbcatcgagc abcaaabgaa acbgcaabbt 5520 

abbcababca ggabtatcaa taccatabtb bbgaaaaagc cgttbctgba abgaaggaga 5580 

aaacbcaccg aggcagtbcc abaggabggc aagabccbgg batcggbcbg cgabbccgac 5640 

bcgbccaaca bcaabacaac ctabbaabbb cccctcgbca aaaataaggb babcaagbga 5700 

gaaabcacca bgagbgacga cbgaatccgg bgagaabggc aaaagcbbab gcabbbcbtb 5760 

ccagacbtgt bcaacaggcc agccatbacg cbcgbcatca aaabcacbcg cabcaaccaa 5820 

accgtbattc abtcgbgabb gcgcctgagc gagacgaaab acgcgabcgc tgbbaaaagg 5880 

acaabbacaa acaggaatcg aabgcaaccg gcgcaggaac actgccagcg cabcaacaab 5940 

abbbbcacct gaabcaggab abbcbbcbaa baccbggaab gcbgbtbbcc cggggatcgc 6000 

agbggbgagb aaccabgcab cabcaggagb acggabaaaa tgcbbgabgg bcggaagagg 6060 

cabaaabtcc gbcagccagb btagbcbgac cabcbcabcb gtaacabcab bggcaacgcb 6120 

accbbtgcca bgbtbcagaa acaacbcbgg cgcabcgggc tbcccabaca abcgatagab 6180 

bgbcgcaccb gabbgcccga catbabcgcg agcccattba tacccababa aabcagcabc 6240 

cabgbtggaa bbbaabcgcg gccbcgagca agacgtttcc cgbtgaabab ggcbcataac 6300 

accccbbgba bbacbgbbba bgbaagcaga cagbbbtabb gbbcabgabg abababtbbb 6360 

abcbbgbgca abgbaacabc agagabbbbg agacacaacg bggcbbbccc ccccccccca 6420 

bbabbgaagc abbbabcagg gbbabbgtcb cabgagcgga tacababbbg aabgtatbba 6480 

gaaaaabaaa caaabagggg bbccgcgcac abbbccccga aaagtgccac cbgacgtcba 6540 

agaaaccabb abbabcabga cabbaaccba baaaaatagg cgbabcacga ggcccbbbcg 6600 
be 6602 



<210> 104 
<211> 6683 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 104 

begegegbbb eggbgabgae ggtgaaaacc bebgacacab gcagcbcccg gagaeggbca 60 
cagcbbgbcb gtaageggab geegggagea gacaagcccg beagggegeg beagegggbg 120 
btggcgggbg beggggebgg ebbaacbabg eggcabcaga gcagabbgta cbgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gegbaaggag aaaabacege abcagatbgg 24 0 
cbabbggcca bbgcabacgb bgbabecaba bcabaababg bacabbbata bbggcbcabg 300 
becaacabba ccgccabgbb gacabbgabb abbgacbagb batbaabagb aabcaatbac 360 
ggggbcabba gbbcabagee cabababgga gbbccgcgbb acabaacbta eggtaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgtatgtbcc 480 
cabagbaacg ccaabaggga ebbbecabbg aegbcaabgg gbggagbatb baeggbaaac 54 0 
bgcccacbbg gcagbacabc aagtgbabca babgecaagb acgcccccta bbgacgtcaa 600 
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bgacggbaaa tggcccgcct ggcattatgc ccagbacabg accttatggg acbtbcctac 660 
bbggcagbac atctacgtat bagbcabcgc tabbaccabg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agbbtgbbbb ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgbgbacgg bgggaggbcb atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccab ccacgctgbt ttgacctcca 960 
bagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcabbg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gcbabactgb ttttggcttg gggcctatac acccccgcbb ccbbabgcta 1140 
baggbgabgg babagcbbag cctataggtg bgggtbabbg accattattg accactcccc 12 00 
tattggtgac gabacbtbcc attactaatc cabaacabgg ctctttgcca caactatctc 1260 
tattggctat abgccaabac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacababaca acaacgccgt cccccgtgcc 1380 > 
cgcagbbtbb attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgbbccgga 1440 
catgggctcb bcbccggbag cggcggagct tccacatccg agccctggtc ccahgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gbabgbgtct 1620 
gaaaatgagc gbggagatbg ggctcgcacg gctgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag abgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc bgttaacggb ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggbcbttbc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc bacaacgbgc gcggcaaccb gtgggtgacc gtgtacbacg 204 0 
gcgtgcccgt gbggaaggag gccaagacca ccctghtchg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc cctgtgcgtg accctgcact gcagcaaccg atcgaagaac tgcagcttca 234 0 
acatgaccac cgaggtgcgc gacaagcgcg agaaggtgca cgccctgttc taccgcctgg 24 00 
acatcgtgcc cctgaagaac gagagcagca acaccagcgg cgactaccgc ctgatcaact 24 60 
gcaacaccag cgccatcacc caggcctgcc ccaaggtgag cttcgacccc atccccatcc 2520 
actactgcgc ccccgccggc tacgccatcc tgaagtgcaa caacaagacc ttcaacggca 2580 
ccggcccctg caacaacgtg agcaccatcc agtgcaccca cggcaccaag cccgtggtga 2 640 
gcacccagct gctgctgaac ggcagcctgg ccgaggagga gatcatcatc cgcagcaaga 2 700 
acctgaccga caacgtgaag accatcatcg tgcacctgaa cgagagcgtg gagatcaact 2760 
gcacccgccc caacaacaac acccgcaaga gcatccgcat cggccccggc caggccttct 2820 
acgccaccgg cgagatcatc ggcgacatcc gccaggccca ctgcaacatc agccgcaccg 2 880 
cctggaacaa gacccbgcag gaggtgggca agaagctggc cgagcacttc cccaacaagg 2940 
ccatcaagtt cgccaagcac agcggcggcg acctggagat caccacccac agcttcaact 3 000 
gccgcggcga gfctcttctac tgcaacaccc gatcgatccc ctgccgcatc aagcagatca 3060 
tcaacatgtg gcagggcgtg ggccgcgcca tgtacgcccc ccccatcgag ggcatcctga 3120 
cctgccgcag caacatcacc ggcctgctgc tgacccgcga cggcggcacc ggcatgcacg 3180 
acaccgagat cttccgcccc gagggcggcg acatgcgcga caactggcgc agcgagctgt 324 0 
acaagbacaa ggbggtggag atcaagcccc tgggcatcgc ccccaccaag gccaagctta 3300 
ccgtgcaggt gcgccagctg ctgagcggca tcgtgcagca gcagagcaac ctgctgcgcg 3360 
ccatcgaggc ccagcagcac atgctgcagc fcgaccgtgtg gggcatcaag cagctgcaga 3420 
cccgcgtgct ggccatcgag cgctacctgc gcgaccagca gctcgaggac atctggaaca 34 80 
acatgacctg gahgcagtgg gaccgcgaga hcagcaacta caccaacacc atctaccgcc 3540 
tgctggagga cagccagaac cagcaggaga agaacgagca ggacctgctg gccctggaca 3600 
agtggcagaa cctgtggacc tggttcggca tcaccaactg gctgtggtga acacgtggga 3660 
tccagatctg ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 3720 
tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca 3780 
tcgcattgtc tgaghaggtg bcattctatt ctggggggtg gggtggggca ggacagcaag 3840 
ggggaggatt gggaagacaa tagcaggcat gctggggatg cggtgggctc tabgggtacc 3900 
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caggtgctga agaabbgacc cggbbcctcc tgggccagaa agaagcaggc acabccccbb 3960 
cbcbgtgaca cacccbgbcc acgcccctgg ttcttagttc cagccccact cabaggacac 4020 
tcatagctca ggagggctcc gcctbcaabc ccacccgcta aagtacttgg agcggtctct 4080 
cccbccctca tcagcccacc aaaccaaacc tagcctccaa gagtgggaag aaattaaagc 4140 
aagataggct attaagtgca gagggagaga aaatgcctcc aacatgtgag gaagtaatga 4200 
gagaaatcat agaattttaa ggccatgatt taaggccatc atggccttaa bctbccgcbb 4260 
cctcgctcac tgactcgctg cgcbcggtcg ttcggctgcg gcgagcggta tcagchcact 4320 
caaaggcggt aabacggbba tccacagaat caggggataa cgcaggaaag aacatgtgag 4380 
caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg bbbbbccaba 4440 
ggctccgccc ccctgacgag catcacaaaa abcgacgcbc aagtcagagg tggcgaaacc 4500 
cgacaggacb ataaagatac caggcgtbtc cccctggaag cbcccbcgbg cgctctcctg 4560 
ttccgaccct gccgcttacc ggabaccbgt ccgccbbbcb cccttcggga agcgtggcgc 4620 
tbbctcabag ctcacgctgt aggbabcbca gttcggtgta ggbcgttegc tccaagctgg 4680 
gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt aacbabcgbc 4740 
bbgagbccaa cccggbaaga cacgacttab cgccactggc agcagccact ggbaacagga 4800 
bbagcagagc gaggtabgba ggcggbgcta cagagbbctb gaagtggtgg ccbaacbacg 4 860 
gctacactag aagaacagba tttggtabcb gcgcbcbgcb gaagccagtt accbbcggaa 4920 
aaagagttgg tagetcttga tccggcaaac aaaccaccgc tggtagcggt ggtbttttbg 4980 
bttgcaagca gcagatbacg cgcagaaaaa aaggatctca agaagatccb bbgatcttbb 5040 
cbacggggbc bgacgcbcag bggaacgaaa acbcacgbta agggabbbbg gbcabgagab 5100 
tatcaaaaag gatcttcacc bagatccbbt baaabbaaaa atgaagtttt aaatcaatct 5160 
aaagtataba tgagbaaacb tggtctgaca gttaccaatg cttaabcagt gaggcaccta 5220 
bctcagcgat ctgbcbabbb cgttcatcca tagbbgcctg actcgggggg ggggggcgct 5280 
gaggtctgcc tcgtgaagaa ggtgttgctg actcatacca ggcctgaatc gccccatcab 5340 
ccagccagaa agbgagggag ccacggbtga bgagagcbbt gbtgtaggtg gaccagttgg 5400 
bgabtbbgaa cttbtgcbbb gccacggaac ggtcbgcgbt gbcgggaaga tgcgtgatct 5460 
gatccttcaa ctcagcaaaa gttcgattta ttcaacaaag ccgccgtccc gbcaagtcag 5520 
cgbaabgcbc tgccagtgtb acaaccaatb aaccaabtcb gattagaaaa acbcabcgag 5580 
catcaaabga aacbgcaatb babbcatatc aggattabca abaccabatb tttgaaaaag 5640 
ccgttbcbgb aatgaaggag aaaactcacc gaggcagbbc cataggabgg caagabccbg 5700 
gtabcggbct gcgabbccga cbcgbccaac atcaatacaa cctatbaabb tccccbcgtc 5760 
aaaaataagg bbabcaagbg agaaabcacc abgagbgacg acbgaabccg gbgagaatgg 5820 
caaaagcbba tgcabbbcbb tccagacbtg ttcaacaggc cagccabtac gctcgtcatc 5880 
aaaatcacbc gcabcaacca aaccgttatb catbcgbgat bgcgccbgag cgagacgaaa 5940 
bacgcgabcg cbgbbaaaag gacaabtaca aacaggaatc gaabgcaacc ggcgcaggaa 6000 
cacbgccagc gcabcaacaa babttbcacc tgaabcagga batbcbbcba abaccbggaa 6060 
tgctgbbbbc ccggggabcg cagbggtgag taaccabgca bcabcaggag tacggabaaa 6120 
abgcbbgabg gtcggaagag gcabaaabbc cgtcagccag bbbagbcbga ccabcbcabc 6180 
tgbaacabca bbggcaacgc taccbbtgcc abgbbbcaga aacaacbcbg gcgcabcggg 6240 
cbbcccabac aabcgabaga tbgtcgcacc tgabbgcccg acabbatcgc gagcccatbt 6300 
abacccabat aaabcagcab ccatgbbgga abtbaabcgc ggccbcgagc aagacgttbc 6360 
ccgbtgaaba bggcbcabaa cacccctbgb abbactgbtb abgbaagcag acagbbbtab 6420 
bgbbcabgat gatababbtb babcbbgbgc aatgbaacab cagagabbbb gagacacaac 6480 
gbggcbbbcc cccccccccc abbabbgaag cabbbabcag ggbtatbgbc bcabgagcgg 6540 
abacababbt gaabgbabtb agaaaaabaa acaaataggg gbbccgcgca cabbtccccg 6600 
aaaagbgcca ccbgacgbct aagaaaccab babbatcabg acabbaaccb abaaaaabag 6660 
gcgbabcacg aggcccbbtc gbc 6683 



<210> 105 
<211> 6605 
<212> DNA 

<213> Arbificial Sequence 
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<220> 

<223> plasmid pVR1012x/e containing HIV genes 
<400> 105 

tcgcgcgtbt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcbtgtct gtaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggfcg 12 0 
ttggcgggtg tcggggctgg cbtaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca bbgcatacgb tgtabccaba bcabaabatg bacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgacbagt tattaatagt aatcaattac 360 
ggggtcatta gtbcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg bcaataatga cgtabgtbcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccba ttgacgtcaa 600 
bgacggbaaa bggcccgccb ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgbab bagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtbbgtbtt ggcaccaaaa tcaacgggac bttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggab 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc bataggcaca ccccbttggc 10 80 
tcttatgcat gctatactgt bbttggcbbg gggcctatac acccccgcbt ccttatgcta 1140 
taggtgatgg tatagcttag cctatagghg tgggttattg accattattg accactcccc 12 00 
tattggtgac gatactttcc abbactaatc cabaacabgg cbcttbgcca caactatctc 1260 
tatbggctat atgccaatac tctgtccttc agagactgac acggactctg babtttbaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 15 00 
agcggcbcat ggtcgctcgg cagcbcctbg ctccfcaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag afcggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgbt gtattctgat aagagbcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagbctgag cagtactcgt tgcbgccgcg 18 00 
cgcgccacca gacabaatag ctgacagacb aacagactgb tcctttccat gggtctbttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gabatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggabctgg ggcabccbgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 22 80 
agcbgacccc cctgbgcgtg accctgcact gcagcaaccg caccatcgac bacaacaacc 2340 
gcaccgacaa catgggcggc gagatcaaga acbgcagctt ccgatcgabc aactgcaaca 24 00 
ccagcgccat cacccaggcc tgccccaagg tgagctbcga ccccatcccc atccactacb 2460 
gcgcccccgc cggcbacgcc afccctgaagt gcaacaacaa gaccttcaac ggcaccggcc 2520 
cctgcaacaa cgtgagcacc atccagbgca cccacggcac caagcccgtg gtgagcaccc 25 80 
agctgctgct gaacggcagc cbggccgagg aggagabcat catccgcagc aagaacctga 2 64 0 
ccgacaacgt gaagaccabc abcgtgcacc tgaacgagag cgtggagabc aactgcaccc 27 00 
gccgatcggc ccactgcaac abcagccgca ccgcctggaa caagacccbg caggaggtgg 2760 
gcaagaagct ggccgagcac ttccccaaca aggccabcaa gttcgccaag cacagcggcg 2820 
gcgaccbgga gatcaccacc cacagcbtca actgccgcgg cgagttctbc bactgcaaca 2880 
ccagcagcct gtbcaacagc acctacaccc ccaacagcac cgagaacabc accggcaccg 2940 
agaacagcat cabcaccabc ccctgccgca bcaagcagat catcaacatg bggcagggcg 30 00 
bgggccgcgc catgbacgcc ccccccatcg agggcatcct gaccbgccgc agcaacatca 3060 
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ccggccbgcb gctgacccgc gacggcggca 
ccgagggcgg cgacatgcgc gacaactggc 
agabcaagcc cctgggcatc gcccccacca 
tgctgagcgg catcgtgcag cagcagagca 
acabgcbgca gctgaccgtg tggggcatca 
agcgctacct gcgcgaccag cagctcgagg 
gggaccgcga gatcagcaac tacaccaaca 
accagcagga gaagaacgag caggaccbgc 
cctggttcgg cabcaccaac tggctgtggt 
bctagbbgcc agccatctgt tgtttgcccc 
gccactccca cbgbcctbbc cbaataaaab 
tgtcattcta btctgggggg tggggtgggg 
aatagcaggc afcgctgggga tgcggtgggc 
cccggbbccb cctgggccag aaagaagcag 
ccacgcccct ggttcttagt tccagcccca 
ccgccttcaa tcccacccgc taaagtactt 
ccaaaccaaa cctagcctcc aagagtggga 
cagagggaga gaaaatgcct ccaacatgtg 
aaggccatga tttaaggcca tcatggcctb 
tgcgctcggt cgttcggctg cggcgagcgg 
babccacaga abcaggggab aacgcaggaa 
ccaggaaccg taaaaaggcc gcgttgctgg 
agcatcacaa aaatcgacgc tcaagtcaga 
accaggcgtt bcccccbgga agcbcccbcg 
ccggatacct gtccgccttt ctcccttcgg 
gtaggtatct cagttcggtg baggbcgttc 
ccgttcagcc cgaccgctgc gccttatccg 
gacacgactt atcgccactg gcagcagcca 
taggcggtgc tacagagttc ttgaagtggt 
tatttggtat cbgcgcbcbg ctgaagccag 
gatccggcaa acaaaccacc gctggtagcg 
cgcgcagaaa aaaaggatct caagaagatc 
agtggaacga aaactcacgt taagggattt 
cctagatcct tttaaattaa aaatgaagtt 
ctbggtcbga cagbbaccaa tgcttaatca 
ttcgbbcabc catagttgcc tgactcgggg 
aaggtgttgc tgacbcatac caggcctgaa 
agccacggtb gatgagagct ttgttgtagg 
ttgccacgga acggbcbgcg ttgtcgggaa 
aagttcgatt tattcaacaa agccgccgtc 
ttacaaccaa ttaaccaatt ctgattagaa 
btbabbcata tcaggattat caabaccaba 
agaaaacbca ccgaggcagb bccabaggab 
gactcgtcca acabcaabac aaccbabbaa 
bgagaaabca ccabgagbga cgacbgaabc 
bbbccagacb bgtbcaacag gccagccabb 
caaaccgbba btcatbcgtg abtgcgccbg 
aggacaabba caaacaggaa bcgaatgcaa 
aabatbtbca ccbgaabcag gababbcbbc 
cgcagbggtg agbaaccabg cabcabcagg 
aggcabaaab bccgbcagcc agbbtagbcb 
gcbacctbbg ccabgbbbca gaaacaacbc 
gabbgbcgca ccbgabbgcc cgacabbabc 
abccatgbbg gaabbbaabc gcggccbcga 
aacaccccbb gbabbacbgb tbabgbaagc 



ccggcatgca cgacaccgag atcbbccgcc 3120 
gcagcgagcb gbacaagbac aaggbggbgg 3180 
aggccaagcb baccgbgcag gbgcgccagc 3240 
accbgctgcg cgccabcgag gcccagcagc 3300 
agcagctgca gacccgcgbg ctggccabcg 3360 
acabcbggaa caacatgacc bggabgcagb 3420 
ccabcbaccg ccbgctggag gacagccaga 3480 
bggcccbgga caagbggcag aacctgbgga 3540 
gaacacgbgg gatccagabc bgcbgbgccb 3600 
tcccccgbgc cbtccbbgac ccbggaaggb 3660 
gaggaaabbg catcgcattg tcbgagbagg 3720 
caggacagca agggggagga tbgggaagac 3780 
bcbabgggta cccaggtgcb gaagaattga 3840 
gcacatcccc bbcbctgbga cacaccctgb 3900 
cbcabaggac acbcabagcb caggagggcb 3960 
ggagcggbcb cbcccbcccb catcagccca 4020 
agaaabbaaa gcaagatagg cbabbaagbg 4080 
aggaagbaat gagagaaabc abagaabtbb 4140 
aabcbtccgc bbccbcgcbc actgacbcgc 4200 
babcagcbca cbcaaaggcg gbaabacggb 4260 
agaacabgbg agcaaaaggc cagcaaaagg 4320 
cgbbbbtcca baggcbccgc cccccbgacg 4380 
ggbggcgaaa cccgacagga cbabaaagat 4440 
bgcgcbcbcc bgbbccgacc cbgccgctba 4500 
gaagcgbggc gctbbctcab agcbcacgcb 4560 
gctccaagcb gggcbgbgbg cacgaacccc 462 0 
gbaactabcg bcbtgagbcc aacccggbaa 4680 
cbggbaacag gabbagcaga gcgaggbabg 474 0 
ggccbaacba cggcbacact agaagaacag 4800 
tbaccbbcgg aaaaagagbb ggtagcbcbb 4860 
gbggbbbbbb bgbbbgcaag cagcagabba 4920 
cbtbgabcbb btcbacgggg bcbgacgcbc 4980 
bggbcabgag atbabcaaaa aggatcbtca 5040 
tbaaatcaat ctaaagbaba tabgagbaaa 5100 
gbgaggcacc babcbcagcg abcbgbcbab 5160 
999ggggg c 9 ctgaggbcbg ccbcgtgaag 5220 
bcgccccabc abccagccag aaagbgaggg 5280 
bggaccagbb ggtgabbtbg aacbbbtgct 5340 
gabgcgbgat ctgabccbbc aacbcagcaa 5400 
ccgbcaagbc agcgbaabgc tcbgccagbg 5460 
aaacbcatcg agcabcaaab gaaacbgcaa 5520 
bbttbgaaaa agccgbbbcb gbaabgaagg 5580 
ggcaagabcc bggbabcggb ctgcgabbcc 5640 
bbtccccbcg bcaaaaabaa ggbtabcaag 5700 
cggbgagaab ggcaaaagct babgcabbbc 5760 
acgcbcgbca bcaaaabcac bcgcabcaac 5820 
agcgagacga aabacgcgab cgcbgbtaaa 5880 
ccggcgcagg aacactgcca gcgcabcaac 5940 
baabaccbgg aabgctgbbb bcccggggab 6000 
agbacggaba aaabgcbtga bggbcggaag 6060 
gaccabctca bctgbaacab cabbggcaac 6120 
bggcgcatcg ggctbcccab acaatcgaba 6180 
gcgagcccab ttatacccab ataaabcagc 6240 
gcaagacgbb bcccgbbgaa babggcbcat 6300 
agacagbbbb abbgbtcabg atgabababt 6360 
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tttatcttgt gcaafcgtaac atcagagatt ttgagacaca acgtggcttt cccccccccc 6420 

ccattattga agcabttatc agggttattg tctcatgagc ggatacabat ttgaatgtat 6480 

ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt 6540 

ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt 6600 
tcgtc 6605 



<210> 106 
<211> 6605 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 106 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 4 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgfcgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 144 0 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 168 0 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 192 0 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 222 0 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
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agctgacccc cctgtgcgtg acccbgcacb gcagcaaccg caccatcgac tacaacaacc 2340 
gcaccgacaa catgggcggc gagatcaaga actgcagctt ccgatcgatc aactgcaaca 24 00 
ccagcgccat cacccaggcc tgccccaagg tgagcttcga ccccatcccc atccacbacb 2460 
gcgcccccgc cggctacgcc atcctgaagt gcaacaacaa gaccttcaac ggcaccggcc 2520 
cctgcaacaa cgbgagcacc atccagtgca cccacggcac caagcccgtg gtgagcaccc 2580 
agctgctgct gaacggcagc ctggccgagg aggagatcat catccgcagc aagaacctga 264 0 
ccgacaacgt gaagaccatc atcgtgcacc tgaacgagag cgtggagatc aactgcaccc 2700 
gccgatcggc ccacbgcaac atcagccgca ccgccbggaa caagacccbg caggaggtgg 2760 
gcaagaagcb ggccgagcac ttccccaaca aggccabcaa gbtcgccaag cacagcggcg 2820 
gcgacctgga gatcaccacc cacagcfctca actgccgcgg cgagbtctbc tactgcaaca 2880 
ccagcagcct gtfccaacagc acctacaccc ccaacagcac cgagaacatc accggcaccg 2940 
agaacagcat catcaccatc ccctgccgca tcaagcagab cafccaacatg bggcagggcg 3000 
tgggccgcgc cabgtacgcc ccccccatcg agggcatcct gacctgccgc agcaacatca 3060 
ccggccbgcb gctgacccgc gacggcggca ccggcatgca cgacaccgag atcttccgcc 3120 
ccgagggcgg cgacatgcgc gacaactggc gcagcgagcb gtacaagtac aaggtggtgg 3180 
agatcaagcc cctgggcatc gcccccacca aggccaagct taccgbgcag gtgcgccagc 3240 
bgcbgagcgg cabcgbgcag cagcagagca acctgctgcg cgccatcgag gcccagcagc 33 00 
acatgctgca gctgaccgtg tggggcatca agcagctgca gacccgcgtg ctggccatcg 3360 
agcgctacct gcgcgaccag cagctcgagg acatctggaa caacatgacc tggatgcagt 3420 
gggaccgcga gatcagcaac tacaccaaca ccatctaccg cctgctggag gacagccaga 3480 
accagcagga gaagaacgag caggacctgc bggcccbgga caagtggcag aacctgtgga 3540 
cctggttcgg catcaccaac tggctgtggt gaacacgtgg gatccagatc tgctgtgcct 3600 
bcbagbbgcc agccabcbgfc tgtbbgcccc hcccccgtgc cttccttgac cctggaaggb 3660 
gccactccca ctgtcctttc ctaataaaat gaggaaattg cabcgcabbg tctgagtagg 3720 
tgtcattcta ttctgggggg bggggbgggg caggacagca agggggagga ttgggaagac 3780 
aatagcaggc atgcbgggga tgcggtgggc tctatgggta cccaggtgct gaagaabbga 3 84 0 
cccggttcct cctgggccag aaagaagcag gcacatcccc ttctctgtga cacaccctgt 3900 
ccacgcccct ggttcttagt tccagcccca ctcataggac actcatagct caggagggct 3960 
ccgccttcaa tcccacccgc baaagbacfcb ggagcggtct ctccctccct catcagccca 4020 
ccaaaccaaa cctagcctcc aagagfcggga agaaattaaa gcaagatagg cbatbaagbg 4080 
cagagggaga gaaaabgccb ccaacabgtg aggaagtaat gagagaaatc abagaabbbb 414 0 
aaggccatga tttaaggcca tcatggcctt aatcttccgc ttcctcgctc actgactcgc 4200 
tgcgctcggt cgbbcggcbg cggcgagcgg batcagcbca ctcaaaggcg gbaabacggb 4260 
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 4 320 
ccaggaaccg taaaaaggcc gcgttgctgg cgtttbtcca taggcbccgc cccccbgacg 43 80 
agcatcacaa aaabcgacgc bcaagtcaga ggtggcgaaa cccgacagga ctataaagat 444 0 
accaggcgtb bcccccbgga agctcccbcg bgcgcbctcc tgbtccgacc ctgccgcbta 4500 
ccggabaccb gbccgccbbb cbccctbcgg gaagcgbggc gcbbbcbcab agcbcacgct 4560 
gbaggtatcb cagbtcggbg baggbcgtbc gcbccaagcb gggcbgbgbg cacgaacccc 4620 
ccgttcagcc cgaccgctgc gccbbabccg gbaactatcg tcbbgagbcc aacccggbaa 4680 
gacacgactb abcgccacbg gcagcagcca cbggbaacag gabbagcaga gcgaggbatg 4740 
baggcggbgc bacagagbbc bbgaagbggb ggcctaacba cggcbacacb agaagaacag 4 800 
babbbggbab cbgcgctcbg cbgaagccag bbaccbbcgg aaaaagagbb ggtagcbcbb 4860 
gabccggcaa acaaaccacc gctggbagcg gbggbbbbbb tgbbbgcaag cagcagabba 4920 
cgcgcagaaa aaaaggatct caagaagatc cbtbgabcbb tbcbacgggg bctgacgcbc 4980 
agbggaacga aaacbcacgt taagggabbb bggbcabgag abbabcaaaa aggatcbbca 5040 
ccbagabccb bbtaaabtaa aaatgaagtb bbaaabcaafc cbaaagbafca batgagbaaa 5100 
ctbggbcbga cagtbaccaa tgctbaatca gbgaggcacc babcbcagcg abctgbcbat 5160 
btcgbbcabc catagbbgcc tgacbcgggg ggggggggcg cbgaggbcbg cctcgbgaag 5220 
aaggtgbtgc bgactcabac caggccbgaa bcgccccabc abccagccag aaagbgaggg 52 80 
agccacggbb gatgagagcb tbgtbgbagg tggaccagtb ggbgabbbbg aacbbbbgcb 5340 
btgccacgga acggbcbgcg tbgtcgggaa gabgcgtgab cbgatccttc aacbcagcaa 5400 
aagtbcgatb batbcaacaa agccgccgbc ccgbcaagbc agcgtaabgc bctgccagbg 5460 
btacaaccaa bbaaccaabt cbgabtagaa aaacbcabcg agcabcaaab gaaacbgcaa 5520 
btbabbcaba bcaggatbab caataccaba bbbbbgaaaa agccgbbbcb gbaabgaagg 5580 
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agaaaactca ccgaggcagt bccataggab ggcaagatcc tggtatcggt ctgcgattcc 5640 
gacbcgbcca acabcaabac aacctattaa btbccccbcg bcaaaaabaa ggttatcaag 5700 
tgagaaatca ccatgagbga cgactgaabc cggtgagaat ggcaaaagct babgcabbbc 5760 
tttccagact tgttcaacag gccagccatt acgctcgtca tcaaaatcac tcgcatcaac 5820 
caaaccgtta ttcattcgtg abbgcgccbg agcgagacga aatacgcgat cgctgttaaa 5880 
aggacaabba caaacaggaa hcgaatgcaa ccggcgcagg aacactgcca gcgcatcaac 5940 
aatattttca cctgaatcag gatattcttc taataccbgg aatgctgttt tcccggggat 6000 
cgcagtggbg agtaaccatg catcatcagg agbacggaba aaatgcttga tggtcggaag 6060 
aggcataaat bccgbcagcc agtttagtct gaccatctca tctgtaacat cattggcaac 6120 
gcbaccbbbg ccabgbbbca gaaacaactc tggcgcatcg ggcttcccat acaatcgata 6180 
gattgtcgca cctgattgcc cgacattatc gcgagcccat ttatacccat abaaatcagc 6240 
atccatgttg gaatttaatc gcggcctcga gcaagacgtt tcccgttgaa tatggctcat 6300 
aacacccctt gtattactgt bbabgtaagc agacagtttt attgttcatg atgatatatt 6360 
tttatcttgt gcaatgtaac abcagagabb ttgagacaca acgtggcttt cccccccccc 6420 
ccattattga agcatttatc agggbtabbg tctcatgagc ggatacatat ttgaatgtat 6480 
ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgb 6540 
ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca cgaggcccbb 6600 
tcgbc 6605 



<210> 107 
<211> 6536 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 107 

bcgcgcgttt cggbgatgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgtcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
bbggcgggtg tcggggcbgg cbbaacbatg cggcabcaga gcagattgba ctgagagbgc 180 
accababgcg gbgtgaaata ccgcacagab gcgbaaggag aaaabaccgc atcagatbgg 240 
ctatbggcca tbgcatacgt bgbatccaba bcabaababg bacabtbata btggcbcabg 300 
bccaacabta ccgccabgbb gacabtgabb abtgacbagt babbaabagb aatcaatbac 360 
ggggbcabta gbbcatagcc cabababgga gbtccgcgtb acabaacbba cggbaaabgg 420 
cccgccbggc tgaccgccca acgacccccg cccabbgacg bcaataabga cgtabgtbcc 480 
cabagtaacg ccaabaggga cbbtccabbg acgbcaabgg gtggagbabb bacggbaaac 54 0 
tgcccacbbg gcagbacabc aagtgbatca batgccaagb acgcccccba bbgacgtcaa 600 
bgacggbaaa tggcccgccb ggcabbabgc ccagbacabg accbbabggg acbbbccbac 660 
bbggcagbac atcbacgbab bagtcabcgc batbaccabg gtgabgcggt tbbggcagba 720 
cabcaabggg cgbggabagc ggbttgactc acggggabbb ccaagtcbcc accccatbga 780 
cgbcaabggg agbbbgbbbb ggcaccaaaa bcaacgggac tttccaaaat gtcgtaacaa 840 
cbccgcccca tbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbct abataagcag 900 
agctcgbbba gbgaaccgbc agabcgccbg gagacgccab ccacgcbgtt bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcattg gaacgcggat 1020 
bccccgbgcc aagagbgacg taagbaccgc cbabagacbc babaggcaca cccctbbggc 1080 
bctbabgcab gcbabactgb bttbggcbtg gggccbabac acccccgctb ccbbabgcba 1140 
baggbgabgg babagctbag ccbabaggtg bgggbbabbg accabbabtg accacbcccc 1200 
tabbggbgac gabacbbbcc atbacbaatc cabaacabgg cbctbtgcca caactatcbc 1260 
babbggcbab abgccaabac bcbgtccbtc agagactgac acggactcbg babbtbbaca 1320 
ggabggggbc ccabbbabba bbbacaaabb cacababaca acaacgccgt cccccgbgcc 1380 
cgcagbbtbb atbaaacaba gcgbgggabc bccacgcgaa bcbcgggbac gtgtbccgga 1440 
cabgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
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agcggcbcab ggtcgctcgg cagctcchtg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagattg ggcbcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gbtgcggbgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcaghcacc gtcgtcgaca cgtgtgatca 
ccatgcgcgt ggagggcatc cagcgcaact 
gcttctggat ggbgabgatc tacaacgtgc 
gcgtgcccgt gtggaaggag gccaagacca 
acgacgccga ggtgcacaac gtgtgggcca 
cccaggagat ggtgctggag aacghgaccg 
bggagcagab gcaccaggac atcatcagcc 
agctgacccc ccbgtgcgbg accctgcact 
gcaccgacaa catgggcggc gagatcaaga 
ccagcgccab cacccaggcc tgccccaagg 
gcgcccccgc cggctacgcc atcctgaagt 
ccbgcaacaa cgtgagcacc atccagtgca 
agcbgcbgcb gaacggcagc ctggccgagg 
ccgacaacgt gaagaccatc atcgbgcacc 
gccgatcggc ccactgcaac atcagccgca 
gcaagaagcb ggccgagcac ttccccaaca 
gcgacctgga gatcaccacc cacagcttca 
cccgatcgat cccctgccgc atcaagcaga 
ccatgtacgc cccccccatc gagggcabcc 
bgcbgacccg cgacggcggc accggcatgc 
gcgacatgcg cgacaactgg cgcagcgagc 
cccbgggcab cgcccccacc aaggccaagc 
gcatcgtgca gcagcagagc aacctgctgc 
agctgaccgt gtggggcatc aagcagctgc 
tgcgcgacca gcagctcgag gacatctgga 
agatcagcaa ctacaccaac accatctacc 
agaagaacga gcaggaccbg ctggccctgg 
gcatcaccaa ctggctgtgg tgaacacgtg 
cagccatctg ttgtttgccc ctcccccgtg 
actgtccttt cctaataaaa tgaggaaatt 
attctggggg gbggggbggg gcaggacagc 
catgctgggg atgcggtggg ctctatgggt 
tcctgggcca gaaagaagca ggcacatccc 
tggttcttag ttccagcccc actcatagga 
atcccacccg ctaaagtact tggagcggtc 
acctagccbc caagagtggg aagaaattaa 
agaaaatgcc bccaacabgb gaggaagbaa 
atttaaggcc atcatggccb taatcttccg 
tcgttcggct gcggcgagcg gbabcagctc 
aatcagggga taacgcagga aagaacabgb 
gtaaaaaggc cgcgttgctg gcgtttttcc 
aaaabcgacg cbcaagtcag aggfcggcgaa 
bbcccccbgg aagctccctc gtgcgcbcbc 
bgbccgccbb bcbccctfccg ggaagcgtgg 
tcagttcggb gtaggbcgtb cgctccaagc 
ccgaccgctg cgccbbabcc ggbaactabc 
tabcgccacb ggcagcagcc acbggbaaca 
ctacagagtb cbbgaagbgg bggccbaacb 
bcbgcgctcb gcbgaagcca gtbacctfccg 



ctcctaacag bggaggccag actfcaggcac 1560 
cacaaggccg tggcggbagg gbabgtgbcb 1620 
gctgacgcag atggaagacb baaggcagcg 1680 
gtabbcbgab aagagbcaga ggbaacbccc 1740 
gtagbcbgag cagtacbcgt bgctgccgcg 1800 
aacagacbgb tccbttccat gggtcbbtbc i860 
gatatcgcgg ccgcbcbaga gatatcgcca 1920 
ggaagcagbg gbggabctgg ggcabccbgg 1980 
gcggcaacct gtgggbgacc gtgbacbacg 2040 
cccbgtbctg cgccagcgac gccaaggccb 2100 
cccacgcctg cgbgcccacc gaccccaacc 2160 
agaacttcaa cabgbgggag aacgacabgg 2220 
tgbgggacca gagccbgaag cccbgcgbga 2280 
gcagcaaccg caccatcgac bacaacaacc 2340 
acbgcagcbb ccgabcgatc aacbgcaaca 2400 
bgagcbbcga ccccatcccc atccacbacb 2460 
gcaacaacaa gaccttcaac ggcaccggcc 2520 
cccacggcac caagcccgtg gbgagcaccc 2580 
aggagatcat cabccgcagc aagaaccbga 2 640 
bgaacgagag cgbggagatc aacbgcaccc 2700 
ccgcctggaa caagaccctg caggaggbgg 2760 
aggccabcaa gbbcgccaag cacagcggcg 2 820 
actgccgcgg cgagtbctbc tacbgcaaca 2880 
tcabcaacab gbggcagggc gbgggccgcg 2 940 
tgaccbgccg cagcaacabc accggcctgc 3 000 
acgacaccga gabcbbccgc cccgagggcg 3060 
tgbacaagba caaggbggbg gagabcaagc 3120 
btaccgtgca ggbgcgccag cbgcbgagcg 3180 
gcgccabcga ggcccagcag cacabgctgc 3240 
agacccgcgb gctggccatc gagcgcbacc 3300 
acaacabgac ctggabgcag tgggaccgcg 3360 
gccbgcbgga ggacagccag aaccagcagg 3420 
acaagtggca gaacctgtgg accbggbtcg 3480 
ggatccagab ctgctgbgcc bbctagtbgc 3540 
ccbtccbbga cccbggaagg bgccacbccc 3600 
gcatcgcabb gbcbgagtag gbgtcabbct 3660 
aagggggagg abbgggaaga caatagcagg 3720 
acccaggbgc bgaagaabbg acccggbbcc 3780 
cttctcbgtg acacacccbg bccacgcccc 3840 
cactcabagc tcaggagggc bccgccbbca 3 900 
bcbcccbccc bcatcagccc accaaaccaa 3960 
agcaagabag gcbabbaagb gcagagggag 4020 
bgagagaaab catagaabbb baaggccatg 4080 
cbbccbcgcb cacbgacbcg ctgcgcbcgg 4140 
acbcaaaggc ggbaabacgg btabccacag 4200 
gagcaaaagg ccagcaaaag gccaggaacc 42 60 
abaggctccg ccccccbgac gagcabcaca 4320 
acccgacagg acbabaaaga baccaggcgb 4 3 80 
cbgbbccgac ccbgccgcbb accggatacc 4440 
cgcbbbcbca bagcbcacgc bgbaggbabc 4500 
tgggcbgbgb gcacgaaccc cccgbtcagc 4560 
gbcbtgagbc caacccggba agacacgacb 4620 
ggabtagcag agcgaggbab gbaggcggbg 4680 
acggcbacac bagaagaaca gbabbbggba 4740 
gaaaaagagt bggbagcbcb bgabccggca 4 800 
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aacaaaccac cgctggtagc ggtggfctttt ttgtttgcaa gcagcagatt acgcgcagaa 4860 
aaaaaggatc tcaagaagab cctbbgatcb bttcbacggg gbcbgacgct cagtggaacg 4920 
aaaactcacg fctaagggatt bbggbcabga gattatcaaa aaggabcbtc acctagabcc 4980 
btttaaabba aaaabgaagt tttaaatcaa tctaaagbab atatgagtaa acttggtctg 5040 
acagbbacca atgcttaatc agtgaggcac cbabcbcagc gatctgtcta btbcgtbcat 5100 
ccatagttgc ctgactcggg gggggggggc gctgaggbcb gcctcgtgaa gaaggtgttg 5160 
ctgactcata ccaggcctga abcgccccat catccagcca gaaagtgagg gagccacggt 5220 
tgatgagagc bbbgbtgtag gtggaccagt tggtgattfct gaacttttgc tttgccacgg 5280 
aacggtctgc gttgtcggga agatgcgtga tctgatcctt caactcagca aaagttcgat 5340 
ttattcaaca aagccgccgt cccgtcaagt cagcgtaatg ctctgccagt gttacaacca 5400 
attaaccaat tctgattaga aaaactcatc gagcatcaaa tgaaactgca atttattcat 5460 
atcaggatta tcaataccat atbbbtgaaa aagccgtttc tgtaatgaag gagaaaactc 5520 
accgaggcag bbccabagga bggcaagatc ctggtatcgg tctgcgattc cgactcgtcc 5580 
aacatcaata caaccbabba abtbcccctc gtcaaaaata agghtatcaa gtgagaaatc 5640 
accatgagtg acgactgaat ccggbgagaa bggcaaaagc btabgcabtb ctttccagac 5700 
ttgttcaaca ggccagccat tacgctcgtc atcaaaatca ctcgcatcaa ccaaaccgtt 5760 
abbcabbcgb gattgcgcct gagcgagacg aaatacgcga tcgctgttaa aaggacaabb 5820 
acaaacagga abcgaatgca accggcgcag gaacacbgcc agcgcabcaa caatabbbtc 5880 
accbgaatca ggatabbcbb ctaatacctg gaabgcbgtt btcccgggga bcgcagbggb 5940 
gagbaaccab gcabcatcag gagtacggat aaaabgctbg abggbcggaa gaggcabaaa 6000 
tbccgbcagc cagbbbagbc bgaccatcbc abcbgbaaca bcattggcaa cgcbacctbb 6060 
gccabgbtbc agaaacaacb ctggcgcabc gggcbbccca bacaabcgab agatbgbcgc 6120 
acctgabbgc ccgacabbab cgcgagccca tbbabaccca bataaabcag cabccabgbt 6180 
ggaabbbaat cgcggcctcg agcaagacgb bbcccgbbga ababggcbca baacaccccb 6240 
bgtabbacbg btbabgtaag cagacagbbb tabtgttcat gabgafcabat tbtbabctbg 6300 
bgcaabgtaa cabcagagab btbgagacac aacgbggctt bccccccccc cccatbabtg 6360 
aagcabbbab cagggbbabb gbcbcabgag cggabacata bbtgaabgba tbbagaaaaa 6420 
baaacaaaba ggggtbccgc gcacabtbcc ccgaaaagtg ccaccbgacg tctaagaaac 6480 
cabtabbabc atgacatbaa ccbataaaaa baggcgbatc acgaggcccb tbcgbc 6536 



<210> 108 
<211> 6617 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 108 

tcgcgcgbbb cggbgabgac ggtgaaaacc bcbgacacab gcagctcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabtgba ctgagagbgc 180 
accabatgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc atcagatbgg 240 
cbabbggcca bbgcabacgb bgtabccaba bcataatabg tacabtbaba btggctcatg 300 
bccaacabba ccgccatgtb gacabtgabb attgacbagt babbaabagb aabcaabbac 360 
9999tcabba gbbcatagcc cababatgga gbtccgcgtb acataacbba cggbaaabgg 420 
cccgcctggc tgaccgccca acgacccccg cccabbgacg bcaabaabga cgtabgttcc 480 
cabagbaacg ccaataggga cbttccabbg acgbcaabgg gtggagbabb bacggtaaac 540 
bgcccacbbg gcagbacabc aagbgtabca batgccaagb acgcccccba tbgacgtcaa 600 
bgacggbaaa bggcccgccb ggcabtatgc ccagbacabg accbbabggg acbbbccbac 660 
bbggcagtac abcbacgtat tagtcabcgc batbaccabg gtgabgcggb bbbggcagba 720 
cabcaabggg cgbggabagc ggbbbgacbc acggggatbt ccaagtcbcc accccabbga 780 
cgbcaabggg agbbbgbttb ggcaccaaaa bcaacgggac btbccaaaat gtcgbaacaa 84 0 
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ctccgcccca bbgacgcaaa tgggcggbag gcgtgtacgg tgggaggtct ababaagcag 900 
agcbcgbbba gbgaaccgbc agatcgcctg gagacgccat ccacgcbgbt ttgacctcca 960 
tagaagacac cgggaccgab ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc . bataggcaca cccctttggc 1080 
tcttatgcat gcbatacbgt bbbbggctbg gggcctatac acccccgctt ccttatgcta 1140 
baggbgabgg babagcbbag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gabacbbbcc ahtactaatc cataacatgg ctctttgcca caactatctc 1260 
babbggcbab abgccaabac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatbtatba tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt atbaaacaba gcgbgggabc tccacgcgaa tctcgggtac gbgttccgga 1440 
cabgggctcb tctccggtag cggcggagct bccacatccg agccctggbc ccabgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbcctbg cbccbaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gtatgtgbct 1620 
gaaaatgagc gtggagatbg ggcbcgcacg gcbgacgcag atggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gbabbcbgab aagagbcaga ggtaacbccc 1740 
gtbgcggbgc bgbbaacggb ggagggcagb gbagbctgag cagbacbcgb fcgcbgccgcg 1800 
cgcgccacca gacabaatag cbgacagacb aacagacbgt tcctttccat gggtcbbtbc 1860 
bgcagtcacc gtcgbcgaca cgtgbgabca gabatcgcgg ccgctcbaga gababcgcca 1920 
ccabgcgcgb ggagggcabc cagcgcaacb ggaagcagbg gbggatcbgg ggcabcctgg 1980 
gcbtcbggab ggtgabgatc bacaacgbgc gcggcaaccb gbgggtgacc gbgtacbacg 2040 
gcgbgcccgb gbggaaggag gccaagacca cccbgbtctg cgccagcgac gccaaggccb 2100 
acgacgccga ggtgcacaac gbgtgggcca cccacgccbg cgbgcccacc gaccccaacc 2160 
cccaggagab ggbgcbggag aacgtgaccg agaacbbcaa cabgbgggag aacgacabgg 2220 
bggagcagab gcaccaggac abcabcagcc tgbgggacca gagccbgaag cccbgcgbga 2280 
agcbgacccc ccbgbgcgbg acccbgcacb gcagcaaccg caccabcgac bacaacaacc 2340 
gcaccgacaa cabgggcggc gagatcaaga acbgcagcbb ccgatcgabc aacbgcaaca 2400 
ccagcgccab cacccaggcc tgccccaagg tgagctbcga ccccabcccc abccacbact 2460 
gcgcccccgc cggcbacgcc abccbgaagb gcaacaacaa gacctbcaac ggcaccggcc 2520 
ccbgcaacaa cgbgagcacc abccagbgca cccacggcac caagcccgtg gbgagcaccc 2580 
agcbgcbgcb gaacggcagc cbggccgagg aggagabcat cabccgcagc aagaaccbga 2640 
ccgacaacgb gaagaccabc atcgbgcacc tgaacgagag cgbggagatc aactgcaccc 2700 
gccccaacaa caacacccgc aagagcabcc gcabcggccc cggccaggcc bbcbacgcca 2760 
ccggcgagab cabcggcgac atccgccagg cccacbgcaa catcagccgc accgccbgga 2820 
acaagacccb gcaggaggtg ggcaagaagc bggccgagca cbbccccaac aaggccabca 2880 
agbbcgccaa gcacagcggc ggcgaccbgg agabcaccac ccacagcbbc aacbgccgcg 2940 
gcgagbtcbb cbacbgcaac acccgatcga tccccbgccg cabcaagcag atcatcaaca 3000 
tgtggcaggg cgbgggccgc gccabgtacg ccccccccab cgagggcabc cbgaccbgcc 3 060 
gcagcaacab caccggccbg cbgcbgaccc gcgacggcgg caccggcabg cacgacaccg 3120 
agabcbtccg ccccgagggc ggcgacabgc gcgacaacbg gcgcagcgag cbgbacaagb 3180 
acaaggbggb ggagatcaag ccccbgggca bcgcccccac caaggccaag ctbaccgbgc 3240 
aggbgcgcca gcbgcbgagc ggcatcgbgc agcagcagag caacctgcbg cgcgccabcg 3300 
aggcccagca gcacabgctg cagcbgaccg tgbggggcab caagcagcbg cagacccgcg 3360 
bgcbggccab cgagcgcbac cbgcgcgacc agcagcbcga ggacabcbgg aacaacabga 3420 
cctggatgca gbgggaccgc gagabcagca acbacaccaa caccabcbac cgcctgcbgg 3480 
aggacagcca gaaccagcag gagaagaacg agcaggaccb gcbggcccbg gacaagbggc 3540 ■ 
agaaccbgbg gaccbggbbc ggcabcacca acbggcbgbg gbgaacacgt gggabccaga 3600 
bcbgcbgbgc cbbcbagbbg ccagccabcb gbbgbbbgcc ccbcccccgt gccbbcctbg 3660 
acccbggaag gbgccactcc cacbgbccbb bccbaabaaa abgaggaaab bgcabcgcab 3720 
tgbcbgagba ggbgbcabbc tafcbcbgggg ggbggggbgg ggcaggacag caagggggag 3780 
gabbgggaag acaabagcag gcatgcbggg gabgcggbgg gcbctabggg bacccaggbg 3840 
cbgaagaatb gacccggbbc cbccbgggcc agaaagaagc aggcacabcc ccbbcbcbgb 3900 
gacacacccb gtccacgccc cbggbbctba gbbccagccc cacbcatagg acacbcatag 3960 
cbcaggaggg ctccgcctbc aatcccaccc gcbaaagbac bbggagcggb ctctcccbcc 4020 
cbcabcagcc caccaaacca aaccbagccb ccaagagbgg gaagaaabba aagcaagaba 4080 
ggcbabbaag bgcagaggga gagaaaatgc cbccaacabg bgaggaagba atgagagaaa 4140 
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tcatagaatt bbaaggccab gatttaaggc cabcabggcc tbaabcbbcc gcbbccbcgc 4200 
tcacbgacbc gctgcgctcg gtcgttcggc bgcggcgagc ggtatcagct cactcaaagg 4260 
cggtaatacg gttatccaca gaabcagggg abaacgcagg aaagaacatg bgagcaaaag 4320 
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cahaggctcc 4380 . 
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 444 0 
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct ccbgbbccga 4500 
ccctgccgct taccggatac ctgtccgcct ttctcccbtc gggaagcgtg gcgcbbtcbc 4560 
atagctcacg ctghaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 4620 
tgcacgaacc ccccgttcag cccgaccgct gcgccbbatc cggtaacbab cgtcttgagt 4680 
ccaacccggt aagacacgac bbatcgccac bggcagcagc cactggtaac aggattagca 474 0 
gagcgaggta tgtaggcggt gcbacagagb bcttgaagtg gtggcctaac tacggctaca 4800 
ctagaagaac agtatttggt abctgcgcbc tgctgaagcc agttaccttc ggaaaaagag 4860 
bbggbagcbc bbgabccggc aaacaaacca ccgctggtag cggtggtttb bbbgbtbgca 4920 
agcagcagab tacgcgcaga aaaaaaggat cbcaagaaga bcctttgabc bbbtcbacgg 4980 
ggtcbgacgc bcagtggaac gaaaacbcac gttaagggat tttggbcatg agattatcaa 5040 
aaaggabcbb caccbagabc cbbtbaaatb aaaaabgaag btttaaabca atcbaaagta 5100 
babatgagba aacbtggtcb gacagbtacc aatgcbtaat cagtgaggca cctatctcag 5160 
cgabcbgbct abtbcgtbca bccabagbbg ccbgacbcgg gggggggggg cgctgaggtc 5220 
bgccbcgbga agaaggtgtt gcbgactcat accaggccbg aabcgcccca bcatccagcc 5280 
agaaagtgag ggagccacgg tbgabgagag cbbtgttgba ggtggaccag btggtgatbb 5340 
bgaactbbbg cttbgccacg gaacggtcbg cgbtgtcggg aagabgcgbg abcbgabccb 5400 
bcaacbcagc aaaagtbcga btbabtcaac aaagccgccg bcccgbcaag bcagcgbaat 54 60 
gcbcbgccag tgbbacaacc aabbaaccaa ttctgattag aaaaactcab cgagcabcaa 5520 
abgaaacbgc aabbbabbca batcaggabt abcaabacca batbbbtgaa aaagccgtbb 5580 
ctgtaabgaa ggagaaaacb caccgaggca gbbccabagg atggcaagab ccbggbabcg 5640 
gtctgcgabt ccgactcgbc caacatcaat acaaccbabb aabbbccccb cgbcaaaaab 5700 
aaggbbatca agbgagaaab caccabgagb gacgacbgaa bccggbgaga atggcaaaag 5760 
cbbatgcabb bcbbbccaga cbbgbbcaac aggccagcca btacgcbcgb cabcaaaabc 5820 
acbcgcatca accaaaccgb babbcabbcg bgabbgcgcc tgagcgagac gaaabacgcg 5880 
abcgcbgtba aaaggacaab bacaaacagg aabcgaatgc aaccggcgca ggaacacbgc 5940 
cagcgcabca acaatattbb caccbgaabc aggababbcb bcbaabaccb ggaabgcbgb 6000 
bbbcccgggg abcgcagbgg bgagbaacca bgcabcatca ggagtacgga baaaabgcbb 6060 
gabggbcgga agaggcabaa abbccgbcag ccagbtbagb cbgaccabcb cabcbgbaac 6120 
abcabbggca acgcbaccbb bgccabgbbb cagaaacaac bctggcgcat cgggcbbccc 6180 
abacaabcga bagatbgbcg caccbgabbg cccgacabba bcgcgagccc abbbabaccc 6240 
abataaabca gcatccabgb bggaabbbaa bcgcggcctc gagcaagacg bbbcccgbtg 6300 
aababggcbc abaacacccc^bbgbabbact gtbtabgbaa gcagacagbt bbatbgbbca 6360 
tgabgabata bbbbbabcbb gbgcaabgba acabcagaga bbbbgagaca caacgbggcb 6420 
btcccccccc ccccabbabb gaagcabbba bcagggbtab bgbcbcatga gcggatacat 6480 
abbbgaabgb abbbagaaaa abaaacaaab aggggbbccg cgcacabtbc cccgaaaagb 6540 
gccaccbgac gbcbaagaaa ccabbabbab cabgacabba accbabaaaa ataggcgbab 6600 
cacgaggccc btbcgbc ^617 



<210> 109 
<211> 6713 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<22 3> plasmid pVR1012x/s conbaining HIV genes 
<40O> 109 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
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cagcttgtct gbaagcggab gccgggagca 
ttggcgggtg tcggggctgg cbbaacbabg 
accatatgcg gtgtgaaaba ccgcacagat 
cbabbggcca ttgcatacgt bgbabccaba 
tccaacatta ccgccatgtt gacattgatt 
ggggbcabba gttcatagcc catatabgga 
cccgcctggc tgaccgccca acgacccccg 
catagtaacg ccaataggga cbbbccabbg 
tgcccacttg gcagbacabc aagbgbabca 
bgacggbaaa tggcccgcct ggcattatgc 
ttggcagtac abcbacgbab bagbcabcgc 
catcaatggg cgtggatagc ggbbbgacbc 
cgtcaatggg agtttgtttt ggcaccaaaa 
ctccgcccca bbgacgcaaa bgggcggtag 
agctcgbbba gbgaaccgbc agabcgccbg 
tagaagacac cgggaccgat ccagcctccg 
bccccgbgcc aagagbgacg baagbaccgc 
tcbbatgcab gcbabacbgb bbttggcbbg 
taggbgabgg babagcbbag ccbabaggbg 
tabbggbgac gabactbtcc abtacbaatc 
tabbggcbab atgccaabac tctgbccbbc 
ggabggggbc ccabtbabba tbtacaaabb 
cgcagbbbtt attaaacaba gcgbgggabc 
cabgggcbcb tctccggbag cggcggagcb 
agcggcbcab ggtcgcbcgg cagcbccbbg 
agcacaabgc ccaccaccac cagbgbgccg 
gaaaabgagc gbggagabtg ggcbcgcacg 
gcagaagaag abgcaggcag ctgagtbgbb 
gbbgcggbgc tgbbaacggb ggagggcagb 
cgcgccacca gacabaabag cbgacagacb 
bgcagbcacc gbcgbcgaca cgbgbgabca 
ccatgcgcgb ggagggcabc cagcgcaacb 
gcbtctggab ggbgabgabc bacaacgbgc 
gcgtgcccgb gbggaaggag gccaagacca 
acgacgccga ggbgcacaac gbgbgggcca 
cccaggagab ggbgctggag aacgbgaccg 
bggagcagab gcaccaggac abcabcagcc 
agcbgacccc ccbgbgcgbg acccbgcacb 
gcaccgacaa catgggcggc gagabcaaga 
gcgacaagcg cgagaaggbg cacgcccbgb 
acgagagcag caacaccagc ggcgacbacc 
cccaggccbg ccccaaggbg agcbbcgacc 
gcbacgccat ccbgaagbgc aacaacaaga 
bgagcaccab ccagbgcacc cacggcacca 
acggcagccb ggccgaggag gagabcabca 
agaccabcab cgbgcaccbg aacgagagcg 
acbgcaacab cagccgcacc gccbggaaca 
ccgagcacbb ccccaacaag gccabcaagb 
bcaccaccca cagcbbcaac bgccgcggcg 
bcaacagcac cbacaccccc aacagcaccg 
bcaccabccc cbgccgcabc aagcagabca 
bgbacgcccc ccccabcgag ggcabccbga 
bgacccgcga cggcggcacc ggcabgcacg 
acabgcgcga caacbggcgc agcgagcbgb 
bgggcabcgc ccccaccaag gccaagcbba 



gacaagcccg bcagggcgcg bcagcgggbg 120 
cggcabcaga gcagabbgba cbgagagbgc 1B0 
gcgbaaggag aaaabaccgc abcagabbgg 240 
bcabaatatg bacatbbaba bbggcbcabg 300 
abbgacbagb batbaabagb aatcaabbac 360 
gbbccgcgtb acabaacbta cggbaaabgg 420 
cccabbgacg bcaabaabga cgtabgbtcc 480 
acgbcaabgg gbggagbatb tacggbaaac 540 
babgccaagb acgcccccba tbgacgbcaa 600 
ccagbacabg accttabggg acbbbcctac 660 
babtaccabg gtgatgcggb bbbggcagba 720 
acggggabbb ccaagbcbcc accccabbga 780 
bcaacgggac bbbccaaaab gbcgbaacaa 840 
gcgbgbacgg bgggaggbcb ababaagcag 900 
gagacgccab ccacgcbgbt tbgaccbcca 960 
cggccgggaa cggbgcabbg gaacgcggab 1020 
cbabagacbc babaggcaca cccctbbggc 1080 
gggccbabac acccccgctt cctbatgcba 1140 
bgggbbabtg accabbabbg accactcccc 1200 
cabaacabgg cbcbbbgcca caacbabcbc 1260 
agagactgac acggacbcbg babbbbbaca 1320 
cacatataca acaacgccgb cccccgtgcc 1380 
bccacgcgaa bcbcgggtac gbgtbccgga 144 0 
bccacabccg agccctggbc ccatgccbcc 1500 
cbccbaacag bggaggccag acbtaggcac 1560 
cacaaggccg bggcggbagg gbabgbgbcb 1620 
gcbgacgcag abggaagacb baaggcagcg 1680 
gtabtcbgat aagagtcaga ggbaacbccc 1740 
gtagtcbgag cagbacbcgt bgcbgccgcg 1800 
aacagacbgb bccbbbccab gggbcbbbbc 1860 
gababcgcgg ccgcbcbaga gababcgcca 1920 
ggaagcagbg gbggabcbgg ggcabccbgg 1980 
gcggcaaccb gbgggbgacc gtgbacbacg 2040 
cccbgbbcbg cgccagcgac gccaaggccb 2100 
cccacgccbg cgbgcccacc gaccccaacc 2160 
agaacbbcaa cabgbgggag aacgacabgg 2220 
bgbgggacca gagccbgaag cccbgcgbga 2280 
gcagcaaccg caccabcgac bacaacaacc 2340 
acbgcagcbb caacabgacc accgaggbgc 2400 
bcbaccgccb ggacabcgbg ccccbgaaga 2460 
gccbgabcaa cbgcaacacc agcgccabca 2520 
ccabccccab ccacbacbgc gcccccgccg 2580 
ccbbcaacgg caccggcccc bgcaacaacg 2640 
agcccgbggb gagcacccag ctgcbgcbga 2700 
bccgcagcaa gaacctgacc gacaacgbga 2760 
bggagabcaa cbgcacccgc cgabcggccc 2820 
agacccbgca ggaggbgggc aagaagcbgg 2880 
bcgccaagca cagcggcggc gacctggaga 2940 
agbbcbbcba cbgcaacacc agcagccbgb 3000 
agaacabcac cggcaccgag aacagcabca 3060 
bcaacabgbg gcagggcgbg ggccgcgcca 312 0 
ccbgccgcag caacabcacc ggccbgcbgc 3180 
acaccgagab cbbccgcccc gagggcggcg 3240 
acaagbacaa ggbggbggag abcaagcccc 3300 
ccgbgcaggb gcgccagcbg cbgagcggca 3360 
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tcgtgcagca gcagagcaac ctgctgcgcg 
tgaccgtgtg gggcabcaag cagcbgcaga 
gcgaccagca gcbcgaggac abcbggaaca 
tcagcaacta caccaacacc atctaccgcc 
agaacgagca ggacctgctg gccctggaca 
tcaccaactg gctgtggtga acacgtggga 
ccabcbgbbg tttgcccctc ccccgtgcct 
gtccbbbccb aataaaatga ggaaattgca 
ctggggggtg gggtggggca ggacagcaag 
9ctggggatg cggtgggctc tatgggtacc 
tgggccagaa agaagcaggc acatcccctt 
tbcbbagbtc cagccccacb cataggacac 
ccacccgcta aagbacbbgg agcggbcbct 
tagcctccaa gagtgggaag aaattaaagc 
aaatgcctcc aacatgtgag gaagtaatga 
taaggccatc atggccttaa bcbbccgcbb 
ttcggctgcg gcgagcggta tcagctcact 
caggggataa cgcaggaaag aacabgbgag 
aaaaggccgc gbtgcbggcg bbbbbccaba 
atcgacgctc aagtcagagg tggcgaaacc 
cccctggaag cbcccbcgbg cgctctcctg 
ccgcctttct cccttcggga agcgtggcgc 
gbbcggtgba ggbcgbbcgc tccaagctgg 
accgctgcgc cttabccggb aactatcgtc 
cgccactggc agcagccact ggtaacagga 
cagagbbcbb gaagtggtgg cctaactacg 
gcgctctgcb gaagccagtt accbbcggaa 
aaaccaccgc tggtagcggt ggbbbbbbbg 
aaggatctca agaagatcct ttgatctttt 
acbcacgbba agggatttbg gbcatgagab 
baaabbaaaa abgaagbbtb aaabcaabct 
gbtaccaabg cbtaabcagb gaggcaccba 
bagbtgcctg acbcgggggg ggggggcgct 
actcatacca ggccbgaatc gccccatcat 
tgagagcbbt gbbgtaggtg gaccagbbgg 
ggbcbgcgbb gbcgggaaga bgcgtgabct 
tbcaacaaag ccgccgbccc gbcaagbcag 
aaccaabbcb gabtagaaaa acbcabcgag 
aggabbatca abaccababt bttgaaaaag 
gaggcagtbc cataggabgg caagabccbg 
atcaabacaa ccbabbaabb tccccbcgbc 
abgagbgacg acbgaatccg gtgagaatgg 
tbcaacaggc cagccatbac gctcgbcatc 
cabbcgtgab bgcgcctgag cgagacgaaa 
aacaggaabc gaabgcaacc ggcgcaggaa 
bgaabcagga tabbcbbcta abaccbggaa 
taaccatgca bcabcaggag tacggabaaa 
cgbcagccag btbagbcbga ccabctcabc 
abgtbtcaga aacaactcbg gcgcabcggg 
bgatbgcccg acabbatcgc gagcccabbb 
atbbaabcgc ggccbcgagc aagacgbbbc 
atbacbgtbb abgbaagcag acagbbtbab 
aabgbaacab cagagabtbt gagacacaac 
cabbbatcag ggbbatbgbc bcabgagcgg 
acaaabaggg gbbccgcgca cabbbccccg 



ccabcgaggc ccagcagcac atgctgcagc 3420 
cccgcgtgct ggccatcgag cgctaccbgc 3480 
acabgaccbg gabgcagbgg gaccgcgaga 3540 
bgctggagga cagccagaac cagcaggaga 3600 
agtggcagaa ccbgtggacc bggbbcggca. 3660 
bccagabcbg cbgbgccbbc bagbbgccag 3720 
bccbbgaccc bggaaggbgc cacbcccacb 3780 
tcgcabtgtc tgagbaggbg bcabbcbabb 3840 
99ggaggatb gggaagacaa bagcaggcab 3900 
caggbgcbga agaabbgacc cggbbccbcc 3960 
ctcbgbgaca cacccbgbcc acgccccbgg 4020 
bcabagcbca ggagggcbcc gccbbcaatc 4080 
ccctcccbca tcagcccacc aaaccaaacc 4140 
aagataggcb abbaagbgca gagggagaga 4200 
gagaaabcab agaabbbbaa ggccatgabb 4260 
ccbcgcbcac bgacbcgcbg cgctcggbcg 4320 
caaaggcggb aabacggbba bccacagaab 4380 
caaaaggcca gcaaaaggcc aggaaccgba 4440 
ggcbccgccc cccbgacgag cabcacaaaa 4500 
cgacaggacb abaaagabac caggcgbbbc 4560 
bbccgaccct gccgcbbacc ggataccbgb 4620 
bbbcbcabag cbcacgcbgb aggbabcbca 4680 
gcbgbgbgca cgaacccccc gbtcagcccg 4740 
bbgagbccaa cccggbaaga cacgacbbab 4 800 
tbagcagagc gaggbatgba ggcggbgcba 4860 
gcbacacbag aagaacagba bbbggbabcb 4920 
aaagagtbgg bagcbcbbga bccggcaaac 4980 
bbtgcaagca gcagabbacg cgcagaaaaa 5040 
cbacggggbc bgacgcbcag tggaacgaaa 5100 
babcaaaaag gabcbbcacc tagabccbbb 5160 
aaagbababa bgagbaaact bggbcbgaca 5220 
tcbcagcgab ctgbcbabbb cgbbcabcca 5280 
gaggbcbgcc bcgbgaagaa ggbgbtgcbg 5340 
ccagccagaa agbgagggag ccacggbtga 5400 
tgabttbgaa cbbbbgcbbb gccacggaac 5460 
gabcctbcaa cbcagcaaaa gbbcgabbba 5520 
cgbaatgcbc bgccagbgbb acaaccaabb 5580 
cabcaaabga aacbgcaatb babbcabatc 5640 
ccgtbbctgb aatgaaggag aaaactcacc 5700 
gbabcggbcb gcgabbccga cbcgbccaac 5760 
aaaaabaagg bbabcaagbg agaaabcacc 5820 
caaaagcbta bgcabbbcbb bccagacbbg 5880 
aaaabcacbc gcabcaacca aaccgbbabb 5940 
bacgcgabcg cbgbbaaaag gacaabbaca 6000 
cacbgccagc gcabcaacaa babbbbcacc 6060 
bgcbgbbbbc ccggggabcg cagbggbgag 6120 
abgcbbgabg gbcggaagag gcabaaabtc 6180 
tgbaacabca bbggcaacgc baccbbtgcc 6240 
cbbcccabac aabcgabaga bbgbcgcacc 6300 
abacccabab aaabcagcab ccabgbbgga 6360 
ccgbbgaaba bggcbcabaa caccccbbgb 6420 
bgbbcabgab gabababbbb babcbbgbgc 6480 
gbggcbtbcc cccccccccc abbabbgaag 6540 
atacabatbb gaabgbabbb agaaaaabaa 6600 
aaaagbgcca ccbgacgbcb aagaaaccab 6660 
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tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc gtc 6713 



<210> 110 
<211> 6644 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 110 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtacccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac acctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960' 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacafcataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2 22 0 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc cctgtgcgtg accctgcact gcagcaaccg caccatcgac tacaacaacc 2340 
gcaccgacaa catgggcggc gagatcaaga actgcagctt caacatgacc accgaggtgc 2400 
gcgacaagcg cgagaaggtg cacgccctgt tctaccgcct ggacatcgtg cccctgaaga 2460 
acgagagcag caacaccagc ggcgactacc gcctgatcaa ctgcaacacc agcgccatca 2520 
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cccaggccbg ccccaaggtg agcttcgacc ccatccccat ccactactgc gcccccgccg 2580 
gctacgccat cctgaagtgc aacaacaaga ccttcaacgg caccggcccc bgcaacaacg 2640 
bgagcaccat ccagtgcacc cacggcacca agcccgtggt gagcacccag cbgcbgcbga 2700 
acggcagcct ggccgaggag gagabcabca tccgcagcaa gaacctgacc gacaacgtga 2760 
agaccatcat cgtgcacctg aacgagagcg tggagatcaa ctgcacccgc cgabcggccc 2820 
actgcaacat cagccgcacc gcctggaaca agaccctgca ggaggtgggc aagaagctgg 2880 
ccgagcactt ccccaacaag gccatcaagb tcgccaagca cagcggcggc gacctggaga 2940 
tcaccaccca cagcttcaac tgccgcggcg agbbctbcba ctgcaacacc cgatcgatcc 3 000 
cctgccgcat caagcagatc abcaacabgb ggcagggcgt gggccgcgcc atgtacgccc 3060 
cccccatcga gggcatcctg acctgccgca gcaacatcac cggcctgcfcg ctgacccgcg 3120 
acggcggcac cggcatgcac gacaccgaga tcttccgccc cgagggcggc gacatgcgcg 3180 
acaactggcg cagcgagctg tacaagtaca aggbggtgga gatcaagccc ctgggcatcg 3240 
cccccaccaa ggccaagctt accgtgcagg tgcgccagct gctgagcggc abcgbgcagc 3300 
agcagagcaa ccbgcbgcgc gccatcgagg cccagcagca catgctgcag ctgaccgtgt 3360 
ggggcatcaa gcagctgcag acccgcgtgc tggccatcga gcgctacctg cgcgaccagc 3420 
agctcgagga catctggaac aacatgaccb ggatgcagtg ggaccgcgag abcagcaacb 34 80 
acaccaacac catctaccgc ctgctggagg acagccagaa ccagcaggag aagaacgagc 3540 
aggacctgct ggccctggac aagtggcaga acctgtggac ctggttcggc atcaccaact 3600 
ggctgtggtg aacacgtggg abccagabcb gctgtgcctt ctagttgcca gccatctgtt 3660 
gfcbbgccccb cccccgtgcc bbccbbgacc cbggaaggbg ccactcccac bgtccbtbcc 3720 
taataaaatg aggaaabbgc abcgcabbgt cbgagbaggb gtcattctab tcbggggggb 3780 
ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat 3840 
gcggbgggcb cbabgggbac ccaggtgctg aagaattgac ccggttcctc cbgggccaga 3900 
aagaagcagg cacatcccct tctctgtgac acaccctgtc cacgcccctg gttcttagtt 3960 
ccagccccac bcabaggaca ctcatagctc aggagggctc cgccttcaat cccacccgct 4020 
aaagtacttg gagcggtctc tccctcccbc abcagcccac caaaccaaac ctagcctcca 4080 
agagtgggaa gaaattaaag caagataggc tatbaagtgc agagggagag aaaabgccbc 4140 
caacabgtga ggaagbaabg agagaaatca bagaabtbta aggccatgat btaaggccat 4200 
cabggccbba abcbbccgcb tccbcgctca cbgacbcgct gcgcbcggbc gbbcggcbgc 4260 
ggcgagcggb abcagcbcac bcaaaggcgg baabacggbt abccacagaa bcaggggaba 4320 
acgcaggaaa gaacabgbga gcaaaaggcc agcaaaaggc caggaaccgb aaaaaggccg 4380 
cgtbgcbggc gbtbbbccab aggcbccgcc cccctgacga gcabcacaaa aabcgacgcb 4440 
caagtcagag gbggcgaaac ccgacaggac bataaagaba ccaggcgbbb cccccbggaa 4500 
gcbcccbcgb gcgcbcbccb gbbccgaccc bgccgcbbac cggabaccbg bccgccbbbc 4560 
bcccbbcggg aagcgbggcg cbbbcbcaba gcbcacgcbg baggbabcbc agbbcggbgb 4620 
aggtcgbbcg cbccaagcbg ggctgbgbgc acgaaccccc cgbtcagccc gaccgcbgcg 4 680 
ccttabccgg baacbabcgb ctbgagbcca acccggtaag acacgacbba bcgccacbgg 4740 
cagcagccac bggbaacagg abbagcagag cgaggbabgt aggcggbgcb acagagbbcb 4800 
bgaagbggbg gccbaacbac ggctacacta gaagaacagb attbggbabc bgcgctctgc 4860 
tgaagccagb baccbtcgga aaaagagbbg gbagcbctbg atccggcaaa caaaccaccg 4 920 
cbggbagcgg bggbbtbbbt gbbtgcaagc agcagabtac gcgcagaaaa aaaggabcbc 4980 
aagaagabcc tbbgatcbbb bcbacggggb cbgacgctca gbggaacgaa aacbcacgbb 5040 
aagggatbtb ggbcabgaga btabcaaaaa ggabcbbcac ctagabccbb btaaabbaaa 5100 
aabgaagbbb baaabcaabc baaagbabab abgagbaaac btggbcbgac agbtaccaab 5160 
gcbbaatcag bgaggcaccb abcbcagcga bcbgbcbabb bcgbbcabcc abagbbgccb 5220 
gactcggggg gggggggcgc bgaggfccbgc cbcgbgaaga aggbgbbgct gacbcabacc 5280 
aggccbgaab cgccccabca tccagccaga aagbgaggga gccacggbbg abgagagcbb 5340 
tgtbgbaggb ggaccagbbg gbgabbbbga acbbbbgcbb bgccacggaa cggbcbgcgb 5400 
bgbcgggaag abgcgbgabc bgabcctbca acbcagcaaa agbbcgabbb abbcaacaaa 5460 
gccgccgbcc cgbcaagbca gcgbaabgcb cbgccagbgb bacaaccaab baaccaabbc 5520 
bgabbagaaa aacbcabcga gcabcaaabg aaacbgcaab bbabbcabab caggabtabc 5580 
aabaccabab bbbbgaaaaa gccgbbtcbg baabgaagga gaaaacbcac cgaggcagbb 5640 
ccabaggabg gcaagabccb ggbabcggbc bgcgabbccg acbcgbccaa cabcaataca 5700 
accbatbaat bbccccbcgb caaaaabaag gbbabcaagb gagaaabcac cabgagbgac 5760 
gacbgaabcc ggbgagaatg gcaaaagcbb atgcattbcb bbccagacbb gbbcaacagg 5820 
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ccagccatba cgctcghcat caaaatcact cgcatcaacc aaaccgttat bcattcgtga 5880 

ttgcgcctga gcgagacgaa atacgcgatc gctgttaaaa ggacaabbac aaacaggaat 5940 

cgaatgcaac cggcgcagga acactgccag cgcabcaaca ababbbtcac ctgaatcagg 6000 

atattcttct aatacctgga atgctgtttt cccggggatc gcagtggtga gtaaccabgc 6060 

atcatcagga gtacggataa aabgchtgat ggtcggaaga ggcataaatt ccgtcagcca 6120 

gbbtagtctg accatctcab cbgbaacatc abbggcaacg cbaccbtbgc catgbbbcag 6180 

aaacaacbcb ggcgcatcgg gcbbcccaba caatcgatag abbgbcgcac cbgabbgccc 6240 

gacattatcg cgagcccabb tabacccaba baaabcagca bccabgbbgg aabbbaabcg 6300 

cggccbcgag caagacgbbb cccgttgaab atggctcaba acaccccbbg tabbacbgbb 6360 

babgtaagca gacagbbtba tbgbbcatga tgatatatbb bbatcttgbg caatgtaaca 6420 

bcagagabbt bgagacacaa cgbggcbbtc cccccccccc cabbattgaa gcatttatca 64 80 

gggttabbgb cbcabgagcg gabacatabb tgaatgtatb tagaaaaaba aacaaabagg 6540 

ggbbccgcgc acabbbcccc gaaaagbgcc accbgacgbc baagaaacca bbatbabcab 6600 
gacatbaacc babaaaaaba ggcgtabcac gaggcccbbb cgbc 6644 



<210> 111 
<211> 6725 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 111 

tcgcgcgbbt cggtgabgac ggbgaaaacc tctgacacab gcagctcccg gagacggtca 60 
cagcttgtct gbaagcggat gccgggagca gacaagcccg bcagggcgcg tcagcgggbg 120 
btggcgggtg bcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccaba tcataatabg tacabbbata ttggcbcabg 300 
tccaacatta ccgccabgtb gacattgatb attgactagt babtaatagt aatcaattac 360 
ggggtcatba gbbcatagcc catatatgga gttccgcgtt acataacbta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg bcaataatga cgtatgtbcc 4 80 
catagtaacg ccaataggga ctttccabtg acgtcaatgg gtggagtatt bacggtaaac 540 
bgcccacbtg gcagbacatc aagbgbabca tatgccaagb acgcccccba btgacgbcaa 600 
tgacggbaaa bggcccgcct ggcabtatgc ccagbacatg acctbabggg acbttccbac 660 
ttggcagtac atctacgtat tagtcabcgc tattaccabg gtgabgcggt bbtggcagba 720 
catcaatggg cgtggabagc ggbbbgacbc acggggatbt ccaagtctcc accccattga 780 
cgtcaatggg agbbbgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca bbgacgcaaa bgggcggbag gcgtgtacgg bgggaggbcb abataagcag 900 
agcbcgbtta gtgaaccgbc agatcgcctg gagacgccat ccacgctgtb btgaccbcca 960 
bagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcatbg gaacgcggat 102 0 
bccccgtgcc aagagtgacg baagbaccgc cbabagactc tataggcaca cccctbtggc 1080 
tcttatgcat gcbatactgt tttbggcttg gggcctatac acccccgcbb ccbtabgcta 1140 
taggtgatgg batagcbtag ccbabaggtg bgggttattg accabbabbg accactcccc 1200 
tabtggtgac gatactttcc attactaatc cabaacatgg cbcbbbgcca caactatctc 1260 
tattggctat abgccaatac bcbgbccbbc agagacbgac acggactcbg tabbbbtaca 1320 
ggabggggbc ccatttatta tttacaaatt cacababaca acaacgccgb cccccgbgcc 13 80 
cgcagttbtt attaaacata gcgtgggatc bccacgcgaa tctcgggbac gbgttccgga 1440 
cabgggcbct tcbccggbag cggcggagct tccacatccg agccctggtc ccabgccbcc 1500 
agcggcbcat ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg bggcggtagg gbatgtgtct 1620 
gaaaatgagc gtggagatbg ggcbcgcacg gctgacgcag abggaagact baaggcagcg 1680 
gcagaagaag atgcaggcag cbgagttgtb gtatbctgat aagagbcaga ggtaactccc 1740 
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gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gtcgtcgaca cgtgtgatca 
ccatgcgcgt ggagggcatc cagcgcaact 
gcttctggat ggbgabgatc tacaacgtgc 
gcgtgcccgt gtggaaggag gccaagacca 
acgacgccga ggbgcacaac gtgtgggcca 
cccaggagat ggtgctggag aacgtgaccg 
tggagcagat gcaccaggac abcatcagcc 
agctgacccc cctgtgcgtg accctgcact 
gcaccgacaa catgggcggc gagatcaaga 
gcgacaagcg cgagaaggtg cacgccctgt 
acgagagcag caacaccagc ggcgactacc 
cccaggccfcg ccccaaggtg agcttcgacc 
gctacgccat cctgaagbgc aacaacaaga 
tgagcaccat ccagtgcacc cacggcacca 
acggcagcct ggccgaggag gagatcatca 
agaccatcat cgtgcacctg aacgagagcg 
acacccgcaa gagcatccgc abcggccccg 
bcggcgacat ccgccaggcc cactgcaaca 
aggaggbggg caagaagcbg gccgagcact 
acagcggcgg cgaccbggag abcaccaccc 
actgcaacac ccgatcgatc ccctgccgca 
tgggccgcgc catgtacgcc ccccccatcg 
cciggcctgct gctgacccgc gacggcggca 
ccgagggcgg cgacabgcgc gacaactggc 
agatcaagcc cctgggcatc gcccccacca 
bgcbgagcgg cabcgtgcag cagcagagca 
acabgcbgca gctgaccgtg bggggcabca 
agcgctacct gcgcgaccag cagctcgagg 
gggaccgcga gatcagcaac tacaccaaca 
accagcagga gaagaacgag caggacctgc 
cctgghtcgg catcaccaac tggctgtggt 
tctagttgcc agccatctgt tgtttgcccc 
gccactccca cbgbccbbbc ctaataaaat 
tgtcattcta ttctgggggg tggggtgggg 
aatagcaggc atgctgggga tgcggtgggc 
cccggttcct cctgggccag aaagaagcag 
ccacgccccb ggttcttagt tccagcccca 
ccgccttcaa tcccacccgc taaagtactt 
ccaaaccaaa cctagcctcc aagagtggga 
cagagggaga gaaaatgcct ccaacabgtg 
aaggccatga bbbaaggcca tcatggcctt 
tgcgctcggt cgtbcggcbg cggcgagcgg 
tatccacaga atcaggggat aacgcaggaa 
ccaggaaccg taaaaaggcc gcgbbgcbgg 
agcatcacaa aaabcgacgc bcaagtcaga 
accaggcgtt tccccctgga agctccchcg 
ccggatacct gtccgccttt ctcccttcgg 
gbaggbabcb cagttcggtg baggbcgbbc 
ccgttcagcc cgaccgctgc gccbbabccg 
gacacgactt abcgccactg gcagcagcca 
taggcggtgc bacagagbtc ttgaagtggb 
babbtggbab ctgcgcbctg cbgaagccag 
gabccggcaa acaaaccacc gcbggbagcg 



gbagbctgag cagtactcgb bgctgccgcg 1800 
aacagactgb tcctbtccat gggbcbtbbc 1860 
gababcgcgg ccgctcbaga gabatcgcca 1920 
ggaagcagtg gtggatcbgg ggcatcctgg 1980 
gcggcaaccb gtgggtgacc gbgbactacg 2040 
ccctgtbcbg cgccagcgac gccaaggcct 2100 
cccacgccbg cgtgcceacc gaccccaacc 2160 
agaacttcaa cabgbgggag aacgacatgg 2220 
tgbgggacca gagccbgaag cccbgcgtga 2280 
gcagcaaccg caccabcgac bacaacaacc 2340 
actgcagctb caacatgacc accgaggtgc 2400 
bcbaccgccb ggacatcgtg ccccbgaaga 2460 
gcctgabcaa ctgcaacacc agcgccabca 2520 
ccatccccat ccacbacbgc gcccccgccg 2580 
ccbtcaacgg caccggcccc tgcaacaacg 2 64 0 
agcccgbggb gagcacccag ctgctgctga 2700 
bccgcagcaa gaaccbgacc gacaacgtga 2760 
tggagatcaa ctgcacccgc cccaacaaca 2820 
gccaggcctb cbacgccacc ggcgagatca 2880 
bcagccgcac cgcctggaac aagacccbgc 2 940 
bccccaacaa ggccatcaag bbcgccaagc 3000 
acagcbbcaa ctgccgcggc gagtbcbbcb 3050 
bcaagcagab catcaacabg bggcagggcg 3120 
agggcabccb gacctgccgc agcaacatca 3180 
ccggcabgca cgacaccgag abctbccgcc 3240 
gcagcgagcb gtacaagbac aaggbggbgg 3300 
aggccaagcb taccgbgcag gbgcgccagc 3360 
accbgcbgcg cgccatcgag gcccagcagc 3420 
agcagcbgca gacccgcgbg cbggccabcg 34 80 
acabcbggaa caacabgacc tggabgcagb 3540 
ccatcbaccg cctgcbggag gacagccaga 3600 
tggcccbgga caagbggcag aaccbgbgga 3 660 
gaacacgbgg gabccagabc tgcbgbgccb 3720 
bcccccgtgc cbbccbbgac ccbggaaggb 3780 
gaggaaabbg cabcgcatbg bctgagtagg 3840 
caggacagca agggggagga bbgggaagac 390 0 
bctatgggba cccaggtgcb gaagaatbga 3 960 
gcacabcccc bbctctgtga cacaccctgt 4020 
ctcabaggac acbcabagct caggagggct 4080 
ggagcggtcb ctccctcccb cabcagccca 414 0 
agaaatbaaa gcaagatagg cbabbaagbg 4200 
aggaagbaab gagagaaabc abagaabbbt 4260 
aabcbtccgc btccbcgcbc actgactcgc 4320 
tabcagcbca cbcaaaggcg gtaatacggt 4380 
agaacabgbg agcaaaaggc cagcaaaagg 4440 
cgbbbbbcca baggcbccgc cccccbgacg 4500 
ggbggcgaaa cccgacagga cbabaaagab 4560 
bgcgcbcbcc bgtbccgacc cbgccgcbta 4620 
gaagcgtggc gcbbtcbcab agcbcacgcb 4680 
gcbccaagcb gggcbgbgtg cacgaacccc 474 0 
gbaacbabcg bcttgagbcc aacccggtaa 4800 
cbggbaacag gatbagcaga gcgaggbabg 4860 
ggccbaacba cggcbacacb agaagaacag 4 920 
bbacctbcgg aaaaagagbb ggfcagcbcbb 4980 
gbggbbbbbb bgbbtgcaag cagcagabba 5040 
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cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 5100 
a 9t9gaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 5160 
cctagatcct bbbaaabtaa aaatgaagtt btaaabcaat cbaaagbaba tatgagbaaa 5220 
cttggtctga cagbtaccaa tgcttaatca gtgaggcacc tatctcagcg abcbgbctab 5280 
btcgbtcabc catagttgcc tgactcgggg ggggggggcg ctgaggtctg cctcgtgaag 5340 
aaggtgttgc tgactcatac caggccfcgaa tcgccccatc abccagccag aaagtgaggg 5400 
agccacggtt gatgagagct ttgttgtagg tggaccagtt ggtgattttg aactttbgcb 5460 
ttgccacgga acggtcbgcg ttgtcgggaa gabgcgbgat ctgatccbbc aactcagcaa 5 520 
aagttcgatt tattcaacaa agccgccgtc ccgtcaagtc agcgtaatgc tctgccagtg 5580 
ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg agcatcaaab gaaactgcaa 5640 
tttattcata tcaggathat caataccata tttttgaaaa agccgtttct gtaatgaagg 5700 
agaaaactca ccgaggcagt tccataggat ggcaagatcc tggtatcggt ctgcgattcc 57 60 
gactcgtcca acabcaatac aacctattaa bbbccccbcg tcaaaaataa ggbbatcaag 5820 
tgagaaatca ccatgagtga cgactgaabc cggtgagaat ggcaaaagct tatgcatttc 5880 
tttccagact tgttcaacag gccagccatt acgctcgtca tcaaaatcac tcgcatcaac 5940 
caaaccgtta tbcatbcgtg attgcgcctg agcgagacga aatacgcgat cgctgttaaa' 6000 
aggacaatba caaacaggaa bcgaatgcaa ccggcgcagg aacactgcca gcgcabcaac 6060 
aatattttca cctgaatcag gatatbcttc taatacctgg aatgctgttt tcccggggat 6120 
cgcagtggbg agtaaccatg catcatcagg agbacggata aaatgcttga tggtcggaag 6180 
aggcahaaat bccgbcagcc agtttagtct gaccabctca tctgtaacat cattggcaac 6240 
gctaccttbg ccatgfcttca gaaacaactc tggcgcatcg ggchtcccat acaatcgata 6300 
gatbgtcgca cctgattgcc cgacattatc gcgagcccat ttatacccat ataaafccagc 6360 
atccatgfctg gaatttaatc gcggcctcga gcaagacgtt tcccgttgaa tatggctcat 6420 
aacacccctt gtatbactgt bbatgtaagc agacagtbtt atbgbbcabg atgatabatb 64 80 
bbtabcbbgb gcaatgbaac atcagagatb btgagacaca acgbggcbbb cccccccccc 6540 
ccabtatbga agcabbtatc agggbtatbg bcbcabgagc ggabacatat btgaabgbab 6600 
btagaaaaab aaacaaatag gggbtccgcg cacabttccc cgaaaagbgc cacctgacgt 6660 
cbaagaaacc atbabbabca tgacattaac ctabaaaaat aggcgtatca cgaggccctb 6720 
bcgbc 6725 



<210> 112 
<211> 6824 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<22 3> plasmid pVR1012x/s containing HIV genes 
<400> 112 

bcgcgcgtbb cggtgabgac ggbgaaaacc bcbgacacat gcagcbcccg gagacggtca 60 

cagcttgtct gtaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

tbggcgggbg tcggggctgg ctbaactatg cggcatcaga gcagabbgta ctgagagtgc 180 

accatatgcg gbgtgaaata ccgcacagab gcgtaaggag aaaataccgc atcagabtgg 24 0 

ctattggcca ttgcatacgt tgbatccata tcataatatg tacattbata btggcbcabg 300 

tccaacatta ccgccatgbt gacabtgatb abbgactagt tattaabagb aatcaatbac 360 

9999tcatba gtbcatagcc cabababgga gtbccgcgbb acabaacbba cggtaaatgg 420 

cccgcctggc bgaccgccca acgacccccg cccabbgacg tcaataabga cgbabgtbcc 4 80 

catagbaacg ccaabaggga ctttccatbg acgtcaatgg gbggagbatt tacggbaaac 540 

bgcccacttg gcagbacatc aagbgtatca tatgccaagb acgcccccta btgacgbcaa 600 

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accbbabggg actttccbac 660 

tbggcagtac atctacgbat bagtcatcgc tabtaccabg gbgabgcggb btbggcagba 72 0 

cabcaabggg cgtggabagc ggtttgactc acggggabbb ccaagbcbcc accccatbga 780 

cgbcaatggg agbbtgtbtb ggcaccaaaa tcaacgggac tbbccaaaab gtcgbaacaa 84 0 
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cbccgcccca ttgacgcaaa bgggcggbag gcgtgtacgg tgggaggtct ababaagcag 900 
agctcgttta gtgaaccgtc agabcgccbg gagacgccat ccacgctgtt btgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctbbggc 1080 
tcttatgcat gcbabacbgb ttttggcttg gggcctatac acccccgctt ccbbabgcta 1140 
ta 99tgatgg babagcbbag ccbabaggbg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cabaacabgg ctctttgcca caactatctc 1260 
babbggcbab atgccaatac tctgtccttc agagactgac acggacbcbg tabbbbbaca 1320 
ggatggggtc ccatttatta bttacaaabb cacababaca acaacgccgb cccccgtgcc 13 80 
cgcagbbbbb attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
cabgggcbcb tctccggtag cggcggagct tccacatccg agccctggbc ccabgcctcc 15 00 
agcggctcat ggtcgctcgg cagcfcccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg bggcggbagg gbabgbgbcb 1620 
gaaaabgagc gbggagabtg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag ' atgcaggcag cbgagbbgbb gbattcbgab aagagbcaga ggtaacbccc 1740 
gbtgcggtgc tgttaacggt ggagggcagt gbagbctgag cagbactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt bccbttccat gggbctbbtc 1860 
bgcagtcacc gbcgbcgaca cgbgtgatca gabatcgcgg ccgcbcbaga gatabcgcca 1920 
ccabgcgcgb ggagggcatc cagcgcaacb ggaagcagtg gtggabcbgg ggcatcctgg 1980 
gctbcbggat ggbgatgatc tacaacgtgc gcggcaacct gbgggtgacc gbgtacbacg 2 040 
gcgbgcccgb gbggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggccb 2100 
acgacgccga ggtgcacaac gbgbgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagab ggtgctggag aacgtgaccg agaacttcaa catgbgggag aacgacabgg 2220 
bggagcagat gcaccaggac abcabcagcc bgbgggacca gagccbgaag ccctgcgbga 2280 
agctgacccc ccbgbgcgbg accctgcacb gcagcaaccg atcgaagaac bgcagcbtca 2340 
acabgaccac cgaggbgcgc gacaagcgcg agaaggbgca cgccctgttc taccgcctgg 2400 
acabcgtgcc ccbgaagaac gagagcagca acaccagcgg cgactaccgc ctgatcaact 24 60 
gcaacaccag cgccabcacc caggcctgcc ccaaggtgag cbbcgacccc atccccabcc 2520 
actactgcgc ccccgccggc bacgccatcc bgaagtgcaa caacaagacc btcaacggca 2580 
ccggccccbg caacaacgbg agcaccabcc agbgcaccca cggcaccaag cccgtggbga 2640 
gcacccagct gctgctgaac ggcagccbgg ccgaggagga gabcabcatc cgcagcaaga 2700 
acctgaccga caacgbgaag accabcabcg tgcaccbgaa cgagagcgtg gagabcaact 2760 
gcacccgccc caacaacaac acccgcaaga gcabccgcab cggccccggc caggcctbct 2 82 0 
acgccaccgg cgagabcatc ggcgacatcc gccaggccca cbgcaacabc agccgcaccg 2880 
ccbggaacaa gacccbgcag gaggbgggca agaagcbggc cgagcactbc cccaacaagg 2940 
ccatcaagbb cgccaagcac agcggcggcg accbggagat caccacccac agcbbcaacb 3000 
gccgcggcga gttcbbctac tgcaacacca gcagccbgbt caacagcacc bacaccccca 3060 
acagcaccga gaacabcacc ggcaccgaga acagcatcat caccabcccc bgccgcabca 3120 
agcagabcab caacabgbgg cagggcgbgg gccgcgccat gbacgccccc cccabcgagg 3180 
gcabcctgac cbgccgcagc aacabcaccg gccbgctgcb gacccgcgac ggcggcaccg 3240 
gcatgcacga caccgagabc bbccgccccg agggcggcga cabgcgcgac aacbggcgca 3300 
gcgagcbgta caagbacaag gtggbggaga bcaagcccct gggcabcgcc cccaccaagg 33 60 
ccaagcbbac cgbgcaggbg cgccagcbgc bgagcggcat cgtgcagcag cagagcaacc 3420 
bgcbgcgcgc cabcgaggcc cagcagcaca bgctgcagct gaccgbgbgg ggcabcaagc 3480 
agcbgcagac ccgcgtgcbg gccabcgagc gcbaccbgcg cgaccagcag cbcgaggaca 3540 
bcbggaacaa cabgaccbgg abgcagbggg accgcgagat cagcaacbac accaacacca 3600 
bcbaccgccb gcbggaggac agccagaacc agcaggagaa gaacgagcag gaccbgcbgg 3660 
cccbggacaa gbggcagaac cbgtggacct ggbtcggcab caccaacbgg cbgbggtaca 3720 
bcaagatcbb cabcaagabc gbgggcggcc tgatcggccb gcgcabcatc btcgccgbgc 3780 
bgagcatcbg aacacgbggg abccagabcb gcbgbgccbb ctagbbgcca gccatcbgbt 3 840 
gtbbgccccb cccccgbgcc bbccbbgacc cbggaaggbg ccacbcccac bgtccbbtcc 3900 
baabaaaabg aggaaabbgc abcgcabbgb cbgagtaggb gbcabbcbab bctggggggt 3960 
ggggtggggc aggacagcaa gggggaggab tgggaagaca abagcaggca bgcbggggat 402 0 
gcggbgggct cbatgggbac ccaggbgcbg aagaabbgac ccggbbccbc cbgggccaga 4080 
aagaagcagg cacabcccct bcbcbgtgac acacccbgtc cacgccccbg gbbcbbagbt 414 0 
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ccagccccac bcabaggaca cbcabagcbc aggagggctc cgccttcaat cccacccgcb 4200 
aaagtacttg gagcggtctc tccctccctc atcagcccac caaaccaaac ctagcctcca 4260 
agagtgggaa gaaattaaag caagataggc babbaagtgc agagggagag aaaabgccbc 4320 
caacatgtga ggaagtaatg agagaaafcca tagaatttta aggccatgat bbaaggccab 4380 
catggcctfca abcbbccgcb bccbcgcbca ctgactcgct gcgctcggtc gtbcggcbgc 444 0 
ggcgagcggt atcagctcac bcaaaggcgg taabacggbb atccacagaa tcaggggaba 4500 
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 4560 
cgttgctggc gbbbttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 4620 
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgtbt ccccctggaa 4680 
gcbcccbcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 4740 
tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 4800 
aggbcgbbcg cbccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 4 860 
ccbbabccgg taactatcgt cttgagtcca acccggtaag acacgactta bcgccacbgg 4920 
cagcagccac tggtaacagg atbagcagag cgaggtatgt aggcggbgct acagagttct 4980 
bgaagbggbg gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctcfcgc 5040 
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 5100 
ctggtagcgg bggbbbbbtb gtbbgcaagc agcagattac gcgcagaaaa aaaggatctc 5160 
aagaagatcc tttgatcttt tctacggggt cbgacgcbca gtggaacgaa aactcacgtt 5220 
aagggathtb ggtcatgaga ttatcaaaaa ggabcbbcac ctagatcctt ttaaattaaa 5280 
aatgaagttt taaatcaatc baaagtabab atgagtaaac ttggtctgac agttaccaat 5340 
gcbtaabcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagtbgcct 5400 
gacbcggggg gggggggcgc bgaggtctgc cbcgbgaaga aggbgtbgcb gacbcabacc 5460 
aggcctgaab cgccccatca bccagccaga aagtgaggga gccacggtbg abgagagcbb 5520 
tgbbgbaggb ggaccagtbg gtgabttbga acbbbbgctb bgccacggaa cggbcbgcgb 5580 
bgtcgggaag abgcgbgatc tgabccttca actcagcaaa agtbcgattt abtcaacaaa 5640 
gccgccgtcc cgtcaagtca gcgbaatgcb cbgccagbgt tacaaccaat baaccaattc 5700 
bgabbagaaa aacbcatcga gcabcaaabg aaacbgcaab tbatbcatab caggatbatc 5760 
aabaccabab bbbbgaaaaa gccgbbbcbg taabgaagga gaaaactcac cgaggcagbt 5820 
ccataggabg gcaagabcct ggtabcggbc tgcgabtccg actcgtccaa cabcaabaca 5880 
accbabbaab bbcccctcgt caaaaabaag gttabcaagb gagaaabcac cabgagbgac 5940 
gacbgaabcc ggtgagaabg gcaaaagcbb abgcattbcb tbccagactt gbtcaacagg 6000 
ccagccabba cgctcgbcat caaaabcacb cgcabcaacc aaaccgbtat bcattcgbga 6060 
bbgcgccbga gcgagacgaa abacgcgabc gcbgbbaaaa ggacaabbac aaacaggaab 6120 
cgaabgcaac cggcgcagga acacbgccag cgcabcaaca abatbtbcac cbgaatcagg 6180 
ababbcbbct aataccbgga abgcbgttbb cccggggatc gcagbggbga gbaaccabgc 6240 
abcabcagga gbacggataa aabgcttgab ggbcggaaga ggcabaaatt ccgbcagcca 6300 
gbbbagtcbg accabcbcab cbgbaacabc abbggcaacg ctaccbbbgc cabgbbtcag 6360 
aaacaacbcb ggcgcatcgg gcbbcccaba caabcgatag atbgbcgcac cbgabbgccc 6420 
gacabbabcg cgagcccabb tabacccaba baaabcagca bccabgbbgg aatbtaabcg 6480 
cggccbcgag caagacgbbb cccgbbgaab abggcbcata acaccccbtg batbacbgbb 6540 
babgbaagca gacagtbbta ttgtbcabga bgabababbt bbabcbbgtg caabgbaaca 6600 
bcagagabbb bgagacacaa cgtggcbbtc cccccccccc cabbattgaa gcabtbabca 6660 
gggbbabbgb cbcatgagcg gabacababb bgaabgbabt bagaaaaaba aacaaabagg 6720 
ggbbccgcgc acabbtcccc gaaaagbgcc accbgacgtc baagaaacca bbatbabcab 6780 
gacabbaacc babaaaaaba ggcgbabcac gaggcccbbb cgbc 6824 



<210> 113 
<211> 6695 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
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<400> 113 

bcgcgcgbbb cggbgatgac ggtgaaaacc 
cagcbbgbcb gtaagcggat gccgggagca 
ttggcgggtg bcggggcbgg cttaactatg 
accatatgcg gtgtgaaata ccgcacagat 
ctattggcca ttgcatacgt tgtatccata 
tccaacatta ccgccatgtt gacabtgabb 
ggggtcabba gtbcabagcc cabababgga 
cccgcctggc tgaccgccca acgacccccg 
catagtaacg ccaataggga ctttccattg 
tgcccacttg gcagbacabc aagtgtatca 
tgacggtaaa tggcccgcct ggcabtabgc 
bbggcagtac atctacgtah tagtcatcgc 
catcaatggg cgbggatagc ggtbtgactc 
cgtcaabggg agtttgtttt ggcaccaaaa 
ctccgcccca ttgacgcaaa bgggcggtag 
agctcgttta gtgaaccgtc agatcgcctg 
tagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagtgacg taagtaccgc 
tcbbabgcab gctatactgt btbbggcbbg 
taggtgatgg tatagcttag ccbabaggbg 
tattggtgac gatactttcc attactaatc 
babbggcbab atgccaatac bcbgbccbbc 
ggatggggtc ccabbbabba tttacaaatt 
cgcagfctttt abbaaacaba gcgbgggabc 
cabgggctct tcbccggbag cggcggagcb 
agcggcbcat ggbcgctcgg cagcbccbbg 
agcacaabgc ccaccaccac cagbgbgccg 
gaaaatgagc gbggagabbg ggcbcgcacg 
gcagaagaag atgcaggcag cbgagbbgbb 
gbtgcggtgc bgbtaacggb ggagggcagb 
cgcgccacca gacabaabag cbgacagact 
bgcagtcacc gbcgtcgaca cgbgbgabca 
ccabgcgcgb ggagggcabc cagcgcaacb 
gcbbctggab ggtgatgatc bacaacgtgc 
gcgbgcccgt gtggaaggag gccaagacca 
acgacgccga ggbgcacaac gbgbgggcca 
cccaggagab ggbgcbggag aacgbgaccg 
tggagcagab gcaccaggac abcabcagcc 
agcbgacccc ccbgbgcgtg acccbgcacb 
gcgccabcac ccaggccbgc cccaaggbga 
cccccgccgg cbacgccabc ctgaagbgca 
gcaacaacgb gagcaccabc cagtgcaccc 
bgcbgcbgaa cggcagccbg gccgaggagg 
acaacgbgaa gaccabcabc gbgcaccbga 
ccaacaacaa cacccgcaag agcabccgca 
gcgagabcab cggcgacabc cgccaggccc 
agaccctgca ggaggbgggc aagaagcbgg 
bcgccaagca cagcggcggc gaccbggaga 
agbbcbbcba cbgcaacacc agcagccbgb 
agaacabcac cggcaccgag aacagcabca 
tcaacatgbg gcagggcgbg ggccgcgcca 
ccbgccgcag caacabcacc ggccbgctgc 
acaccgagab cbbccgcccc gagggcggcg 



bcbgacacab gcagcbcccg gagacggbca 60 
gacaagcccg tcagggcgcg bcagcgggtg 12 0 
cggcabcaga gcagatbgba cbgagagtgc 180 
gcgtaaggag aaaabaccgc abcagabbgg 240 
bcataababg bacabbbaba bbggcbcatg 300 
abbgacbagb babbaabagb aabcaabbac 360 
gbtccgcgtb acabaacbba cggtaaabgg 420 
cccabbgacg bcaabaatga cgbabgbtcc 480 
acgtcaatgg gbggagtatb bacggbaaac 540 
babgccaagb acgcccccba bbgacgbcaa 600 
ccagbacatg accbtabggg acttbcctac 660 
babbaccabg gbgabgcggb bbbggcagta 720 
acggggabbb ccaagbcbcc accccabbga 780 
tcaacgggac bbtccaaaab gbcgbaacaa 84 0 
gcgtgbacgg bgggaggbcb ababaagcag 900 
gagacgccab ccacgcbgbb tbgaccbcca 960 
cggccgggaa cggbgcatbg gaacgcggab 102 0 
cbatagactc bataggcaca ccccbbbggc 1080 
gggccbabac acccccgcbt cctbabgcta 1140 
tgggbbabtg accabbabbg accactcccc 1200 
cabaacabgg cbcbbbgcca caacbabcbc 1260 
agagacbgac acggacbcbg tabbbbbaca 13 20 
cacababaca acaacgccgb cccccgbgcc 1380 
bccacgcgaa bctcgggtac gtgbbccgga 144 0 
bccacabccg agcccbggtc ccatgccbcc 1500 
cbcctaacag bggaggccag acbbaggcac 1560 
cacaaggccg bggcggbagg gtabgbgbcb 1620 
gcbgacgcag abggaagacb baaggcagcg 1680 
gbatbcbgab aagagbcaga ggbaacbccc 174 0 
gtagbcbgag cagtacbcgb bgcbgccgcg 1800 
aacagacbgb bccbbbccab gggtcbbbbc i860 
gababcgcgg ccgcbcbaga gababcgcca 1920 
ggaagcagtg gbggabcbgg ggcabccbgg 1980 
gcggcaaccb gbgggbgacc gtgbacbacg 2040 
ccctgbtcbg cgccagcgac gccaaggccb 2100 
cccacgcctg cgbgcccacc gaccccaacc 2160 
agaacbbcaa cabgbgggag aacgacabgg 2220 
bgbgggacca gagccbgaag cccbgcgbga 2280 
gcagcaaccg abcgabcaac bgcaacacca 234 0 
gcbtcgaccc cabccccabc cacbacbgcg 2400 
acaacaagac cbbcaacggc accggccccb 2460 
acggcaccaa gcccgtggtg agcacccagc 2520 
agabcabcab ccgcagcaag aaccbgaccg 2580 
acgagagcgb ggagabcaac bgcacccgcc 264 0 
tcggccccgg ccaggccbbc bacgccaccg 2700 
acbgcaacab cagccgcacc gccbggaaca 2760 
ccgagcactb ccccaacaag gccabcaagt 2820 
bcaccaccca cagcttcaac bgccgcggcg 2880 
bcaacagcac ctacaccccc aacagcaccg 294 0 
bcaccabccc cbgccgcabc aagcagabca 3000 
tgtacgcccc ccccatcgag ggcabccbga 3060 
bgacccgcga cggcggcacc ggcabgcacg 3120 
acabgcgcga caacbggcgc agcgagcbgb 3180 
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acaagtacaa ggtggtggag atcaagcccc bgggcabcgc ccccaccaag gccaagcbta 324 0 
ccgbgcaggt gcgccagctg ctgagcggca tcgtgcagca gcagagcaac ctgctgcgcg 3300 
ccatcgaggc ccagcagcac atgctgcagc bgaccgbgbg gggcatcaag cagctgcaga 3360 
cccgcgtgct ggccabcgag cgcbaccbgc gcgaccagca gctcgaggac atctggaaca 3420 
acatgacctg gatgcagtgg gaccgcgaga tcagcaacta caccaacacc atctaccgcc 3480 
bgcbggagga cagccagaac cagcaggaga agaacgagca ggacctgchg gccctggaca 354 0 
agtggcagaa cctgtggacc tggbbcggca bcaccaacbg gctgtggtac atcaagatct 3600 
tcatcaagat cgtgggcggc ctgatcggcc bgcgcabcab cttcgccgtg cbgagcabcb 3660 
gaacacgbgg gatccagatc tgcbgtgccb bcbagbbgcc agccatctgt tgtttgcccc 3720 
tcccccgbgc cbtccbtgac cctggaaggt gccactccca cbgbccbbbc cbaabaaaat 3780 
gaggaaattg catcgcattg tcbgagbagg tgtcattcta ttctgggggg tggggbgggg 3840 
caggacagca agggggagga ttgggaagac aatagcaggc atgctgggga tgcggtgggc 3900 
tctatgggta cccaggtgct gaagaatbga cccggttcct cctgggccag aaagaagcag 3960 
gcacabcccc bbcbctgbga cacaccctgt ccacgccccb ggbbcbbagb tccagcccca 4020 
ctcataggac acbcabagcb caggagggct ccgccttcaa tcccacccgc taaagtactt 4080 
ggagcggtct cbcccbcccb catcagccca ccaaaccaaa ccbagccbcc aagagtggga 4140 
agaaattaaa gcaagatagg ctattaagtg cagagggaga gaaaabgcct ccaacatgtg 4200 
aggaagtaat gagagaaatc atagaattbb aaggccabga tbtaaggcca bcatggccbb 4260 
aatcttccgc btccbcgctc actgactcgc tgcgctcggt cgbbcggcbg cggcgagcgg 4320 
tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 4380 
agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg baaaaaggcc gcgttgctgg 444 0 
cgttttbcca taggctccgc ccccctgacg agcabcacaa aaabcgacgc tcaagtcaga 4 500 
ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctcccbcg 4560 
tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt cbcccbbcgg 4620 
gaagcgtggc gcttfcctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 4680 
gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 4740 
gtaactatcg tcttgagtcc aacccggbaa gacacgactt atcgccactg gcagcagcca 4800 
ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 4860 
ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag 4920 
ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 4980 
gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 504 0 
ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 5100 
tggtcatgag attatcaaaa aggatcttca cctagatccfc tttaaattaa aaatgaagtt 5160 
ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 5220 
gtgaggcacc tatctcagcg atctgtcbat btcgbbcatc cabagbtgcc tgacbcgggg 5280 
gggggggg°g cbgaggbcbg ccbcgbgaag aaggbgtbgc bgactcabac caggcctgaa 534 0 
bcgccccatc abccagccag aaagbgaggg agccacggtb gabgagagcb btgttgbagg 54 00 
bggaccagbt ggtgatbtbg aacttbbgcb tbgccacgga acggtcbgcg bbgbcgggaa 5460 
gatgcgbgat cbgabcctbc aactcagcaa aagtbcgabb babbcaacaa agccgccgtc 5520 
ccgtcaagbc agcgbaabgc bcbgccagtg ttacaaccaa bbaaccaatb cbgattagaa 5580 
aaacbcabcg agcatcaaat gaaacbgcaa tbtabtcaba bcaggattat caabaccaba 564 0 
bbtbbgaaaa agccgtbbcb gbaabgaagg agaaaactca ccgaggcagb bccabaggab 5700 
ggcaagatcc bggbatcggb cbgcgabbcc gacbcgbcca acabcaabac aaccbabbaa 5760 
bbtccccbcg tcaaaaabaa ggbtabcaag tgagaaabca ccabgagbga cgacbgaabc 5820 
cggbgagaab ggcaaaagcb babgcabbbc tbbccagacb bgbbcaacag gccagccabt 5880 
acgcbcgbca bcaaaabcac bcgcabcaac caaaccgbba btcabbcgbg abtgcgccbg 5940 
agcgagacga aatacgcgab cgcbgbbaaa aggacaabba caaacaggaa tcgaabgcaa 6000 
ccggcgcagg aacacbgcca gcgcabcaac aababbtbca ccbgaabcag gatabtctbc 6060 
baabaccfcgg aabgctgtbt bcccggggab cgcag.bggbg agbaaccabg catcabcagg 6120 
agbacggaba aaabgcbbga bggbcggaag aggcabaaab bccgbcagcc agbbtagbcb 6180 
gaccabcbca bcbgtaacat cabbggcaac gcbaccbtbg ccabgbbbca gaaacaacbc 624 0 
bggcgcabcg ggctbcccab acaabcgaba gabbgbcgca ccbgatbgcc cgacabbabc 6300 
gcgagcccab tbabacccab abaaabcagc abccabgtbg gaatbbaabc gcggccbcga 6360 
gcaagacgbt tcccgbbgaa babggctcab aacaccccbb gtatbacbgb btabgbaagc 6420 
agacagbbbb abbgbbcabg atgabababb bbtabcbtgb gcaabgbaac abcagagabb 6480 
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. ttgagacaca acgtggcttt cccccccccc ccattattga agcatttatc agggttattg 6540 

tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg 6600 

cacatthccc cgaaaagtgc cacchgacgt ctaagaaacc attattatca tgacattaac 6660 

ctataaaaat aggcgtatca cgaggccctt tcgtc 6695 



<210> 114 
<211> 6614 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 114 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 192 0 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 22 80 
agctgacccc cctgtgcgtg accctgcact gcagcaaccg atcgatcaac tgcaacacca 2340 
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gcgccatcac ccaggccbgc cccaaggtga 
cccccgccgg ctacgccatc ctgaagtgca 
gcaacaacgt gagcaccatc cagtgcaccc 
tgctgctgaa cggcagcctg gccgaggagg 
acaacgtgaa gaccatcatc gtgcaccfcga 
gabcggccca cbgcaacabc agccgcaccg 
agaagctggc cgagcacttc cccaacaagg 
acctggagat caccacccac agcttcaact 
gcagcctgtt caacagcacc tacaccccca 
acagcatcat caccatcccc tgccgcatca 
gccgcgccat gtacgccccc cccatcgagg 
gccbgcbgcb gacccgcgac ggcggcaccg 
agggcggcga catgcgcgac aactggcgca 
tcaagcccct gggcatcgcc cccaccaagg 
tgagcggcat cgtgcagcag cagagcaacc 
tgctgcagct gaccgtgtgg ggcatcaagc 
gctacctgcg cgaccagcag ctcgaggaca 
accgcgagat cagcaactac accaacacca 
agcaggagaa gaacgagcag gaccbgctgg 
ggttcggcat caccaactgg ctgtggtaca 
bgabcggccb gcgcatcatc ttcgccgtgc 
gctgtgcctt ctagttgcca gccatctght 
ctggaaggbg ccactcccac tgtcctttcc 
ctgagtaggt gbcabbcbab bcbggggggb 
tgggaagaca atagcaggca tgctggggat 
aagaattgac ccggttcctc ctgggccaga 
acaccctgtc cacgcccctg gfctcttagtt 
aggagggcbc cgccttcaat cccacccgcb 
atcagcccac caaaccaaac ctagcctcca 
tattaagtgc agagggagag aaaabgccbc 
tagaatttta aggccatgat ttaaggccat 
ctgactcgch gcgcbcggbc gttcggctgc 
taatacggtt abccacagaa tcaggggata 
agcaaaaggc caggaaccgt aaaaaggccg 
cccctgacga gcabcacaaa aatcgacgct 
tataaagata ccaggcgttt ccccctggaa 
bgccgcbbac cggatacctg tccgcctttc 
gctcacgcbg taggtatctc agttcggtgt 
acgaaccccc cgbbcagccc gaccgcbgcg 
acccggtaag acacgactta tcgccactgg 
cgaggbabgb aggcggtgct acagagttct 
gaagaacagt abbbggbabc tgcgctctgc 
gtagctcttg abccggcaaa caaaccaccg 
agcagattac gcgcagaaaa aaaggabctc 
ctgacgctca gtggaacgaa aacbcacgbb 
ggatcttcac ctagatcctt bbaaabbaaa 
atgagtaaac bbggbcbgac agttaccaat 
bcbgbcbabb tcgttcatcc atagttgcct 
ctcgtgaaga aggtgttgct gactcatacc 
aagtgaggga gccacggttg abgagagcbb 
acbbbbgcbb tgccacggaa cggtctgcgt 
actcagcaaa agbbcgabbb attcaacaaa 
ctgccagtgt tacaaccaab baaccaabtc 
aaacbgcaab btabbcabab caggatbatc 
baatgaagga gaaaactcac cgaggcagtb 



gcbbcgaccc catccccabc cacbacbgcg 2400 
acaacaagac cbtcaacggc accggccccb 2460 
acggcaccaa gcccgbggbg agcacccagc 2520 
agabcatcab ccgcagcaag aaccbgaccg 2580 
acgagagcgb ggagabcaac bgcacccgcc 264 0 
ccbggaacaa gacccbgcag gaggbgggca 2700 
ccabcaagbb cgccaagcac agcggcggcg 2760 
gccgcggcga gbbcbbctac bgcaacacca 2820 
acagcaccga gaacabcacc ggcaccgaga 2880 
agcagabcab caacabgbgg cagggcgbgg 2 94 0 
gcabcctgac cbgccgcagc aacabcaccg 3000 
gcabgcacga caccgagabc bbccgccccg 3 060 
gcgagcbgba caagbacaag gbggtggaga 3120 
ccaagcbbac cgtgcaggbg cgccagcbgc 3180 
bgcbgcgcgc cabcgaggcc cagcagcaca 3240 
agcbgcagac ccgcgbgctg gccatcgagc 3300 
bctggaacaa catgacctgg abgcagbggg 33 6 0 
bctaccgccb gcbggaggac agccagaacc 342 0 
cccbggacaa gbggcagaac cbgbggaccb 3480 
bcaagabcbb cabcaagabc gbgggcggcc 3540 
bgagcabcbg aacacgbggg abccagabcb 3600 
gbbbgcccct cccccgtgcc bbccbbgacc 3660 
taataaaabg aggaaabbgc abcgcatbgb 3720 
99ggtggggc aggacagcaa gggggaggab 3780 
gcggbgggcb cbabgggbac ccaggbgcbg 3840 
aagaagcagg cacabcccct bctcbgbgac 3900 
ccagccccac bcabaggaca cbcabagcbc 3960 
aaagtactbg gagcggbcbc bcccbcccbc 4020 
agagbgggaa gaaatbaaag caagabaggc 4080 
caacatgtga ggaagbaabg agagaaabca 414 0 
cabggcctba atcbbccgcb tccbcgcbca 4200 
ggcgagcggt abcagcbcac tcaaaggcgg 4260 
acgcaggaaa gaacabgbga gcaaaaggcc 4 32 0 
cgbtgcbggc gtbbbbccab aggcbccgcc 4380 
caagbcagag gbggcgaaac ccgacaggac 4440 
gcbcccbcgb gcgcbcbccb gbbccgaccc 4500 
bcccbbcggg aagcgbggcg cbbbcbcaba 4560 
aggbcgbbcg cbccaagcbg ggcbgbgbgc 4620 
ccbbabccgg baacbabcgb cbbgagbcca 4680 
cagcagccac bggbaacagg abbagcagag 474 0 
bgaagbggbg gcctaacbac ggcbacacta 4 8 00 
bgaagccagb baccbbcgga aaaagagbtg 4860 
cbggbagcgg bggbtbbbbt gbbbgcaagc 4920 
aagaagabcc bbbgabcbbb bcbacggggb 4980 
aagggabbbb ggtcabgaga bbabcaaaaa 5040 
aabgaagbtb baaabcaabc baaagbabab 5100 
gcbbaabcag bgaggcaccb atcbcagcga 5160 
gacbcggggg gggggggcgc bgaggbcbgc 5220 
aggccbgaab cgccccatca bccagccaga 5280 
bgbbgbaggb ggaccagbbg gbgabbbbga 5340 
tgbcgggaag abgcgbgabc bgabccbbca 5400 
gccgccgbcc cgbcaagbca gcgbaabgcb 5460 
bgabbagaaa aacbcabcga gcatcaaabg 5520 
aabaccabab tbbbgaaaaa gccgbbbcbg 5580 
ccabaggabg gcaagabccb ggbabcggbc 5640 
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bgcgabbccg actcgtccaa catcaataca acctattaat ttcccctcgh caaaaataag 5700 
gttatcaagt gagaaatcac catgagtgac gactgaatcc ggtgagaatg gcaaaagctt 5760 
atgcatttct ttccagactt gttcaacagg ccagccabba cgcbcgbcab caaaatcact 5820 
cgcabcaacc aaaccgtbab tcattcghga ttgcgcctga gcgagacgaa abacgcgabc 588 0 
gcbgttaaaa ggacaattac aaacaggaat cgaatgcaac cggcgcagga acactgccag 5940 
cgcatcaaca atattttcac cbgaabcagg atattcttct aatacctgga abgcbgtbbb 6000 
cccggggatc gcagtggtga gtaaccatgc abcabcagga gtacggataa aatgcbbgab 6060. 
ggbcggaaga ggcabaaabb ccgtcagcca gbbbagbcbg accatctcat cbgtaacabc 6120 
attggcaacg ctacctttgc catgtttcag aaacaactct ggcgcatcgg gcttcccaba 6180 
caatcgatag attgtcgcac ctgattgccc gacattatcg cgagcccatt babacccata 6240 
taaatcagca bccabgtbgg aabbbaabcg cggcctcgag caagacgttt cccgttgaat 63 00 
atggctcata acaccccttg tabbacbgbb babgbaagca gacagtttta ttgttcatga 6360 
tgatatattt bbabcbbg.bg caatgtaaca tcagagattt tgagacacaa cgtggctttc 6420 
cccccccccc cattattgaa gcabbbabca gggbbatbgb cbcatgagcg gabacababt 64 80 
bgaabgbabt bagaaaaata aacaaabagg ggbbccgcgc acatttcccc gaaaagbgcc 6540 
accbgacgbc baagaaacca tbabbatcat gacabbaacc babaaaaaba ggcgbabcac 6600 
gaggcccbbb cgbc 6614 



<210> 115 
<211> 6545 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 115 

bcgcgcgtbb cggbgabgac ggtgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgtcb gbaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbtaacbabg cggcabcaga gcagatbgba cbgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaataccgc abcagabbgg 24 0 
cbabbggcca bbgcabacgb bgbabccata bcabaabatg tacattbaba bbggctcatg 300 
bccaacabba ccgccabgbt gacabbgabb atbgacbagt babbaabagb aabcaabbac 360 
ggggbcabba gbbcatagcc cabababgga gtbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaabaabga cgbabgbbcc 4 80 
cabagtaacg ccaabaggga cbbtccatbg acgbcaabgg gbggagtabt tacggtaaac 540 
tgcccacbbg gcagbacabc aagtgbatca babgccaagt acgcccccba bbgacgtcaa 600 
tgacggbaaa bggcccgccb ggcabbabgc ccagtacabg accbtabggg actbbccbac 660 
tbggcagbac abctacgbab bagbcabcgc babbaccabg gbgabgcggb bbbggcagba 72 0 
cabcaabggg cgbggabagc ggbbbgacbc acggggatbb ccaagbcbcc accccabbga 780 
cgbcaabggg agbttgbbtt ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa B4 0 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb abataagcag 900 
agcbcgbbba gbgaaccgbc agabcgccbg gagacgccab ccacgcbgbb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcabbg gaacgcggab 1020 
bccccgbgcc aagagfcgacg baagbaccgc cbabagacbc babaggcaca ccccbbbggc 1080 
bcbtabgcab gctabacbgb bbbbggcbbg gggccbabac acccccgcbb ccbbabgcba 1140 
baggtgabgg tabagcttag cctabaggbg bgggbbattg accabbatbg accacbcccc 1200 
babtggbgac gabactttcc abbactaatc cabaacatgg ctcbbbgcca caacbabcbc 1260 
babtggcbab abgccaatac bcbgbccbbc agagacbgac acggacbcbg babtbbbaca 1320 
ggabggggtc ccabbbabba bbbacaaabb cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbbbtb abbaaacaba gcgbgggabc bccacgcgaa bcbcgggbac gtgbbccgga 1440 
cabgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
agcggcbcab ggtcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
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agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gbggagabbg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gbbgcggbgc bgbbaacggt ggagggcagt 
cgcgccacca gacataatag cbgacagacb 
tgcagtcacc gtcgtcgaca cgtgtgatca 
ccatgcgcgt ggagggcabc cagcgcaact 
gcttctggat ggbgabgabc tacaacgtgc 
gcgbgcccgb gbggaaggag gccaagacca 
acgacgccga ggtgcacaac gtgtgggcca 
cccaggagat ggtgctggag aacgtgaccg 
tggagcagat gcaccaggac atcatcagcc 
agctgacccc cctgtgcgtg acccbgcacb 
gcgccatcac ccaggcctgc cccaaggtga 
cccccgccgg cbacgccabc ctgaagtgca 
gcaacaacgt gagcaccatc cagtgcaccc 
bgcbgcbgaa cggcagccbg gccgaggagg 
acaacgtgaa gaccabcabc gtgcacctga 
gatcggccca cbgcaacabc agccgcaccg 
agaagcbggc cgagcacttc cccaacaagg 
acctggagat caccacccac agcbtcaacb 
gatcgatccc ctgccgcatc aagcagatca 
bgbacgcccc ccccatcgag ggcabccbga 
bgacccgcga cggcggcacc ggcatgcacg 
acatgcgcga caactggcgc agcgagcbgb 
tgggcatcgc ccccaccaag gccaagcbba 
bcgbgcagca gcagagcaac ctgctgcgcg 
tgaccgtgtg gggcatcaag cagctgcaga 
gcgaccagca gctcgaggac abcbggaaca 
tcagcaacta caccaacacc afcctaccgcc 
agaacgagca ggacctgctg gccctggaca 
bcaccaacbg gctgtggtac atcaagatct 
tgcgcatcat cttcgccgtg cbgagcabcb 
tcbagbbgcc agccatctgt tgtttgcccc 
gccactccca ctgtcctttc ctaataaaat 
tgtcattcta ttctgggggg bggggbgggg 
aatagcaggc atgctgggga bgcggbgggc 
cccggttcct cctgggccag aaagaagcag 
ccacgcccct ggttcttagt tccagcccca 
ccgccttcaa tcccacccgc taaagtactt 
ccaaaccaaa cctagcctcc aagagtggga 
cagagggaga gaaaatgcct ccaacatgtg 
aaggccatga tttaaggcca tcatggcctt 
tgcgctcggt cgttcggctg cggcgagcgg 
tatccacaga atcaggggat aacgcaggaa 
ccaggaaccg taaaaaggcc gcgttgctgg 
agcabcacaa aaabcgacgc tcaagtcaga 
accaggcgtt tccccctgga agcbcccbcg 
ccggatacct gtccgccttb cbccctbcgg 
gbaggbabcb cagbbcggbg baggbcgbtc 
ccgttcagcc cgaccgctgc gccbbabccg 
gacacgacbb atcgccactg gcagcagcca 
baggcggbgc tacagagttc bbgaagbggb 
tatttggtat ctgcgcbcbg ctgaagccag 
gatccggcaa acaaaccacc gctggtagcg 



cacaaggccg tggcggbagg gtatgtgtct 1620 
gctgacgcag abggaagact baaggcagcg 1680 
gtabtcbgat aagagtcaga ggbaactccc 1740 
gbagtcbgag cagtacbcgb tgctgccgcg 1800 
aacagacbgt bcctbtccab gggbcbbbbc 1860 
gabatcgcgg ccgcbcbaga gabatcgcca 1920 
ggaagcagbg gbggatctgg ggcatcctgg 1980 
gcggcaacct gbgggbgacc gtgbactacg 2040 
cccbgbbcbg cgccagcgac gccaaggccb 2100 
cccacgccbg cgbgcccacc gaccccaacc 2160 
agaacbbcaa cabgbgggag aacgacabgg 2220 
tgbgggacca gagcctgaag cccbgcgbga .2280 
gcagcaaccg atcgatcaac bgcaacacca 234 0 
gcbbcgaccc cabccccabc cacbacbgcg 2400 
acaacaagac ctbcaacggc accggcccct 2460 
acggcaccaa gcccgbggbg agcacccagc 2520 
agatcabcab ccgcagcaag aaccbgaccg 2580 
acgagagcgt ggagabcaac bgcacccgcc 2640 
cctggaacaa gacccbgcag gaggbgggca 2700 
ccabcaagbb cgccaagcac agcggcggcg 2760 
gccgcggcga gbbctbcbac bgcaacaccc 2820 
bcaacabgbg gcagggcgbg ggccgcgcca 2880 
cctgccgcag caacabcacc ggccbgctgc 2940 
acaccgagat ctbccgcccc gagggcggcg 3 000 
acaagbacaa ggbggbggag abcaagcccc 3060 
ccgbgcaggb gcgccagctg ctgagcggca 3120 
ccabegaggc ccagcagcac atgcbgcagc 3180 
cccgcgbgcb ggccabcgag cgcbaccbgc 3240 
acabgaccbg gabgcagbgg gaccgcgaga 3300 
tgcbggagga cagccagaac cagcaggaga 3360 
agtggcagaa ccbgtggacc bggbbcggca 3420 
bcabcaagab cgbgggcggc ctgabcggcc 3480 
gaacacgbgg gabccagabc bgcbgtgccb 3540 
bcccccgbgc cbbccbtgac ccbggaaggb 3600 
gaggaaabbg cabcgcabbg bcbgagbagg 3 660 
caggacagca agggggagga bbgggaagac 3720 
bcbabgggba cccaggbgcb gaagaabtga 3780 
gcacatcccc btcbcbgtga cacaccctgb 3840 
cbcabaggac actcabagcb caggagggcb 3900 
ggagcggbcb cbcccbcccb cabcagccca 3960 
agaaabbaaa gcaagabagg cbabtaagtg 4020 
aggaagbaab gagagaaabc abagaabbtb 4 080 
aabcbtccgc btccbcgcbc acbgactcgc 4140 
batcagcbca cbcaaaggcg gtaabacggb 4200 
agaacabgbg agcaaaaggc cagcaaaagg 4260 
cgbbbbbcca baggcbccgc cccccbgacg 4320 
ggbggcgaaa cccgacagga ctabaaagab 43 80 
bgcgcbctcc tgbbccgacc ctgccgcbba 4440 
gaagcgbggc gctbbcbcab agcbcacgcb 4500 
gcbccaagcb gggcbgbgbg cacgaacccc 4560 
gbaacbabcg bcbbgagbcc aacccggbaa 4620 
ctggbaacag gabbagcaga gcgaggbabg 4680 
ggccbaacba cggcbacacb agaagaacag 474 0 
btaccbtcgg aaaaagagbb ggbagcbcbb 4800 
gtggbtbbbb bgbbbgcaag cagcagabba 4860 
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cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 4920 
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 4980 
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tahgagtaaa 5040 
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 5100 
ttcgttcatc cahagttgcc tgactcgggg ggggggggcg ctgaggtctg cctcgtgaag 5160 
saggtgttgc tgactcatac caggcctgaa tcgccccatc atccagccag aaagtgaggg 5220 
agccacggtt gatgagagct ttgttgtagg tggaccagtt ggtgattttg aacttttgct 5280 
ttgccacgga acggtctgcg ttgtcgggaa gatgcgtgat ctgatccttc aactcagcaa 5340 
aagttcgatt tattcaacaa agccgccgtc ccgtcaagtc agcgtaatgc tctgccagtg 5400 
ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg agcatcaaat gaaactgcaa 5460 
tttattcata tcaggattat caataccata tttttgaaaa agccgtttct gfcaatgaagg 5520 
agaaaactca ccgaggcagt tccataggat ggcaagatcc tggtatcggt ctgcgattcc 5580 
gactcgtcca acatcaatac aacctattaa tttcccctcg tcaaaaataa ggttatcaag 5640 
tgagaaatca ccatgagtga cgactgaatc cggtgagaat ggcaaaagct tatgcatttc 5700 
tttccagact tgttcaacag gccagccatt acgctcgtca tcaaaatcac tcgcatcaac 5760 
caaaccgtta ttcattcgtg attgcgcctg agcgagacga aatacgcgat cgctgttaaa 5820 
aggacaatta caaacaggaa tcgaatgcaa ccggcgcagg aacacfcgcca gcgcatcaac 58 BO 
aatattttca cctgaatcag gatattcttc taatacctgg aatgctgttt tcccggggat 5940 
cgcagtggtg agtaaccatg catcafccagg agtacggata aaatgcttga tggtcggaag 6000 
aggcahaaat tccgtcagcc agtttagtct gaccatctca tctgtaacat cattggcaac 6060 
gctacctttg ccatgtttca gaaacaactc tggcgcatcg ggcttcccat acaatcgata 6120 
gattgtcgca cctgattgcc cgacattatc gcgagcccat ttatacccat ataaatcagc 6180 
atccatgttg gaatttaahc gcggcctcga gcaagacgtt tcccgttgaa tatggctcat 6240 
aacacccctt gtattactgt ttatgtaagc agacagttth atbgfctcatg atgatatatt 6300 
tttatcttgt gcaatgtaac atcagagaht ttgagacaca acgtggcttt cccccccccc 6360 
ccathattga agcatfctatc agggttattg tctcatgagc ggatacatat ttgaatgtat 6420 
ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt 6480 
ctaagaaacc attatfcatca hgacattaac ctataaaaat aggcgtatca cgaggccctt 6540 
tcgtc 6545 



<210> 116 
<211> 6626 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 116 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgfcaacaa 840 
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ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg bgggaggbcb ababaagcag 900 
agctcgttta gtgaaccgbc agatcgcctg gagacgccat ccacgctgbt tbgaccbcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagbgacg taagtaccgc ctatagactc bataggcaca ccccbbbggc 1080 
bcbtabgcab gctatactgt htttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag ccbabaggbg tgggttattg accattattg accactcccc 1200 
tattggtgac gabacbbbcc abbacbaatc cataacatgg cbcbtbgcca caacbatcbc 1260 
tattggctat atgccaatac bcbgbccbbc agagactgac acggacbcbg tatttttaca 1320 
99 at 9gggtc ccatttabba bbbacaaabb cacababaca acaacgccgt cccccgtgcc 1380 
cgcagbbtbt abbaaacata gcgbgggabc tccacgcgaa tctcgggtac gtgbtccgga 1440 
cabgggcbct bctccggbag cggcggagcb tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagcbcctbg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gcbgacgcag atggaagact taaggcagcg 1680 
gcagaagaag abgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaacbccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gbcgbcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt ggagggcabc cagcgcaacb ggaagcagtg gbggabcbgg ggcatcctgg 1980 
gcbbctggab ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggccb 2100 
acgacgccga ggtgcacaac gtgbgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgfcgaccg agaacttcaa catgtgggag aacgacatgg 222 0 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc cctgtgcgtg accctgcact gcagcaaccg atcgatcaac tgcaacacca 2340 
gcgccatcac ccaggcctgc cccaaggtga gcttcgaccc catccccatc cacfcactgcg 2400 
cccccgccgg ctacgccabc ctgaagtgca acaacaagac ctbcaacggc accggcccct 2460 
gcaacaacgt gagcaccatc cagtgcaccc acggcaccaa gcccgtggtg agcacccagc 2520 
tgctgctgaa cggcagcctg gccgaggagg agatcatcat ccgcagcaag aacctgaccg 2580 
acaacgtgaa gaccatcatc gtgcacctga acgagagcgt ggagatcaac tgcacccgcc 2640 
ccaacaacaa cacccgcaag agcatccgca tcggccccgg ccaggccttc tacgccaccg 2700 
gcgagatcab cggcgacabc cgccaggccc acbgcaacab cagccgcacc gcctggaaca 2760 
agaccctgca ggaggtgggc aagaagcbgg ccgagcactb ccccaacaag gccatcaagt 2820 
bcgccaagca cagcggcggc gacctggaga tcaccaccca cagctbcaac bgccgcggcg 2880 
agtbcbbcba cbgcaacacc cgatcgatcc ccbgccgcab caagcagatc abcaacatgb 2940 
ggcagggcgb gggccgcgcc atgtacgccc cccccabcga gggcabccbg accbgccgca 3000 
gcaacabcac cggccbgcbg cbgacccgcg acggcggcac cggcabgcac gacaccgaga 3060 
bcbtccgccc cgagggcggc gacabgcgcg acaactggcg cagcgagcbg tacaagbaca 312 0 
aggtggbgga gabcaagccc cfcgggcabcg cccccaccaa ggccaagctb accgbgcagg 3180 
bgcgccagcb gcbgagcggc atcgbgcagc agcagagcaa cctgcbgcgc gccabcgagg 324 0 
cccagcagca cabgcbgcag cbgaccgtgt ggggcabcaa gcagctgcag acccgcgbgc 3300 
tggccabcga gcgcbaccbg cgcgaccagc agcbcgagga cabcbggaac aacabgaccb 33 60 
ggatgcagbg ggaccgcgag abcagcaacb acaccaacac catctaccgc cbgctggagg 34 20 
acagccagaa ccagcaggag aagaacgagc aggacctgcb ggcccbggac aagbggcaga 34 80 
acctgbggac ctggttcggc abcaccaacb ggcbgbggba cabcaagabc bbcabcaaga 3540 
bcgtgggcgg ccbgabcggc ctgcgcabca bcbbcgccgb gcbgagcabc bgaacacgtg 3600 
ggatccagab cbgcbgbgcc bbctagbbgc cagccatctg ttgbtbgccc cbcccccgbg 3660 
ccbbccttga cccbggaagg bgccacbccc acbgbccbbt ccbaataaaa bgaggaaatt 3720 
gcatcgcabt gbcbgagtag gbgbcabtcb atbcbggggg gbggggbggg gcaggacagc 3780 
aa 99999 a gg attgggaaga caatagcagg cabgctgggg abgcggbggg cbctabgggb 3840 
acccaggbgc bgaagaatbg acccggbbcc bccbgggcca gaaagaagca ggcacabccc 3900 
cbbcbcbgbg acacaccctg bccacgcccc bggbbcbbag bbccagcccc acbcabagga 3960 
cactcabagc bcaggagggc bccgccbbca abcccacccg ctaaagbact bggagcggbc 4020 
bcbcccbccc bcabcagccc accaaaccaa acctagccbc caagagtggg aagaaabbaa 4080 
agcaagatag gcbabbaagt gcagagggag agaaaabgcc bccaacabgb gaggaagbaa 4140 



- 269 - 



WO 02/32943 PCT/US01/25721 

bgagagaaab catagaattt taaggccatg atttaaggcc abcabggccb taabcbbccg 4200 
cttcctcgct cactgactcg ctgcgctcgg bcgbbcggcb gcggcgagcg gtatcagctc 4 260 
actcaaaggc ggtaatacgg bbabccacag aatcagggga taacgcagga aagaacatgt 4320 
gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgbbbbtcc 4380 
ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa 4440 
acccgacagg actataaaga taccaggcgh ttccccctgg aagctccctc gtgcgctctc 4500 
cbgbbccgac cctgccgctt accggatacc bgbccgccbb. tctcccttcg ggaagcgtgg 4560 
cgctttcbca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgcbccaagc 4620 
bgggctgbgb gcacgaaccc cccgttcagc ccgaccgcbg cgccttatcc ggtaactatc 4680 
gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca 4740 
ggattagcag agcgaggbab gtaggcggtg ctacagagtt cbbgaagbgg bggcctaact 4800 
acggcbacac bagaagaaca gbabbtggba bcbgcgcbcb gctgaagcca gbbaccbbcg 4860 
gaaaaagagb bggtagcbcb bgabccggca aacaaaccac cgcbggbagc ggbggtbbbb 4920 
bbgtbbgcaa gcagcagabb acgcgcagaa aaaaaggatc bcaagaagab ccbbbgabcb 4980 
bbtcbacggg gtctgacgcb cagbggaacg aaaacbcacg bbaagggabb bbggtcabga 5040 
gabtatcaaa aaggabcbtc accbagabcc bbbbaaabta aaaabgaagt bbtaaabcaa 5100 
bcbaaagbab ababgagbaa acbbggtcbg acagbtacca atgcbbaabc agbgaggcac 5160 
cbabcbcagc gabcbgtcba bbbcgbtcab ccabagbbgc cbgacbcggg gggggggggc 5220 
gcbgaggbcb gccbcgtgaa gaaggbgbbg cbgacbcaba ccaggccbga atcgccccab 5280 
catccagcca gaaagbgagg gagccacggb bgabgagagc bbbgbbgbag gbggaccagb 5340 
bggbgabbbb gaacbbbtgc bbbgccacgg aacggbcbgc gbbgbcggga agabgcgbga 5400 
bcbgabccbb caacbcagca aaagbtcgab bbabbcaaca aagccgccgb cccgbcaagb 5460 
cagcgbaabg cbcbgccagb gbbacaacca abtaaccaab tcbgatbaga aaaactcabc 5520 
gagcabcaaa bgaaacbgca abbbabbcab abcaggabba bcaabaccab atbtbbgaaa 5580 
aagccgbbbc bgbaabgaag gagaaaacbc accgaggcag bbccatagga bggcaagabc 5640 
cbggtabcgg bcbgcgabbc cgacbcgbcc aacabcaaba caaccbabba abbbccccbc 5700 
gbcaaaaata aggtbabcaa gbgagaaatc accatgagbg acgacbgaab ccggbgagaa 5760 
bggcaaaagc bbabgcabbb cbbbccagac bbgbbcaaca ggccagccab bacgcbcgbc 582 0 
abcaaaabca cbcgcabcaa ccaaaccgbb atbcabbcgt gabbgcgccb gagcgagacg 588 0 
aaatacgcga bcgcbgbbaa aaggacaabb acaaacagga abcgaabgca accggcgcag 594 0 
gaacacbgcc agcgcabcaa caababbbbc accbgaabca ggababbctb cbaabaccbg 6000 
gaabgctgbb bbcccgggga bcgcagtggb gagtaaccab gcabcabcag gagbacggab 6060 
aaaatgcbbg abggbcggaa gaggcataaa bbccgbcagc cagbbbagbc bgaccabcbc 6120 
atcbgbaaca bcabbggcaa cgcbaccbtb gccabgbbbc agaaacaacb cbggcgcabc 6180 
gggcbbccca bacaabcgab agabbgbcgc accbgabbgc ccgacabbab cgcgagccca 6240 
tbbabaccca tabaaabcag cabccabgbb ggaabbbaab cgcggccbcg agcaagacgb 6300 
bbcccgbbga abatggcbca baacaccccb tgtabbacbg bbbabgbaag cagacagtbb 6360 
babbgbbcab gabgababab bbbbabcbbg tgcaabgtaa cabcagagab bbbgagacac 6420 
aacgbggcbb bccccccccc cccabbabbg aagcabbtab cagggbbabb gtcbcabgag 6480 
cggabacaba bbbgaabgba bbbagaaaaa baaacaaaba ggggbbccgc gcacatbbcc 6540 
ccgaaaagbg ccaccbgacg bcbaagaaac cabbabbabc atgacabbaa ccbataaaaa 6600 
baggcgbabc acgaggcccb bbcgbc 6526 



<210> 117 
<211> 6743 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 117 

bcgcgcgbbb cggbgatgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 



- 270 - 



WO 02/32943 PCT/US01/25721 

cagcttgtct gtaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggbg 120 
tbggcgggbg tcggggctgg ctbaacbabg cggcabcaga gcagattgta cbgagagtgc 180 
accatatgcg gtgbgaaaba ccgcacagab gcgtaaggag aaaabaccgc abcagabtgg 240 
cbatbggcca ttgcabacgt tgtabccaba tcataabatg bacatbtaba tbggctcatg 300 
bccaacabba ccgccabgbt gacabbgabb attgacbagb babbaatagb aabcaabtac 360 
ggggtcabba gttcabagcc catabatgga gbbccgcgtb acataacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaataatga cgtatgbbcc 480 
cabagbaacg ccaabaggga cbbbccabbg acgtcaabgg gtggagtabb tacggtaaac 540 
bgcccactbg gcagbacabc aagbgbabca babgccaagb acgcccccta ttgacgbcaa 600 
bgacggtaaa tggcccgccb ggcatbabgc ccagbacabg accttabggg acbbbcctac 660 
tbggcagbac abcbacgbab bagbcatcgc babbaccabg gbgatgcggb tbbggcagta 720 
catcaatggg cgbggabagc ggttbgacbc acggggabbt ccaagtctcc accccabtga 780 
cgtcaabggg agbbbgbbbt ggcaccaaaa bcaacgggac bttccaaaat gtcgbaacaa 840 
cbccgcccca tbgacgcaaa tgggcggbag gcgtgbacgg bgggaggbct atabaagcag 900 
agcbcgtbta gbgaaccgbc agatcgcctg gagacgccab ccacgcbgbb bbgacctcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggab 1020 
bccccgbgcc aagagtgacg taagbaccgc cbatagactc tataggcaca ccccbbbggc 1080 
bcbbatgcab gcbatacbgt ttbbggcbbg gggcctatac acccccgcbt cctfcabgcba 1140 
taggbgabgg tabagctbag ccbabaggbg bgggbbabbg accattabtg accactcccc 1200 
tatbggbgac gabacttbcc abbacbaabc cabaacabgg ctctbtgcca caactabctc 1260 
batbggcbab abgccaatac bctgbccbbc agagactgac acggacbcbg babtbtbaca 1320 
ggabggggbc ccabtbatta bbtacaaabb cacatataca acaacgccgt cccccgbgcc 1380 
cgcagbttbb abbaaacaba gcgbgggabc bccacgcgaa tcbcgggbac gbgbtccgga 1440 
catgggcbcb tcbccggtag cggcggagcb bccacatccg agcccbggtc ccabgccbcc 1500 
agcggcbcab ggbcgctcgg cagctcctbg ctccbaacag tggaggccag acttaggcac 1560 
agcacaabgc ccaccaccac cagtgbgccg cacaaggccg bggcggtagg gbabgbgbcb 162 0 
gaaaabgagc gbggagatbg ggctcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag ctgagbbgtb gbabbcbgab aagagbcaga ggbaacbccc 1740 
gbtgcggtgc bgbbaacggb ggagggcagb gbagtcbgag cagbacbcgb tgctgccgcg 1800 
cgcgccacca gacabaatag cbgacagacb aacagacbgt tccbtbccab gggbcbbbbc 1860 
tgcagbcacc gbcgbcgaca cgbgbgabca gabatcgcgg ccgcbcbaga gababcgcca 1920 
ccabgcgcgb ggagggcabc cagcgcaact ggaagcagbg gbggatcbgg ggcatccbgg 1980 
gctbcbggab ggbgatgabc bacaacgbgc gcggcaaccb gbgggtgacc gbgbactacg 2040 
gcgtgcccgb gtggaaggag gccaagacca cccbgbbcbg cgccagcgac gccaaggccb 2100 
acgacgccga ggbgcacaac gtgtgggcca cccacgcctg cgbgcccacc gaccccaacc 2160 
cccaggagab ggtgcbggag aacgbgaccg agaacbbcaa catgbgggag aacgacabgg 2220 
tggagcagab gcaccaggac abcabcagcc tgbgggacca gagccbgaag cccbgcgbga 2280 
agcbgacccc ccbgbgcgbg acccbgcacb gcagcaaccg abcgaagaac tgcagcbbca 2340 
acabgaccac cgaggbgcgc gacaagcgcg agaaggbgca cgccctgtbc taccgccbgg 2400 
acabcgbgcc ccbgaagaac gagagcagca acaccagcgg cgactaccgc cbgabcaacb 2460 
gcaacaccag cgccatcacc caggccbgcc ccaaggtgag ctbcgacccc atccccabcc 2520 
acbacbgcgc ccccgccggc bacgccabcc bgaagbgcaa caacaagacc tbcaacggca 2580 
ccggccccbg caacaacgbg agcaccabcc agbgcaccca cggcaccaag cccgtggbga 2640 
gcacccagcb gcbgctgaac ggcagccbgg ccgaggagga gabcabcatc cgcagcaaga 2700 
accbgaccga caacgtgaag accabcabcg bgcaccbgaa cgagagcgtg gagatcaacb 2760 
gcacccgccg abcggcccac bgcaacabca gccgcaccgc cbggaacaag acccbgcagg 2820 
aggbgggcaa gaagcbggcc gagcacbbcc ccaacaaggc cabcaagtbc gccaagcaca 2880 
gcggcggcga ccbggagabc accacccaca gcbbcaacbg ccgcggcgag tbctbcbacb 2940 
gcaacaccag cagccbgbbc aacagcaccb acacccccaa cagcaccgag aacabcaccg 3000 
gcaccgagaa cagcabcabc accabcccct gccgcabcaa gcagabcatc aacabgbggc 3060 
agggcgbggg ccgcgccabg bacgcccccc ccabcgaggg cabccbgacc tgccgcagca 3120 
acabcaccgg ccbgcbgctg acccgcgacg gcggcaccgg cabgcacgac accgagabcb 3180 
bccgccccga gggcggcgac atgcgcgaca acbggcgcag cgagctgbac aagtacaagg 3240 
bggbggagab caagccccbg ggcabcgccc ccaccaaggc caagcbbacc gtgcaggfcgc 3300 
gccagcbgcb gagcggcabc gtgcagcagc agagcaacct gcbgcgcgcc abcgaggccc 3360 
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agcagcacat gctgcagctg accgtgtggg gcatcaagca gctgcagacc cgcgtgctgg 3420 
ccatcgagcg ctacctgcgc gaccagcagc tcgaggacat ctggaacaac atgacctgga 34 80 
tgcagtggga ccgcgagatc agcaactaca ccaacaccat ctaccgcctg ctggaggaca 3540 
gccagaacca gcaggagaag aacgagcagg acctgctggc cctggacaag tggcagaacc 3600 
tgtggacctg gttcggcatc accaactggc tgtggtacat caagatcttc atcaagatcg 3660 
tgggcggcct gatcggcctg cgcatcatct tcgccgtgct gagcatctga acacgtggga 3720 
tccagatctg ctgtgccbtc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 3780 
tcchtgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca 3840 
tcgcattgtc tgagtaggtg tcattctatt cbggggggtg gggtggggca ggacagcaag 3900 
gggg a gg att gggaagacaa tagcaggcat gctggggatg cggtgggctc tatgggtacc 3960 
caggtgctga agaattgacc cggttcctcc tgggccagaa agaagcaggc acatcccctt 4020 
ctctgtgaca caccctgtcc acgcccctgg ttcttagttc cagccccact cataggacac 4080 
tcatagctca ggagggctcc gccttcaatc ccacccgcta aagtacttgg agcggtctct 4140 
ccctccctca tcagcccacc aaaccaaacc tagcctccaa gagtgggaag aaattaaagc 4200 
aagataggct attaagtgca gagggagaga aaatgcctcc aacatgtgag gaagtaatga 4260 
gagaaatcat agaattttaa ggccatgatt taaggccatc atggccttaa tcttccgctt 4320 
cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact 4380 
caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag 4440 
caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata 4500 
ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc 4 5 60 
cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg 4620 
ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc 4680 
tttctcatag ctcacgctgt aggtatctca gtfccggtgta ggtcgttcgc tccaagctgg 4740 
gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgfcc 4800 
ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga 4860 
ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg 4920 
gctacactag aagaacagta tttggtatct gcgctctgct gaagccagtt accttcggaa 4980 
aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg 5040 
tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt 5100 
ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg gtcatgagat 5160 
tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct 5220 
aaagtatata tgagtaaact tggtctgaca gttaccaafcg cttaatcagt gaggcaccta 5280 
tctcagcgat ctgtctattt cgttcatcca tagttgcctg actcgggggg ggggggcgct 534 0 
gaggtctgcc tcgtgaagaa ggtgttgctg actcatacca ggcctgaatc gccccatcat 5400 
ccagccagaa agtgagggag ccacggttga tgagagcttt gttgtaggtg gaccagttgg 54 60 
tgattttgaa cttttgcttt gccacggaac ggtctgcgtt gtcgggaaga tgcgtgatct 5520 
gatccttcaa ctcagcaaaa gttcgattta ttcaacaaag ccgccgtccc gtcaagtcag 5580 
cgtaatgctc tgccagtgtt acaaccaatt aaccaattct gattagaaaa actcatcgag 5640 
catcaaatga aactgcaatt tattcatatc aggattatca ataccatatt tttgaaaaag 5700 
ccgtttctgt aatgaaggag aaaactcacc gaggcagttc cataggatgg caagatcctg 5760 
gtatcggtct gcgattccga ctcgtccaac atcaatacaa cctattaatt tcccctcgtc 5820 
aaaaataagg ttatcaagtg agaaatcacc atgagtgacg actgaatccg gtgagaatgg 5880 
caaaagctta tgcatttctt tccagacttg ttcaacaggc cagccattac gctcgtcatc 5940 
aaaatcactc gcatcaacca aaccgttatt cattcgtgat tgcgcctgag cgagacgaaa 6000 
tacgcgatcg ctgttaaaag gacaattaca aacaggaatc gaatgcaacc ggcgcaggaa 6060 
cactgccagc gcatcaacaa tattttcacc tgaatcagga tattcttcta atacctggaa 6120 
tgctgttttc ccggggatcg cagtggtgag taaccatgca tcatcaggag tacggataaa 6180 
atgcttgatg gtcggaagag gcataaattc cgtcagccag tttagtctga ccatctcatc 6240 
tgtaacatca ttggcaacgc tacctttgcc atgtttcaga aacaactctg gcgcatcggg 6300 
cttcccatac aatcgataga ttgtcgcacc tgattgcccg acattatcgc gagcccattt 6360 
atacccatat aaatcagcat ccatgttgga atttaatcgc ggcctcgagc aagacgtttc 6420 
ccgttgaata tggctcataa caccccttgt attactgttt afcgtaagcag acagttttat 6480 
tgttcatgat gatatafcttt tatcttgtgc aatgtaacat cagagatttt gagacacaac 6540 
gtggctttcc cccccccccc attattgaag catttatcag ggttattgtc tcatgagcgg 6600 
atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg 6660 
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aaaagtgcca cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag 6720 
gcghatcacg aggccctttc gtc 6743 



<210> lis 

<211> 6674 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 118 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
99 a tggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 162 0 
gaaaatgagc gtggagattg ggc.tcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgccgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 192 0 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc cctgtgcgtg accctgcact gcagcaaccg atcgaagaac tgcagcttca 234 0 
acatgaccac cgaggtgcgc gacaagcgcg agaaggtgca cgccctgttc taccgcctgg .2400 
acatcgtgcc cctgaagaac gagagcagca acaccagcgg cgactaccgc ctgatcaact 2460 
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gcaacaccag cgccatcacc caggcctgcc 
■ actactgcgc ccccgccggc tacgccatcc 
ccggcccctg caacaacgtg agcaccatcc 
gcacccagct gctgctgaac ggcagcctgg 
acctgaccga caacgtgaag accatcatcg 
gcacccgccg atcggcccac tgcaacatca 
aggbgggcaa gaagctggcc gagcacttcc 
gcggcggcga cctggagahc accacccaca 
gcaacacccg abcgabcccc bgccgcabca 
gccgcgccat gtacgccccc cccatcgagg 
gccbgcbgcb gacccgcgac ggcggcaccg 
a 999 c 99 c 9 a catgcgcgac aactggcgca 
tcaagcccct gggcatcgcc cccaccaagg 
tgagcggcat cgbgcagcag cagagcaacc 
tgctgcagct gaccgtgtgg ggcatcaagc 
gctacctgcg cgaccagcag ctcgaggaca 
accgcgagab cagcaactac accaacacca 
agcaggagaa gaacgagcag gacctgctgg 
ggttcggcat caccaacbgg cbgbggbaca 
tgatcggcct gcgcatcatc bbcgccgbgc 
gctgtgcctt ctagttgcca gccatctgtt 
ctggaaggtg ccactcccac tgtccttfccc 
cbgagbaggb gtcattctat bcbggggggb 
tgggaagaca atagcaggca bgcbggggab 
aagaattgac ccggttcctc ctgggccaga 
acaccctgtc cacgccccbg gbbcbbagbt 
aggagggctc cgccttcaat cccacccgct 
atcagcccac caaaccaaac ctagcchcca 
tattaagtgc agagggagag aaaatgcctc 
tagaatttta aggccatgat ttaaggccat 
ctgactcgct gcgctcggtc gttcggctgc 
taatacggtt abccacagaa tcaggggata 
agcaaaaggc caggaaccgt aaaaaggccg 
cccctgacga gcatcacaaa aatcgacgct 
tataaagata ccaggcgttt ccccctggaa 
tgccgcttac cggabaccbg bccgccbbbc 
gctcacgctg taggtatctc agttcggtgt 
acgaaccccc cgttcagccc gaccgctgcg 
acccggtaag acacgactta bcgccacbgg 
cgaggbabgb aggcggtgct acagagttct 
gaagaacagt atttggtahc tgcgctctgc 
gtagctcbtg atccggcaaa caaaccaccg 
agcagattac gcgcagaaaa aaaggatctc 
ctgacgctca gbggaacgaa aactcacgtt 
ggatcttcac ctagatcctt ttaaattaaa 
atgagtaaac ttggtctgac agttaccaat 
tctgtctatt tcgttcatcc atagbbgccb 
ctcgtgaaga aggtgttgct gactcatacc 
aagbgaggga gccacggttg abgagagcbb 
acttttgctt bgccacggaa cggtctgcgt 
actcagcaaa agttcgattt attcaacaaa 
cbgccagbgb tacaaccaat taaccaattc 
aaacbgcaab bbabbcabab caggattatc 
taatgaagga gaaaactcac cgaggcagbb 
tgcgattccg actcgtccaa catcaataca 



ccaaggtgag cbbcgacccc atccccatcc 2520 
tgaagtgcaa caacaagacc bbcaacggca 2580 
agtgcaccca cggcaccaag cccgbggbga 2640 
ccgaggagga gabcabcatc cgcagcaaga 2700 
tgcacctgaa cgagagcgbg gagatcaact 2760 
gccgcaccgc ctggaacaag accctgcagg 2820 
ccaacaaggc catcaagttc gccaagcaca 2880 
gcttcaactg ccgcggcgag bbcbbcbacb 2940 
agcagabcat caacabgbgg cagggcgtgg 3000 
gcatccbgac cbgccgcagc aacabcaccg 3060 
gcatgcacga caccgagatc bbccgccccg 3120 
gcgagcbgta caagtacaag gbggtggaga 3180 
ccaagcbbac cgbgcaggtg cgccagctgc 3240 
bgcbgcgcgc cabcgaggcc cagcagcaca 3300 
agctgcagac ccgcgbgcbg gccabcgagc 33 60 
bcbggaacaa catgacctgg abgcagtggg 34 20 
bcbaccgccb gctggaggac agccagaacc 3480 
cccbggacaa gtggcagaac cbgbggaccb 3540 
bcaagatcbb cabcaagabc gbgggcggcc 3600 
bgagcatcbg aacacgbggg abccagatcb 3660 
gbbbgccccb cccccgtgcc tbccbbgacc 3720 
baabaaaabg aggaaatbgc abcgcattgb 3780 
ggggbggggc aggacagcaa gggggaggab 3840 
gcggbgggct cbabgggtac ccaggtgcbg 3900 
aagaagcagg cacabcccct tctcbgbgac 3960 
ccagccccac bcabaggaca ctcabagctc 4020 
aaagtacbbg gagcggbcbc bcccbccctc 4080 
agagtgggaa gaaatbaaag caagabaggc 4140 
caacabgbga ggaagtaabg agagaaabca 4200 
cabggccbba atctbccgcb tccbcgcbca 4260 
ggcgagcggb abcagcbcac bcaaaggcgg 4320 
acgcaggaaa gaacabgbga gcaaaaggcc 4 380 
cgbbgcbggc gtbbbbccat aggctccgcc 4440 
caagbcagag gtggcgaaac ccgacaggac 4500 
gcbcccbcgb gcgcbcbccb gtbccgaccc 4560 
bcccbbcggg aagcgbggcg ctbbcbcaba 4620 
aggbcgbfccg cbccaagcbg ggcbgbgbgc 4680 
cctbatccgg baacbabcgb cbbgagtcca 4740 
cagcagccac tggbaacagg abbagcagag 4 800 
bgaagbggbg gcctaacbac ggcbacacba 4860 
bgaagccagb baccbbcgga aaaagagbbg 4920 
cbggbagcgg bggbbbbbbb gbbbgcaagc 4980 
aagaagabcc bbbgatcbbb bcbacggggt 5040 
aagggabbbb ggbcabgaga bbatcaaaaa 5100 
aabgaagbbb baaatcaatc baaagbabab 5160 
gcbbaabcag bgaggcaccb abcbcagcga 5220 
gacbcggggg gggggggcgc bgaggbcbgc 5280 
aggccbgaab cgccccabca bccagccaga 5340 
tgbbgbaggb ggaccagbbg gbgatbbbga 5400 
tgbcgggaag abgcgbgabc bgabccbbca 5460 
gccgccgbcc cgbcaagbca gcgbaabgcb 5520 
bgabbagaaa aacbcabcga gcabcaaabg 5580 
aabaccabab bbbbgaaaaa gccgbbbcbg 5640 
ccabaggabg gcaagabccb ggbabcggbc 5700 
accbabbaab bbccccbcgb caaaaabaag 5760 
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gttatcaagt gagaaatcac catgagtgac gactgaatcc ggtgagaatg gcaaaagctt 582 0 
abgcabbtcb ttccagactt gbtcaacagg ccagccatta cgctcgtcat caaaabcacb 5880 
cgcabcaacc aaaccgbbab bcabbcgbga ttgcgcctga gcgagacgaa atacgcgatc 5940 
gctgbbaaaa ggacaattac aaacaggaat cgaatgcaac cggcgcagga acactgccag 6000 
cgcatcaaca abatbbbcac ctgaatcagg ababbcbbcb aabacctgga abgctgbbbb 6060 
cccggggatc gcagbggbga gtaaccatgc atcatcagga gbacggataa aabgcbtgab 6120 
ggbcggaaga ggcataaatt ccgtcagcca gbbbagtcbg accabctcab ctgtaacatc 6180 
attggcaacg cbaccbbbgc catgtttcag aaacaachct ggcgcabcgg gcttcccata 6240 
caatcgatag attgtcgcac cbgabbgccc gacabbabcg cgagcccatt tatacccata 6300 
taaafccagca tccatgttgg aatttaatcg cggccbcgag caagacgbbb cccgttgaat 6360 
abggcbcaba acaccccttg battacbgbt tatgtaagca gacagbttba bbgbbcabga 6420 
tgatatattt tbatcbtgbg caatgtaaca bcagagabbb tgagacacaa cgtggctttc 6480 
cccccccccc cattattgaa gcatttatca gggttattgt cbcabgagcg gatacatatt 6540 
bgaabgbabt tagaaaaata aacaaabagg ggttccgcgc acabbbcccc gaaaagtgcc 6600 
acctgacgtc taagaaacca ttattatcat gacabbaacc tabaaaaaba ggcgbabcac 6660 
gaggcccbbt cgbc 6674 



<210> 119 
<211> 6755 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 119 

tcgcgcgbbt cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgtct gtaagcggab gccgggagca gacaagcccg bcagggcgcg tcagcgggbg 120 
bbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabtgba cbgagagbgc 180 
accababgcg gtgbgaaata ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
cbabbggcca bbgcabacgb bgbabccaba bcabaababg bacabbtaba bbggcbcabg 300 
tccaacabba ccgccabgtb gacabbgabb abbgacbagb babtaatagb aabcaabbac 360 
ggggbcabba gbbcabagcc cabababgga gbbccgcgbb acabaacbba cggtaaatgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 480 
catagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagbabb bacggbaaac 540 
bgcccacbbg gcagbacabc aagbgbabca batgccaagb acgcccccba bbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagbacatg accbbabggg actbbccbac 660 
bbggcagbac abctacgbab bagbcabcgc batbaccatg gbgabgcggt tttggcagta 720 
catcaatggg cgbggabagc ggbbbgacbc acggggabbb ccaagbcbcc accccabtga 780 
cgtcaabggg agbttgbbbt ggcaccaaaa bcaacgggac btbccaaaat gbcgbaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb ababaagcag 900 
agcbcgtbba gbgaaccgbc agabcgccbg gagacgccab ccacgcbgbb tbgaccbcca 960 
tagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggat 1020 
tccccgbgcc aagagbgacg baagbaccgc ctabagacbc babaggcaca ccccbbbggc 1080 
tctbabgcab gctabacbgt bbtbggcbbg gggccbatac acccccgcbb ccbbabgcba 1140 
baggbgabgg babagcbbag ccbabaggtg bgggbbabbg accabbabbg accacbcccc 1200 
batbggbgac gabacbbbcc abbacbaabc cataacabgg cbcbbbgcca caacbabcbc 1260 
tatbggcbab abgccaabac bctgbccbbc agagacbgac acggacbcbg tatbtbbaca 1320 
ggabggggbc ccatbbabba bbtacaaabt cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbbbbb abbaaacaba gcgbgggabc bccacgcgaa bcbcgggbac gbgbtccgga 1440 
cabgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
agcggcbcab ggbcgctcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgbct 1620 
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gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact fcaaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbtgbt gtattcbgat aagagtcaga ggtaactccc 1740 
9ttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagbactcgb tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt bcctbbccab gggbctbbbc 1860 
tgcagtcacc gtcgtcgaca cgtgtgabca gatatcgcgg ccgcbctaga gatatcgcca 1920 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcabcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gbggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggbgcbggag aacgtgaccg agaacbtcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc bgbgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc ccbgbgcgbg accctgcact gcagcaaccg atcgaagaac tgcagcttca 2340 
acatgaccac cgaggtgcgc gacaagcgcg agaaggtgca cgccctgttc baccgccbgg 24 00 
acatcgtgcc cctgaagaac gagagcagca acaccagcgg cgactaccgc ctgatcaact 2460 
gcaacaccag cgccatcacc caggcctgcc ccaaggtgag cttcgacccc atccccatcc 2520 
actactgcgc ccccgccggc bacgccatcc tgaagtgcaa caacaagacc ttcaacggca 2580 
ccggcccctg caacaacgbg agcaccatcc agtgcaccca cggcaccaag cccgtggtga 2640 
gcacccagct gctgctgaac ggcagcctgg ccgaggagga gatcatcatc • cgcagcaaga 2700 
acctgaccga caacgtgaag accatcatcg tgcacctgaa cgagagcgtg gagatcaact 2760 
gcacccgccc caacaacaac acccgcaaga gcatccgcab cggccccggc caggccbbcb 2 82 0 
acgccaccgg cgagatcatc ggcgacatcc gccaggccca ctgcaacatc agccgcaccg 2880 
cctggaacaa gaccctgcag gaggtgggca agaagctggc cgagcacttc cccaacaagg 2 940 
ccatcaagtt cgccaagcac agcggcggcg acctggagat caccacccac agcttcaact 3000 
gccgcggcga gttcttctac tgcaacaccc gatcgatccc ctgccgcatc aagcagatca 3060 
tcaacatgtg gcagggcgtg ggccgcgcca tgtacgcccc ccccatcgag ggcabcctga 3120 
cctgccgcag caacatcacc ggccbgcbgc bgacccgcga cggcggcacc ggcabgcacg 3180 
acaccgagab cttccgcccc gagggcggcg acatgcgcga caacbggcgc agcgagcbgt 3240 
acaagtacaa ggbggtggag abcaagcccc bgggcabcgc ccccaccaag gccaagcbba 3300 
ccgbgcaggt gcgccagcbg cbgagcggca bcgbgcagca gcagagcaac cbgcbgcgcg 3360 
ccabcgaggc ccagcagcac abgcbgcagc tgaccgtgbg gggcatcaag cagcbgcaga 3420 
cccgcgbgcb ggccabcgag cgctaccbgc gcgaccagca gcbcgaggac abctggaaca 3480 
acatgaccbg gatgcagtgg gaccgcgaga tcagcaacba caccaacacc abctaccgcc 3 540 
t9 ct 99 a 99 a cagccagaac cagcaggaga agaacgagca ggaccbgctg gccctggaca 3 600 
agbggcagaa ccbgbggacc tggtbcggca tcaccaactg gcbgbggtac abcaagabct 3660 
bcabcaagab cgtgggcggc cbgabcggcc bgcgcabcab cbbcgccgbg ctgagcabcb 3720 
gaacacgtgg gabccagabc bgctgbgccb bcbagtbgcc agccabcbgb bgtbbgcccc 3780 
bcccccgbgc cbbccbbgac ccbggaaggb gccacbccca ctgtccbbtc cbaabaaaab 3840 
gaggaaabbg catcgcabbg bcbgagtagg bgbcattcba btctgggggg bggggbgggg 3900 
caggacagca agggggagga ttgggaagac aabagcaggc atgctgggga bgcggbgggc 3960 
bctabgggba cccaggbgcb gaagaatbga cccggbbccb ccbgggccag aaagaagcag 4020 
gcacabcccc ttctctgbga cacacccbgb ccacgcccct ggbbcbtagb bccagcccca 4080 
cbcabaggac acbcatagcb caggagggcb ccgcctbcaa bcccacccgc taaagbacbb 4140 
ggagcggtcb ctcccbcccb cabcagccca ccaaaccaaa ccbagccbcc aagagbggga 4 200 
agaaabbaaa gcaagabagg cbabbaagbg cagagggaga gaaaabgccb ccaacabgtg 4260 
aggaagtaat gagagaaabc abagaatbbt aaggccabga bttaaggcca tcabggccbb 4320 
aabcbbccgc btccbcgcbc actgacfccgc bgcgcbcggb cgbbcggcbg cggcgagcgg 4380 
babcagcbca ctcaaaggcg gbaabacggb babccacaga abcaggggab aacgcaggaa 4440 
agaacabgbg agcaaaaggc cagcaaaagg ccaggaaccg baaaaaggcc gcgtbgcbgg 4500 
cgbbbbtcca baggcbccgc cccccbgacg agcabcacaa aaabcgacgc bcaagtcaga 4560 
ggbggcgaaa cccgacagga ctabaaagab accaggcgbb bcccccbgga agcbcccbcg 4620 
bgcgctcbcc tgtbccgacc cbgccgcbba ccggabaccb gbccgccbbb cbcccbbcgg 4680 
gaagcgbggc gcbbbcbcat agcbcacgcb gbaggbabcb cagbtcggbg baggtcgbbc 4740 
gcbccaagcb gggcbgbgbg cacgaacccc ccgbbcagcc cgaccgctgc gccbbabccg 4 800 
gbaacbatcg bcbbgagbcc aacccggtaa gacacgactb abcgccactg gcagcagcca 4860 
cbggbaacag gabtagcaga gcgaggbabg baggcggtgc tacagagbbc bbgaagbggb 4920 
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ggcctaacta cggctacact agaagaacag babbbggbab ctgcgctctg ctgaagccag 4980 
ttaccttcgg aaaaagagtb ggtagctctt gatccggcaa acaaaccacc gctggtagcg 5040 
gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggabct caagaagatc 5100 
cbbbgabcbt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggabbb 5160 
tggtcatgag attatcaaaa aggatcbbca cctagatcct tttaaatbaa aaatgaagtt 5220 
ttaaatcaab ctaaagtaba tatgagfcaaa cbtggbcbga cagbtaccaa tgcbtaatca 5280 
gtgaggcacc tabctcagcg abctgfccbat tbcgbbcatc catagbbgcc tgactcgggg 5340 
99999999cg ctgaggtctg ccbcgbgaag aaggtgbtgc tgactcabac caggccbgaa 5400 
tcgccccabc atccagccag aaagbgaggg agccacggtb gabgagagct tbgtbgbagg 5460 
tggaccagbb ggbgatbttg aactbbbgct tbgccacgga acggtcbgcg tbgbcgggaa 5520 
gabgcgbgab ctgatccbtc aacbcagcaa aagtbcgabb tabbcaacaa agccgccgbc 5580 
ccgfccaagtc agcgtaatgc tcbgccagbg bbacaaccaa ttaaccaabt ctgatbagaa 5640 
aaacbcatcg agcatcaaat gaaactgcaa bbbabbcata tcaggatbat caabaccata 5700 
tbbbbgaaaa agccgtbtct gbaatgaagg agaaaactca ccgaggcagt bccabaggab 5760 
ggcaagatcc tggbatcggb cbgcgabbcc gacbcgbcca acabcaabac aaccbabtaa 5820 
ctbcccctcg bcaaaaataa ggbbatcaag bgagaaabca ccabgagbga cgactgaabc 5880 
cggbgagaab ggcaaaagct tabgcabbbc bbbccagact tgbbcaacag gccagccabb 5940 
acgcbcgtca bcaaaatcac tcgcatcaac caaaccgtba tbcabbcgbg atbgcgccbg 6000 
agcgagacga aabacgcgat cgcbgtbaaa aggacaabba caaacaggaa bcgaatgcaa 6060 
ccggcgcagg aacacbgcca gcgcatcaac aabatttbca ccbgaabcag gababbctbc 6120 
taatacctgg aabgctgttb tcccggggab cgcagtggbg agbaaccabg cabcabcagg 6180 
agbacggaba aaabgcbbga bggbcggaag aggcabaaab bccgbcagcc agbbbagbcb 6240 
gaccabctca bcbgtaacab cabbggcaac gcbaccbbbg ccatgbbbca gaaacaacbc 6300 
tggcgcatcg ggcbtcccab acaabcgaba gabbgtcgca ccbgatbgcc cgacabbabc 6360 
gcgagcccab btabacccat abaaabcagc abccabgbbg gaabttaatc gcggccbcga 6420 
gcaagacgbb bcccgbbgaa tabggcbcab aacaccccbb gbatbacbgt btabgbaagc 6480 
agacagbtbb atbgbbcatg atgababatt bbbabcbbgb gcaabgbaac abcagagabb 6540 
tbgagacaca acgbggcbbt cccccccccc ccabbatbga agcabbtatc agggtbabbg 6600 
tcbcabgagc ggabacabat bbgaabgtab bbagaaaaat aaacaaatag gggbbccgcg 6660 
cacabbbccc cgaaaagbgc caccbgacgb cbaagaaacc abbabbatca bgacabbaac 6720 
cbabaaaaat aggcgbabca cgaggcccbb bcgbc 6755 



<210> 120 
<211> 6677 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 120 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagctcccg gagacggbca 60 
cagcbtgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbbaacbabg cggcatcaga gcagabbgba cbgagagbgc 18 0 
accabatgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc atcagabbgg 24 0 
cbabbggcca bbgcabacgb bgbabccata bcabaababg bacabbtaba bbggctcatg 300 
bccaacabta ccgccabgbb gacabtgatb abbgacbagt babtaabagb aabcaabbac 360 
ggggtcabta gtbcabagcc cabababgga gbbccgcgbt acataacbba cggbaaabgg 42 0 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 48 0 
cabagtaacg ccaabaggga cbttccabbg acgbcaatgg gbggagbabb bacggtaaac 540 
bgcccacbbg gcagbacabc aagtgtabca babgccaagt acgcccccba btgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagtacatg acctbabggg acbtbccbac 660 
bbggcagbac abcbacgtab bagtcabcgc babbaccabg gbgabgcggb bbbggcagta 720 
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catcaatggg cgtggatagc ggtttgactc 
cgtcaatggg agbtbgtbbb ggcaccaaaa 
ctccgcccca ttgacgcaaa bgggcggbag 
agcbcgbbta gbgaaccgbc agatcgcctg 
tagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagtgacg taagtaccgc 
tcttatgcat gcbabacbgb ttttggcttg 
taggtgatgg tatagcttag cctataggtg 
babbggbgac gabacbbbcc attactaatc 
tattggctat atgccaatac tctgtccttc 
ggatggggtc ccatttatta tttacaaatt 
cgcagttttt attaaacata gcgtgggatc 
catgggctct tctccggtag cggcggagcb 
agcggcbcab ggtcgcbcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaabgagc gtggagattg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacabaabag cbgacagacb 
bgcagbcacc gtcgbcgaca cgtgbgatca 
ccabgcgcgt ggagggcabc cagcgcaacb 
gctbcbggab ggtgabgabc bacaacgbgc 
gcgbgcccgb gtggaaggag gccaagacca 
acgacgccga ggbgcacaac gtgbgggcca 
cccaggagab ggbgctggag aacgbgaccg 
bggagcagat gcaccaggac abcatcagcc 
agctgacccc cctgbgcgbg acccbgcact 
gcaccgacaa catgggcggc gagabcaaga 
ccagcgccab cacccaggcc bgccccaagg 
gcgcccccgc cggctacgcc atcctgaagb 
ccbgcaacaa cgbgagcacc atccagbgca 
agcbgcbgct gaacggcagc ctggccgagg 
ccgacaacgb gaagaccabc abcgbgcacc 
gccgabcggc ccactgcaac abcagccgca 
gcaagaagcb ggccgagcac bbccccaaca 
gcgaccbgga gabcaccacc cacagctbca 
ccagcagccb gbbcaacagc accbacaccc 
agaacagcat cabcaccabc ccctgccgca 
bgggccgcgc catgbacgcc ccccccabcg 
ccggccbgcb gcbgacccgc gacggcggca 
ccgagggcgg cgacabgcgc gacaacbggc 
agatcaagcc ccbgggcabc gcccccacca 
bgctgagcgg catcgbgcag cagcagagca 
acatgcbgca gcbgaccgbg bggggcatca 
agcgctaccb gcgcgaccag cagcbcgagg 
gggaccgcga gabcagcaac tacaccaaca 
accagcagga gaagaacgag caggaccbgc 
ccbggbbcgg cabcaccaac tggcbgbggb 
gccbgabcgg ccbgcgcabc abcbbcgccg 
tcbgctgbgc ctbcbagbtg ccagccabcb 
acccbggaag gtgccacbcc cacbgtccbb 
tgbcbgagba ggbgtcabtc babbcbgggg 
gatbgggaag acaabagcag gcabgctggg 
ctgaagaabt gacccggbbc cbcctgggcc 
gacacacccb gbccacgccc cbggbtcbba 



acggggabbb ccaagbctcc accccabbga 780 
tcaacgggac tbbccaaaab gtcgbaacaa 840 
gcgbgbacgg bgggaggbcb atabaagcag 900 
gagacgccab ccacgcbgbb btgaccbcca 960 
cggccgggaa cggbgcabbg gaacgcggab 1020 
cbabagacbc tataggcaca ccccttbggc 1080 
gggccbatac acccccgctt ccbtatgcba 1140 
tgggbtabtg accabbabbg accactcccc 1200 
cabaacabgg cbcbbbgcca caacbabcbc 1260 
agagacbgac acggacbcbg babttbbaca 1320 
cacabataca acaacgccgb cccccgbgcc 13 80 
tccacgcgaa bcbcgggtac gbgbbccgga 1440 
tccacabccg agcccbggbc ccabgccbcc 1500 
ctcctaacag bggaggccag acbbaggcac 1560 
cacaaggccg bggcggbagg gbabgbgtcb 1620 
gcbgacgcag abggaagact baaggcagcg 1680 
gbatbcbgab aagagbcaga ggbaacbccc 1740 
gbagtctgag cagtacbcgb bgctgccgcg 1800 
aacagacbgb bccttbccab gggbcbbbbc 1860 
gababcgcgg ccgctcbaga gababcgcca 1920 
ggaagcagbg gbggabcbgg ggcatcctgg 1980 
gcggcaaccb gtgggbgacc gtgbactacg 2 04 0 
ccctgbtctg cgccagcgac gccaaggccb 2100 
cccacgccbg cgtgcccacc gaccccaacc 2160 
agaacttcaa cabgbgggag aacgacabgg 2220 
tgbgggacca gagccbgaag cccbgcgbga 22 80 
gcagcaaccg caccabcgac tacaacaacc 2340 
acbgcagcbt ccgabcgabc aacbgcaaca 2400 
tgagcttcga ccccatcccc abccacbacb 2460 
gcaacaacaa gacctbcaac ggcaccggcc 2520 
cccacggcac caagcccgbg gbgagcaccc 2580 
aggagabcat cabccgcagc aagaaccbga 2 640 
tgaacgagag cgbggagabc aactgcaccc 27 00 
ccgccbggaa caagaccctg caggaggbgg 2760 
aggccabcaa gbtcgccaag cacagcggcg 2820 
actgccgcgg cgagbbcbbc bacbgcaaca 2880 
ccaacagcac cgagaacabc accggcaccg 2940 
bcaagcagat cabcaacabg bggcagggcg 3000 
agggcabccb gaccbgccgc agcaacabca 3060 
ccggcatgca cgacaccgag abcbtccgcc 3120 
gcagcgagcb gbacaagbac aaggbggbgg 3180 
aggccaagcb taccgtgcag gbgcgccagc 3240 
accbgctgcg cgccatcgag gcccagcagc 3300 
agcagcbgca gacccgcgbg cbggccabcg 3360 
acabcbggaa caacatgacc bggabgcagb 3420 
ccabcbaccg cctgcbggag gacagccaga 3480 
bggcccbgga caagbggcag aaccbgbgga 3540 
acatcaagab cbbcabcaag atcgbgggcg 3600 
bgcbgagcab cbgaacacgb gggabccaga 3660 
gbtgbbbgcc ccbcccccgb gccbbccbtg 3720 
tccbaabaaa abgaggaaab bgcatcgcab 3780 
ggtggggbgg ggcaggacag caagggggag 3840 
gabgcggbgg gcbctabggg bacccaggbg 3900 
agaaagaagc aggcacatcc ccbtctctgb 3960 
gbbccagccc cacbcabagg acacbcabag 4020 
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ctcaggaggg cbccgccbbc aatcccaccc gctaaagtac tbggagcggt ctctccctcc 4080 
ctcatcagcc caccaaacca aacctagcct ccaagagtgg gaagaaatta aagcaagata 4140 
ggctattaag tgcagaggga gagaaaabgc ctccaacatg tgaggaagta atgagagaaa 4200 
tcatagaatt bbaaggccat gatttaaggc catcatggcc ttaatcttcc gcttcctcgc 4260 
bcacbgacbc gcbgcgcbcg gbcgbbcggc tgcggcgagc ggtatcagct cactcaaagg 4320 
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 4380 
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgbbbtte cataggctcc 4440 
gccccccbga cgagcatcac aaaaabcgac gcbcaagbca gaggtggcga aacccgacag 4500 
gactataaag ataccaggcg bbbcccccbg gaagcbcccb cgtgcgctct cctgttccga 4560 
ccctgccgct taccggatac ctgtccgcct tbcbcccbbc gggaagcgtg gcgcbbbcbc 4620 
atagctcacg cbgtaggtab ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 4680 
bgcacgaacc ccccgbbcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 4740 
ccaacccggt aagacacgac ttatcgccac tggcagcagc cachggtaac aggattagca 4800 
gagcgaggta tgtaggcggb gctacagagb tcttgaagtg gtggcctaac tacggctaca 4860 
cbagaagaac agtatttggt abcbgcgcbc fcgctgaagcc agbbaccbtc ggaaaaagag 4920 
ttggtagcbc btgatccggc aaacaaacca ccgctggbag cggbggtbtt tbtgbtbgca 4980 
agcagcagat bacgcgcaga aaaaaaggab cbcaagaaga tcctbbgabc bttbctacgg 5040 
ggtcbgacgc bcagbggaac gaaaactcac gbbaagggat bttggbcabg agattabcaa 5100 
aaaggabcbt caccbagatc cbttbaaabb aaaaatgaag btbtaaabca atctaaagba 5160 
batabgagta aacbbggtcb gacagtbacc aabgctbaat cagtgaggca cctabcbcag 5220 
cgatctgbct abbbcgbbca tccatagbbg ccbgacbcgg gggggggggg cgcbgaggtc 5280 
tgcctcgbga agaaggbgtt gctgactcab accaggcctg aatcgcccca tcabccagcc 5340 
agaaagtgag ggagccacgg bbgabgagag cbttgbbgba ggtggaccag btggbgatbb 5400 
tgaacbbttg ctbbgccacg gaacggbcbg cgttgbcggg aagatgcgtg abcbgabcct 5460 
tcaacbcagc aaaagbtcga tbbattcaac aaagccgccg bcccgbcaag tcagcgtaab 5520 
gctchgccag tgbtacaacc aabtaaccaa btcbgabtag aaaaacbcat cgagcatcaa 5580 
abgaaacbgc aattbabbca tatcaggabt atcaatacca tattbtbgaa aaagccgtbb 5640 
ctgtaabgaa ggagaaaact caccgaggca gbtccabagg atggcaagat ccbggtatcg 5700 
gtcbgcgatt ccgacbcgtc caacatcaab acaaccbabb aatbtcccct cgbcaaaaab 5760 
aaggbbabca agbgagaaab caccatgagb gacgacbgaa bccggbgaga atggcaaaag 5820 
cttatgcabt bcbtbccaga cbbgtbcaac aggccagcca btacgcbcgt cabcaaaabc 5880 
acbcgcabca accaaaccgb babbcabbcg bgatbgcgcc tgagcgagac gaaabacgcg 594 0 
atcgcbgtba aaaggacaat tacaaacagg aatcgaabgc aaccggcgca ggaacacbgc 6000 
cagcgcabca acaababbtb cacctgaatc aggababbcb bctaataccb ggaatgcbgt 6060 
bbbcccgggg abcgcagbgg tgagbaacca bgcabcabca ggagbacgga baaaatgcbb 6120 
gatggtcgga agaggcabaa abbccgbcag ccagbtbagt cbgaccabct cabcbgbaac 6180 
atcattggca acgcbacctb bgccabgbbt cagaaacaac tcbggcgcat cgggctbccc 6240 
abacaabcga bagattgtcg caccbgattg cccgacabba tcgcgagccc abtbataccc 6300 
ababaaabca gcabccatgb bggaabttaa tcgcggccbc gagcaagacg tttcccgbbg 6360 
aababggcbc abaacacccc tbgbabtacb gbbbabgbaa gcagacagbt btabtgbbca 6420 
bgabgataba tbbbbabcbb gtgcaatgba acabcagaga bbtbgagaca caacgbggcb 6480 
tbcccccccc ccccatbatt gaagcattba bcagggbbab bgbcbcabga gcggabacab 6540 
abbbgaabgt atbbagaaaa abaaacaaab aggggtbccg cgcacabbbc cccgaaaagt 6600 
gccacctgac gtctaagaaa ccabbabtab cabgacatta accbataaaa abaggcgtat 6660 
cacgaggccc bbtcgbc 6677 



<210> 121 
<211> 6677 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR10l2x/s conbaining HIV genes 
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<400> 121 

bcgcgcgtbt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggbca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagatbgba ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 

cbabbggcca bbgcabacgb fcgtatccata bcabaabafcg bacabtbaba ttggctcatg 300 

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaabbac 360 
ggggtcabta gttcatagcc cabababgga gttccgcgtt acataactta cggtaaatgg 420 

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 

tgcccacttg gcagtacatc aagbgtabca tatgccaagt acgcccccba ttgacgtcaa 600 

bgacggbaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 

tbggcagbac atctacgtat tagtcatcgc tattaccatg gbgabgcggb tttggcagta 720 

catcaatggg cgtggatagc ggbbbgactc acggggabbb ccaagtctcc accccattga 780 

cgtcaatggg agbbtgbbtb ggcaccaaaa tcaacgggac tttccaaaat gtcgbaacaa 840 

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggbcb ababaagcag 900 

agcbcgbfcba gtgaaccgtc agabcgccbg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc cbabagacbc tataggcaca cccctttggc 1080 

bcbbatgcab gcbatacbgb ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 

taggtgatgg babagcbtag ccbataggtg tgggttattg accattabtg accactcccc 1200 

babbggbgac gatacttbcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

tabbggctab atgccaatac bcbgbccbbc agagactgac acggacbctg babbbbtaca 1320 

ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt abbaaacaba gcgbgggabc tccacgcgaa tctcgggtac gtgttccgga 1440 

catgggctcb tctccggtag cggcggagcb tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag achtaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 162 0 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag ctgagttgtt gtattcfcgat aagagtcaga ggtaactccc 174 0 

gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 18 00 

cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 

tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 

ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 

gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 

gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 

acgacgccga ggtgcacaac gfcgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 

cccaggagat ggbgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 

tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 22 80 

agctgacccc cctghgcgtg accctgcact gcagcaaccg caccatcgac tacaacaacc 2340 

gcaccgacaa catgggcggc gagatcaaga actgcagctt ccgatcgatc aactgcaaca 24 00 

ccagcgccat cacccaggcc tgccccaagg tgagcttcga ccccatcccc atccactact 2460 

gcgcccccgc cggctacgcc atcctgaagt gcaacaacaa gaccttcaac ggcaccggcc 2520 

cctgcaacaa cgtgagcacc atccagtgca cccacggcac caagcccgtg gtgagcaccc 2580 

agctgctgct gaacggcagc ctggccgagg aggagabcat catccgcagc aagaacctga 2640 

ccgacaacgt gaagaccatc atcgtgcacc tgaacgagag cgtggagatc aacfcgcaccc 27 00 

gccgatcggc ccactgcaac atcagccgca ccgcctggaa caagaccctg caggaggtgg 2760 

gcaagaagct ggccgagcac ttccccaaca aggccatcaa gttcgccaag cacagcggcg 2 820 

gcgacctgga gabcaccacc cacagcbbca acbgccgcgg cgagbbcbbc bacbgcaaca 28 80 

ccagcagccb gtbcaacagc accbacaccc ccaacagcac cgagaacabc accggcaccg 294 0 

agaacagcab cabcaccabc cccbgccgca bcaagcagab cabcaacabg bggcagggcg 30 00 

bgggccgcgc cabgbacgcc ccccccabcg agggcabcct gaccbgccgc agcaacabca 3060 

ccggccbgcb gcbgacccgc gacggcggca ccggcabgca cgacaccgag atcbbccgcc 312 0 

ccgagggcgg cgacabgcgc gacaacbggc gcagcgagct gbacaagbac aaggbggbgg 3180 
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agatcaagcc cctgggcatc gcccccacca aggccaagct taccgtgcag gbgcgccagc 3240 
tgcbgagcgg catcgtgcag cagcagagca acctgctgcg cgccatcgag gcccagcagc 3300 
acabgcbgca gctgaccgtg tggggcatca agcagcbgca gacccgcgtg ctggccatcg 3360 
agcgctacct gcgcgaccag cagctcgagg acatctggaa caacatgacc tggatgcagt 3420 
gggaccgcga gatcagcaac tacaccaaca ccatcbaccg cctgctggag gacagccaga 3480 
accagcagga gaagaacgag caggaccbgc tggccctgga caagtggcag aacctgtgga 3540 
cctggttcgg cabcaccaac tggctgtggt acatcaagat cbbcatcaag atcgtgggcg 3600 
gcctgatcgg ccfcgcgcatc atcttcgccg tgctgagcat ctgaacacgt gggabccaga 3660 
tctgctgtgc cbbctagbbg ccagccatct gbbgtbtgcc ccbcccccgb gccbtccbtg 3720 
accctggaag gbgccactcc cactgbcctt tcctaataaa atgaggaaab tgcatcgcat 3780 
tgtcbgagta ggbgtcabbc tabtcbgggg ggtggggtgg ggcaggacag caagggggag 3 840 
gabtgggaag acaabagcag gcatgcbggg gatgcggtgg gctctabggg tacccaggbg 3 900 
cbgaagaatt gacccggbbc cbccbgggcc agaaagaagc aggcacatcc ccbbcbcbgt 3960 
gacacacccb gbccacgccc cbggtbctta gbbccagccc cacbcabagg acacbcabag 4020 
ctcaggaggg ctccgcctbc aabcccaccc gctaaagtac bbggagcggb cbctcccbcc 4 080 
cbcatcagcc caccaaacca aacctagccb ccaagagtgg gaagaaabba aagcaagaba 414 0 
ggctat^aag bgcagaggga gagaaaatgc ctccaacatg tgaggaagba abgagagaaa 4200 
tcatagaatt bbaaggccat gatbbaaggc catcatggcc ttaabctbcc gcttccfccgc 4260 
tcactgactc gcbgcgcbcg gbcgtbcggc bgcggcgagc ggbabcagct cactcaaagg 4320 
cggtaatacg gbbatccaca gaatcagggg ataacgcagg aaagaacabg tgagcaaaag 43 80 
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgbtgct ggcgttbttc cabaggcbcc 444 0 
gccccccbga cgagcabcac aaaaabcgac gcbcaagtca gaggtggcga aacccgacag 4500 
gactabaaag abaccaggcg tbtccccctg gaagcbccct cgtgcgctct cctgbtccga 4560 
ccctgccgcb taccggabac cbgbccgccb ttcbcccttc gggaagcgtg gcgcbttctc 4620 
abagctcacg cbgtaggbab cbcagbtcgg bgtaggtcgb tcgcbccaag cbgggcbgtg 4680 
tgcacgaacc ccccgbbcag cccgaccgct gcgccttatc cggbaactab cgtcbtgagt 4740 
ccaacccggt aagacacgac hbabcgccac tggcagcagc cactggbaac aggabbagca 4800 
gagcgaggba tgbaggcggt gcbacagagt tcbbgaagtg gbggcctaac bacggctaca 4860 
cbagaagaac agbabbbggb atcbgcgcbc bgctgaagcc agttaccbtc ggaaaaagag 4920 
btggbagcbc bbgatccggc aaacaaacca ccgcbggtag cggtggtbbb tttgtbbgca 4980 
agcagcagab bacgcgcaga aaaaaaggab cbcaagaaga bccttbgatc ttttcbacgg 504 0 
ggbctgacgc tcagbggaac gaaaactcac gtbaagggab tbtggbcabg agatbabcaa 5100 
aaaggatctb caccbagatc ctttbaaafcb aaaaatgaag tbtbaaatca abctaaagta 5160 
bababgagta aactbggbcb gacagtbacc aafcgcbtaat cagbgaggca ccbatctcag 5220 
cgabctgbct atbtcgbbca bccatagtbg ccbgactcgg gggggggggg cgcbgaggtc 5280 
bgccbcgbga agaaggbgbb gctgacbcab accaggccbg aabcgcccca tcabccagcc 5340 
agaaagtgag ggagccacgg bbgabgagag cbbbgbbgta ggtggaccag ttggbgabtb 5400 
tgaacttbtg cbttgccacg gaacggfcctg cgttgtcggg aagabgcgbg atctgatccb 5460 
tcaactcagc aaaagtbcga tttattcaac aaagccgccg tcccgtcaag tcagcgtaat 5520 
gctcbgccag bgbbacaacc aabtaaccaa bbcbgatbag. aaaaactcat cgagcatcaa 5580 
abgaaactgc aabbbabbca tabcaggatt atcaatacca tatbtttgaa aaagccgbbb 5640 
ctgbaatgaa ggagaaaact caccgaggca gbbccatagg atggcaagab ccbggbabcg 5700 
gtcbgcgatt ccgacbcgbc caacabcaab acaaccbatb aatbbccccb cgbcaaaaab 5760 
aaggbbatca agbgagaaat caccabgagb gacgacbgaa tccggbgaga abggcaaaag 5820 
cbbatgcabt bcbbtccaga cbbgtbcaac aggccagcca ttacgctcgb catcaaaatc 5880 
acbcgcabca accaaaccgt babbcabbcg bgatbgcgcc tgagcgagac gaaabacgcg 5940 
abcgctgbba aaaggacaat bacaaacagg aabcgaabgc aaccggcgca ggaacacbgc 6000 
cagcgcabca acaabatbbt cacctgaabc aggababbct bcbaabaccb ggaabgcbgb 6060 
bttcccgggg abcgcagbgg bgagbaacca tgcatcabca ggagbacgga baaaatgctb 6120 
gabggbcgga agaggcataa abtccgtcag ccagtttagt cbgaccabcb cabcbgtaac 6180 
abcabtggca acgctaccbb tgccabgbtb cagaaacaac bctggcgcab cgggcttccc 6240 
atacaatcga bagatbgbcg caccbgabbg cccgacabba bcgcgagccc atttataccc 6300 
ababaaabca gcabccabgt bggaabttaa bcgcggccbc gagcaagacg tbtcccgbbg 6360 
aabatggcbc ataacacccc bbgbabtact gbbbabgbaa gcagacagbb btabtgbbca 6420 
bgabgataba bbbbfcabcbt gbgcaatgta acabcagaga bbbtgagaca caacgbggct 6480 
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ttcccccccc ccccattatt gaagcattta tcagggttat tgtctcatga gcggatacat 6540 

atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt 6600 

gccacctgac gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat 6660 
cacgaggccc tttcgtc 6677 



<210> 122 
<211> 6608 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 122 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 160 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 3 00 

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 

tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 

tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 

ta 9gt9 at 99 tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 

tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 

ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 

catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 162 0 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 

gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 

tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 

ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 

gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 204 0 

gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 

acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 

cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 222 0 

tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 

agctgacccc cctgtgcgtg accctgcact gcagcaaccg caccatcgac tacaacaacc 234 0 
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gcaccgacaa catgggcggc gagabcaaga 
ccagcgccat cacccaggcc tgccccaagg 
gcgcccccgc cggctacgcc abccbgaagb 
cctgcaacaa cgtgagcacc atccagtgca 
agctgctgct gaacggcagc ctggccgagg 
ccgacaacgfc gaagaccatc atcgtgcacc 
gccgatcggc ccactgcaac atcagccgca 
gcaagaagcb ggccgagcac ttccccaaca 
gcgaccbgga gatcaccacc cacagcttca 
cccgatcgat cccctgccgc atcaagcaga 
ccatgtacgc cccccccatc gagggcabcc 
tgchgacccg cgacggcggc accggcatgc 
gcgacatgcg cgacaactgg cgcagcgagc 
ccctgggcat cgcccccacc aaggccaagc 
gcabcgbgca gcagcagagc aacctgctgc 
agctgaccgt gfcggggcatc aagcagctgc 
tgcgcgacca gcagctcgag gacatctgga 
agatcagcaa ctacaccaac accatcbacc 
agaagaacga gcaggacctg ctggccctgg 
gcatcaccaa ctggctgtgg tacatcaaga 
gcctgcgcat catcttcgcc gbgcbgagca 
ccttctagtt gccagccatc tgttgtttgc 
ggtgccactc ccacbgbccb bbccbaabaa 
aggbgbcabb ctattctggg gggbggggtg 
gacaatagca ggcahgctgg ggatgcggtg 
tgacccggtt cctcctgggc cagaaagaag 
tgbccacgcc cctggttctt agttccagcc 
gctccgcctt caatcccacc cgctaaagta 
ccaccaaacc aaacctagcc tccaagagtg 
gtgcagaggg agagaaaabg cctccaacat 
tttaaggcca tgatttaagg ccatcatggc 
cgctgcgctc ggbcgttcgg cbgcggcgag 
ggttatccac agaatcaggg gataacgcag 
a 9gccaggaa ccgtaaaaag gccgcgttgc 
acgagcabca caaaaabcga cgctcaagtc 
gataccaggc gbttccccct ggaagcbccc 
bbaccggaba ccbgbccgcc bbtctccctt 
gcbgbaggba bcbcagbbcg gbgbaggbcg 
cccccgbtca gcccgaccgc bgcgccbtat 
taagacacga cbbabcgcca ctggcagcag 
afcgbaggcgg bgctacagag btcbtgaagt 
cagbabtbgg batcbgcgcb cbgctgaagc 
cbtgabccgg caaacaaacc accgctggba 
bbacgcgcag aaaaaaagga bctcaagaag 
cbcagbggaa cgaaaacbca cgtbaaggga 
tcaccbagab ccbttbaaab baaaaabgaa 
aaacbbggbc tgacagbbac caabgcbbaa 
babbbcgbtc abccabagtt gcctgacbcg 
aagaaggbgb tgcbgacbca baccaggccb 
gggagccacg gbbgabgaga gcbbtgttgb 
gcbbbgccac ggaacggbcb gcgbtgtcgg 
caaaagtbcg abtbabbcaa caaagccgcc 
gbgbbacaac caahbaacca abbcbgabta 
caabbbabbc ababcaggat babcaabacc 
aggagaaaac bcaccgaggc agbtccatag 
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actgcagcbb ccgabcgabc aacbgcaaca 2400 
tgagcbbcga ccccabcccc abccactacb 2460 
gcaacaacaa gaccbbcaac ggcaccggcc 2520 
cccacggcac caagcccgbg gtgagcaccc 2580 
aggagatcab cabccgcagc aagaaccbga 2640 
bgaacgagag cgbggagabc aacbgcaccc 2700 
ccgccbggaa caagacccbg caggaggtgg 2760 
a ggccabcaa gtbcgccaag cacagcggcg 2820 
acbgccgcgg cgagtbcttc bacbgcaaca 2880 
tcabcaacat gbggcagggc gtgggccgcg 2940 
bgacctgccg cagcaacatc accggcctgc 3000 
acgacaccga gabcbbccgc cccgagggcg 3060 
bgbacaagba caaggbggbg gagabcaagc 3120 
bbaccgbgca ggbgcgccag cbgcbgagcg 3180 
gcgccabcga ggcccagcag cacatgcbgc 324 0 
agacccgcgb gcbggccabc gagcgctacc 3300 
acaacatgac cbggatgcag bgggaccgcg 33 60 
gccbgctgga ggacagccag aaccagcagg 3420 
acaagbggca gaaccbgbgg accbggbbcg 34 80 
bcbbcabcaa gabcgbgggc ggccbgabcg 3540 
bcbgaacacg tgggatccag abcbgcbgtg 3 600 
cccbcccccg tgccbtccbb gacccbggaa 3660 
aabgaggaaa btgcatcgca ttgbctgagb 3720 
gggcaggaca gcaaggggga ggabbgggaa 3780 
ggcbcbatgg gbacccaggb gcbgaagaab 3840 
caggcacabc cccbbcbcbg bgacacaccc 3900 
ccacbcabag gacactcaba gcbcaggagg 3960 
cbtggagcgg bcbcbcccbc cctcabcagc 4020 
ggaagaaabt aaagcaagab aggcbabbaa 4080 
gbgaggaagt aatgagagaa abcabagaab 414 0 
cbbaabcbbc cgcbtccbcg cbcacbgacb 4200 
cggtabcagc bcacbcaaag gcggbaabac 4260 
gaaagaacat gbgagcaaaa ggccagcaaa 4320 
tggcgbbbtb ccabaggcbc cgccccccbg 43 80 
agaggbggcg aaacccgaca ggacbataaa 4440 
bcgbgcgctc bccbgbbccg acccbgccgc 4500 
cgggaagcgb ggcgcbbtcb cabagcbcac 4560 
bbcgcbccaa gctgggctgb gtgcacgaac 4620 
ccggbaacba bcgbcbbgag bccaacccgg 4680 
ccacbggbaa caggabbagc agagcgaggb 4740 
ggbggcctaa cbacggctac acbagaagaa 4800 
cagbbaccbb cggaaaaaga gtbggbagcb 4860 
gcggbggbbb bbbbgbbbgc aagcagcaga 4920 
abccbtbgab cttbbcbacg gggbcbgacg 4980 
tbbbggbcab gagabbabca aaaaggatcb 5040 
gbbbbaaabc aabctaaagt abababgagb 5100 
bcagbgaggc accbabcbca gcgabcbgbc 5160 
9999999999 gcgcbgaggb ctgccbcgbg 5220 
gaabcgcccc abcabccagc cagaaagtga 5280 
aggbggacca gbbggbgabb bbgaacbtbb 5340 
gaagabgcgb gabcbgabcc bbcaacbcag 5400 
gbcccgbcaa gbcagcgbaa bgcbcbgcca 5460 
gaaaaacbca bcgagcabca aabgaaacbg 5520 
ababbbbtga aaaagccgtb tctgbaabga 5580 
gatggcaaga bccbggbabc ggbcbgcgab 5640 
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tccgactcgt ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc 5700 

aagtgagaaa tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat 5760 

ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 5820 

aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 5880 

aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 5940 

aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg 6000 

gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 6060 

aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc 6120 

aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 6180 

atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 6240 

agcatccatg ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct 6300 

cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 6360 

atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc 6420 

cccccattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 64 80 

tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 6540 

cgtctaagaa accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc 6600 

ctttcgtc 6608 



<210> 123 
<211> 6689 
<212> DNA 

<2\3> Artificial Sequence 

* 

<220> 

<223> plaamid pVR1012x/s containing HIV genes 

* 

<400> 123 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
'cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcatfcg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg' tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
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agcacaatgc ccaccaccac cagtgbgccg cacaaggccg tggcggtagg gbatgbgbct 1620 
gaaaabgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gbabbcbgab aagagtcaga ggbaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt bccbbbccat gggtcttttc 1860 
fcgcagfccacc gtcgtcgaca cgbgbgatca gatatcgcgg ccgctctaga gabatcgcca 1920 
ccabgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcbbcbggat ggtgatgatc bacaacgbgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 . 
acgacgccga ggtgcacaac gtgtgggcca cccacgccbg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacbbcaa catgtgggag aacgacatgg 2220 
tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgfcga 2280 
agctgacccc cctgtgcgtg acccfcgcact gcagcaaccg caccatcgac tacaacaacc 2340 
gcaccgacaa catgggcggc gagatcaaga acbgcagcbb ccgabcgabc aacbgcaaca 2400 
ccagcgccat cacccaggcc bgccccaagg tgagcbbcga ccccabcccc abccacbacb 2460 
gcgcccccgc cggctacgcc abccbgaagt gcaacaacaa gacctbcaac ggcaccggcc 2520 
ccbgcaacaa cgbgagcacc abccagtgca cccacggcac caagcccgtg gtgagcaccc 2580 
agctgcbgct gaacggcagc cbggccgagg aggagabcat catccgcagc aagaaccbga 2640 
ccgacaacgt gaagaccatc abcgbgcacc tgaacgagag cgtggagabc aacbgcaccc 2700 
gccccaacaa caacacccgc aagagcabcc gcabcggccc cggccaggcc tbcbacgcca 2 760 
ccggcgagat cabcggcgac abccgccagg cccacbgcaa cabcagccgc accgccbgga 2 820 
acaagaccct gcaggaggtg ggcaagaagc tggccgagca cbtccccaac aaggccabca 2 880 
agbtcgccaa gcacagcggc ggcgacctgg agabcaccac ccacagcbbc aacbgccgcg 2940 
gcgagttctt ctactgcaac acccgatcga tccccbgccg catcaagcag abcatcaaca 3000 
tgtggcaggg cgtgggccgc gccabgtacg ccccccccat cgagggcabc cbgacctgcc 3060 
gcagcaacat caccggcctg cbgcbgaccc gcgacggcgg caccggcabg cacgacaccg 3120 
agabctbccg ccccgagggc ggcgacatgc gcgacaacbg gcgcagcgag cbgtacaagb 3180 
acaaggbggb ggagabcaag ccccbgggca tcgcccccac caaggccaag cbtaccgbgc 3240 
aggbgcgcca gcbgcbgagc ggcabcgtgc agcagcagag caaccbgcbg cgcgccabcg 3300 
aggcccagca gcacabgctg cagcbgaccg tgtggggcab caagcagcbg cagacccgcg 3360 
bgcbggccat cgagcgcbac ctgcgcgacc agcagcbcga ggacatcbgg aacaacabga 3420 
ccbggabgca gtgggaccgc gagabcagca acbacaccaa caccatcbac cgcctgcbgg 3480 
aggacagcca gaaccagcag gagaagaacg agcaggaccb gcbggccctg gacaagbggc 354 0 
agaaccbgbg gaccbggbtc ggcabcacca acbggcbgtg gtacatcaag abcbtcabca 3600 
agabcgbggg cggccbgatc ggccbgcgca tcabcbbcgc cgbgctgagc atctgaacac 3660 
gbgggatcca gatcbgcbgt gccbbcbagb bgccagccab ctgbtgbttg cccctccccc 3720 
gbgccbbccb bgacccbgga aggbgccact cccacbgbcc bbbccbaaba aaabgaggaa 3780 
abbgcabcgc abbgbcbgag baggbgbcab bcbabbcbgg ggggtggggt ggggcaggac 384 0 
agcaaggggg aggabbggga agacaabagc aggcabgcbg gggabgcggt gggctcbabg 3900 
ggtacccagg bgcbgaagaa bbgacccggb bccbcctggg ccagaaagaa gcaggcacab 3960 
ccccttcbcb gtgacacacc cbgtccacgc cccbggbbcb bagbtccagc cccacbcaba 4020 
ggacacbcat agcbcaggag ggctccgccb bcaabcccac ccgcbaaagt acbtggagcg 4080 
gbcbcbcccb cccbcatcag cccaccaaac caaaccbagc cbccaagagb gggaagaaab 4140 
baaagcaaga baggcbabba agbgcagagg gagagaaaab gccbccaaca bgbgaggaag 4200 
baabgagaga aabcabagaa bbbbaaggcc abgabbbaag gccabcabgg ccbtaabcbb 4260 
ccgcttccbc gcbcactgac bcgctgcgcb cggtcgbbcg gcbgcggcga gcggbabcag 4320 
cbcacbcaaa ggcggtaata cggbtabcca cagaabcagg ggabaacgca ggaaagaaca 4380 
bgbgagcaaa aggccagcaa aaggccagga accgbaaaaa ggccgcgbbg cbggcgbtbb 4440 
bccabaggcb ccgccccccb gacgagcatc acaaaaabcg acgcbcaagb cagaggbggc 4500 
gaaacccgac aggacbabaa agabaccagg cgbtbccccc bggaagctcc cbcgbgcgcb 4560 
cbccbgbbcc gacccbgccg cbtaccggab acctgbccgc cbbbctcccb tcgggaagcg 4620 
bggcgcbbbc bcabagcbca cgcfcgbaggt abctcagbtc ggbgtaggbc gttcgcbcca 4680 
agcbgggcbg bgbgcacgaa ccccccgbbc agcccgaccg cbgcgcctba bccggbaacb 4740 
abcgbcbbga gbccaacccg gbaagacacg acbtabcgcc acbggcagca gccactggba 4 800 
acaggabbag cagagcgagg babgtaggcg gbgcbacaga gtbcbbgaag bggbggccba 4 860 
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acbacggcba cacbagaaga acagbabbtg gtatctgcgc tctgctgaag ccagttacct 4920 
tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggbggbb 4980 
tbbttgbbbg caagcagcag abtacgcgca gaaaaaaagg atctcaagaa gatcctttga 5040 
tcbbbbcbac ggggtctgac gctcagtgga acgaaaactc acgttaaggg atbbbggbca 5100 
bgagatbabc aaaaaggatc ttcacctaga bccbbbbaaa ttaaaaatga agttttaaat 5160 
caatctaaag tatatatgag taaacttggt ctgacagtta ccaabgctba atcagtgagg 5220 
caccbabcbc agcgatctgt cbabbbcgtb catccatagt tgcctgactc gggggggggg 5280 
ggcgctgagg tcbgcctcgb gaagaaggbg ttgctgactc ataccaggcc bgaabcgccc 5340 
cabcatccag ccagaaagbg agggagccac ggttgatgag agctbbgbbg taggtggacc 5400 
agthggtgat btbgaacbbt tgctttgcca cggaacggtc bgcgbbgbcg ggaagahgcg 5460 
tgatctgatc cttcaactca gcaaaagttc gabtbatbca acaaagccgc cgtcccgtca 5520 
agtcagcgta atgcbcbgcc agbgbbacaa ccaattaacc aabbcbgatb agaaaaactc 5580 
atcgagcatc aaabgaaacb gcaatttatt catatcagga ttatcaatac catatttttg 5640 
aaaaagccgt ttctgtaahg aaggagaaaa ctcaccgagg cagttccata ggatggcaag 5700 
atccbggbat cggtctgcga ttccgactcg tccaacatca atacaaccta btaabbbccc 5760 
ctcgtcaaaa ataaggttat caagtgagaa atcaccatga gtgacgactg aabccggtga 5820 
gaatggcaaa agcbbabgca tttctttcca gacttgttca acaggccagc cattacgctc 5880 
gtcatcaaaa tcactcgcat caaccaaacc gttattcatt cgtgattgcg cctgagcgag 5940 
acgaaatacg cgatcgctgt baaaaggaca abtacaaaca ggaatcgaat gcaaccggcg 6000 
caggaacact gccagcgcat caacaababb btcaccbgaa bcaggatabt cttcbaabac 6060 
ctggaabgcb gbbtbcccgg ggabcgcagb ggbgagbaac cabgcatcab caggagbacg 6120 
gabaaaabgc bbgabggbcg gaagaggcab aaattccgtc agccagbtba gtctgaccat 6180 
cbcatctgba acatcatbgg caacgctacc bbtgccatgt ttcagaaaca acbcbggcgc 6240 
abcgggcbbc ccabacaatc gabagabtgb cgcaccbgab tgcccgacat babcgcgagc 63 00 
ccatbbabac ccabataaab cagcabccab gbbggaatbb aabcgcggcc bcgagcaaga 6360 
cgbbbcccgt bgaababggc bcabaacacc ccbtgbabba cbgbbtabgb aagcagacag 6420 
tbtbabbgbt cabgabgata batbbbbabc btgtgcaabg baacabcaga gabbbtgaga 6480 
cacaacgbgg cbbbcccccc ccccccabba tbgaagcabb babcagggbt abtgbctcat 6540 
gagcggabac ababbbgaab gbabbbagaa aaabaaacaa abaggggbbc cgcgcacabt 6600 
tccccgaaaa gbgccaccbg acgtcbaaga aaccabbabt abcabgacat taaccbabaa 6660 
aaataggcgb abcacgaggc cctttcgbc 6689 



<210> 124 
<211> 6785 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVRl0l2x/s conbaining HIV genes 
<400> 124 

tcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
tbggcgggbg tcggggcbgg cbbaacbabg cggcatcaga gcagabbgba ctgagagtgc 180 
accatatgcg gbgbgaaaba ccgcacagab gcgtaaggag aaaataccgc atcagatbgg 240 
ctabtggcca bbgcabacgb tgbatccaba bcataababg bacabbbaba ttggcbcabg 300 
bccaacabba ccgccabgbb gacatbgabb abbgacbagb babbaabagb aatcaabbac 360 
ggggbcabba gbtcabagcc cabababgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgtatgbbcc 4 80 
cabagbaacg ccaabaggga cbbbccatbg acgbcaabgg gbggagbabb bacggbaaac 540 
tgcccacbbg gcagbacabc aagbgtatca batgccaagt acgcccccba bbgacgbcaa 600 
tgacggtaaa bggcccgccb ggcabbabgc ccagbacabg accbtabggg acbbbccbac 660 
tbggcagtac abcbacgbab bagbcabcgc tabbaccabg gbgabgcggb bbbggcagba 720 
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catcaatggg cgtggatagc ggbbbgacbc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac bbtccaaaab gtcgtaacaa 840 
ctccgcccca bbgacgcaaa bgggcggbag gcgtgbacgg bgggaggbcb ababaagcag 900 
agcbcgbbba gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagbgacg baagtaccgc ctatagactc bataggcaca ccccbbbggc 1080 
bcbbabgcab gctatactgt ttttggcttg gggcctatac acccccgcbb ccbbabgcba 1140 
taggbgabgg tatagcttag ccbataggbg bgggbbabbg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
babbggcbab atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccabbbabba tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagbbbbb abbaaacaba gcgtgggatc bccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agcccbggbc ccatgcctcc 1500 
agcggctcab ggbcgcbcgg cagctcctbg cbccbaacag bggaggccag actbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggtagg gbabgbgtcb 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbt gbabbcbgab aagagtcaga ggbaacbccc 1740 
gbbgcggbgc bgbbaacggb ggagggcagt gbagbcbgag cagtactcgt bgctgccgcg 1800 
cgcgccacca gacabaabag cbgacagact aacagacbgb bccbbtccab gggbcbbbbc 1860 
bgcagbcacc gbcgbcgaca cgbgtgabca gababcgcgg ccgcbcbaga gababcgcca 1920 
ccabgcgcgb ggagggcabc cagcgcaacb ggaagcagbg gbggabcbgg ggcabccbgg 1980 
gcbbcbggab ggbgatgabc bacaacgbgc gcggcaaccb gbgggtgacc gtgbacbacg 2040 
gcgbgcccgt gbggaaggag gccaagacca ccctgbbcbg cgccagcgac gccaaggccb 2100 
acgacgccga ggbgcacaac gbgbgggcca cccacgccbg cgtgcccacc gaccccaacc 2160 
cccaggagat ggbgcbggag aacgtgaccg agaacbbcaa catgtgggag aacgacabgg 2220 
bggagcagab gcaccaggac abcabcagcc bgbgggacca gagccbgaag cccbgcgbga 22 80 
agcbgacccc ccbgbgcgbg acccbgcact gcagcaaccg caccatcgac bacaacaacc 2340 
gcaccgacaa cabgggcggc gagabcaaga acbgcagcbb caacatgacc accgaggbgc 24 00 
gcgacaagcg cgagaaggbg cacgcccbgt bcbaccgccb ggacatcgbg ccccbgaaga 24 60 
acgagagcag caacaccagc ggcgacbacc gccbgabcaa cbgcaacacc agcgccabca 2520 
cccaggccbg ccccaaggbg agcbtcgacc ccabccccab ccacbacbgc gcccccgccg 2580 
gcbacgccab ccbgaagbgc aacaacaaga ccbbcaacgg caccggcccc tgcaacaacg 2640 
bgagcaccab ccagbgcacc cacggcacca agcccgbggb gagcacccag cbgcbgcbga 2700 
acggcagccb ggccgaggag gagatcabca bccgcagcaa gaaccbgacc gacaacgbga 2760 
agaccabcab cgbgcaccbg aacgagagcg bggagatcaa cbgcacccgc cgabcggccc 2820 
acbgcaacab cagccgcacc gccbggaaca agacccbgca ggaggbgggc aagaagcbgg 2880 
ccgagcacbb ccccaacaag gccabcaagt bcgccaagca cagcggcggc gaccbggaga 2940 
bcaccaccca cagcbbcaac bgccgcggcg agbtcbbcba cbgcaacacc agcagccbgb 3000 
bcaacagcac cbacaccccc aacagcaccg agaacabcac cggcaccgag aacagcabca 3060 
bcaccabccc cbgccgcabc aagcagabca bcaacabgbg gcagggcgbg ggccgcgcca 3120 
bgbacgcccc ccccabcgag ggcabccbga ccbgccgcag caacatcacc ggccbgcbgc 3180 
tgacccgcga cggcggcacc ggcabgcacg acaccgagab cbbccgcccc gagggcggcg 3240 
acabgcgcga caacbggcgc agcgagcbgt acaagbacaa ggbggbggag abcaagcccc 3300 
bgggcabcgc ccccaccaag gccaagcbba ccgbgcaggb gcgccagcbg cbgagcggca 3360 
Ccgbgcagca gcagagcaac cbgcbgcgcg ccabcgaggc ccagcagcac abgcbgcagc 3420 
bgaccgbgbg gggcabcaag cagcbgcaga cccgcgtgcb ggccatcgag cgcbacctgc 34 80 
gcgaccagca gcbcgaggac abcbggaaca acabgaccbg gabgcagbgg gaccgcgaga 3540 
bcagcaacba caccaacacc abcbaccgcc bgcbggagga cagccagaac cagcaggaga 3600 
agaacgagca ggaccbgcbg gcccbggaca agbggcagaa ccbgbggacc bggbbcggca 3660 
bcaccaacbg gcbgbggbac abcaagabcb bcabcaagab cgbgggcggc cbgabcggcc 3720 
tgcgcabcab cbbcgccgbg cbgagcabct gaacacgbgg gabccagabc bgcbgbgccb 3780 
bcbagbbgcc agccabcbgb bgtbbgcccc bcccccgbgc cbbccbbgac ccbggaaggb 3840 
gccacbccca cbgbccbbtc cbaabaaaab gaggaaabbg cabcgcabtg bcbgagbagg 3900 
bgbcabbcba bbcbgggggg bggggbgggg caggacagca agggggagga bbgggaagac 3960 
aabagcaggc abgcbgggga bgcggbgggc bcbabgggba cccaggtgcb gaagaabbga 4 020 
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cccggttcct cctgggccag aaagaagcag gcacatcccc ttctctgtga cacaccctgt 4080 
ccacgcccct ggttcbtagt tccagcccca ctcataggac actcatagct caggagggcb 4140 
ccgccttcaa tcccacccgc taaagtactt ggagcggtct cbcccbcccb catcagccca 4200 
ccaaaccaaa cctagcctcc aagagtggga agaaattaaa gcaagatagg cbattaagbg 4260 
cagagggaga gaaaatgcct ccaacabgbg aggaagtaat gagagaaabc abagaabbbb 4320 
aaggccatga tttaaggcca bcabggccbb aatcttccgc bbccbcgcbc acbgacbcgc 4380 
tgcgctcggt cgbtcggctg cggcgagcgg tatcagctca cbcaaaggcg gbaabacggb 4440 
tatccacaga abcaggggab aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 4500 
ccaggaaccg taaaaaggcc gcgttgctgg cgbbbbbcca taggctccgc ccccctgacg 4560 
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 4620 
accaggcgbb tccccctgga agctccctcg bgcgcbcbcc bgbbccgacc cbgccgcbba 4680 
ccggatacct ghccgccttt ctcccbtcgg gaagcgbggc gcbbbcbcab agcbcacgcb 4740 
gbaggbabcb cagbbcggbg baggbcgbbc gcbccaagcb gggcbgbgbg cacgaacccc 4800 
ccgtbcagcc cgaccgcbgc gccbbabccg gbaacbabcg bcbbgagbcc aacccggbaa 4860 
gacacgacbb atcgccactg gcagcagcca cbggbaacag gabbagcaga gcgaggbabg 4920 
baggcggbgc bacagagbbc bbgaagbggb ggccbaacta cggcbacact agaagaacag 4980 
tabtbggtat cbgcgcbcbg cbgaagccag bbaccbtcgg aaaaagagbb ggbagctcbb 5040 
gabccggcaa acaaaccacc gcbggtagcg gbggbbtbbt bgbbbgcaag cagcagabba 5100 
cgcgcagaaa aaaaggabcb caagaagabc cbbbgabcbb tbcbacgggg bcbgacgcbc 5160 
agbggaacga aaacbcacgt baagggabbb bggbcabgag abbabcaaaa aggabcbbca 5220 
ccbagabccb bbbaaabbaa aaabgaagbb bbaaabcaab cbaaagbaba tabgagbaaa 5280 
cbbggbcbga cagbbaccaa tgcbbaabca gbgaggcacc tabcbcagcg abcbgtcbab 5340 
bbcgbbcabc cabagbbgcc bgacbcgggg ggggggggcg cbgaggbcbg ccbcgtgaag 54 00 
aaggbgbbgc bgactcabac caggcctgaa bcgccccabc abccagccag aaagbgaggg 5460 
agccacggbb gabgagagcb bbgbbgtagg bggaccagbb ggbgatbbbg aacbbbbgcb 5520 
bbgccacgga acggbcbgcg bbgbcgggaa gabgcgbgab cbgabccbbc aacbcagcaa 5580 
aagbbcgabb babbcaacaa agccgccgbc ccgbcaagbc agcgbaatgc bcbgccagbg 5640 
bbacaaccaa bbaaccaabb cbgatbagaa aaacbcabcg agcatcaaab gaaacbgcaa 5700 
bbbabbcaba bcaggabbat caabaccaba bbbbbgaaaa agccgttbcb gtaabgaagg 5760 
agaaaacbca ccgaggcagb bccabaggab ggcaagabcc tggbatcggb ctgcgabbcc 5820 
gacbcgbcca acabcaabac aaccbabbaa bbbccccbcg bcaaaaabaa ggbbabcaag 5880 
bgagaaabca ccabgagbga cgacbgaabc cggbgagaab ggcaaaagcb babgcabbbc 5940 
bbbccagacb bgbbcaacag gccagccabb acgctcgbca tcaaaatcac bcgcabcaac 6000 
caaaccgbba bbcabbcgbg abbgcgccbg agcgagacga aabacgcgab cgctgbbaaa 6060 
aggacaabba caaacaggaa bcgaabgcaa ccggcgcagg aacactgcca gcgcatcaac 6120 
aababbbbca ccbgaabcag gatabbctbc baabaccbgg aabgcbgbbb bcccggggab 6180 
cgcagbggbg agbaaccabg catcabcagg agbacggaba aaabgcbtga bggbcggaag 6240 
aggcabaaab bccgbcagcc agbbbagbcb gaccabcbca bcbgbaacab cabbggcaac 6300 
gcbaccbbbg ccabgbbbca gaaacaacbc tggcgcabcg ggcbbcccab acaabcgaba 6360 
gabbgbcgca ccbgabbgcc cgacatbabc gcgagcccab bbabacccab ataaatcagc 6420 
abccabgbbg gaabbbaabc gcggccbcga gcaagacgbb bcccgbbgaa babggcbcab 6480 
aacaccccbb gtabbacbgb tbabgbaagc agacagbbbb abbgbbcatg abgabababb 6540 
tbbabcbtgb gcaabgbaac abcagagabb bbgagacaca acgbggcbbb cccccccccc 6600 
ccabbabtga agcabbbabc agggbbabbg bcbcabgagc ggabacabab btgaatgbab 6660 
bbagaaaaat aaacaaabag gggbbccgcg cacabbbccc cgaaaagbgc caccbgacgb 6720 
ctaagaaacc atbabbabca tgacabbaac cbabaaaaab aggcgtabca cgaggcccbb 6780 
bcgbc 6785 



<210> 125 
<211> 6716 
<212> DNA 

<213> Arbificial Sequence 
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<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<4O0> 125 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg hcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tabtaatagt aatcaatbac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 42 0 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcatbatgc ccagtacatg accttatggg acbttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc . acggggattt ccaagtctcc accccattga 780 
cgfccaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa B40 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggbg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 • 
ggatggggtc ccatttatta bbtacaaatt cacatataca acaacgccgt cccccgbgcc 13 80 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agcccbggbc ccatgccbcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagbgbgccg cacaaggccg tggcggbagg gtabgtgtcb 1620 
gaaaabgagc gtggagatbg ggctcgcacg gctgacgcag atggaagacb taaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbtgbb gtatbcbgat aagagbcaga ggtaactccc 1740 
gbtgcggtgc bgttaacggt ggagggcagb gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggbcttttc 1860 
bgcagtcacc gtcgtcgaca cgtgbgabca gabatcgcgg ccgctctaga gatabcgcca 1920 
ccatgcgcgt ggagggcabc cagcgcaact ggaagcagbg gbggatctgg ggcatccbgg 1980 
gcttcbggab ggbgabgabc bacaacgbgc gcggcaaccb gtgggbgacc gtgbactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggbgcacaac gtgbgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgcbggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 222 0 
tggagcagat gcaccaggac abcabcagcc tgtgggacca gagccbgaag cccbgcgtga 22 80 
agctgacccc ccbgbgcgbg accctgcacb gcagcaaccg caccatcgac bacaacaacc 234 0 
gcaccgacaa cabgggcggc gagabcaaga actgcagcbt caacatgacc accgaggbgc 2400 
gcgacaagcg cgagaaggtg cacgccctgt tctaccgcct ggacatcgtg cccctgaaga 24 60 
acgagagcag caacaccagc ggcgactacc gcctgabcaa ctgcaacacc agcgccatca 2520 
cccaggcctg ccccaaggtg agcttcgacc ccatccccat ccactacbgc gcccccgccg 2580 
gctacgccat cctgaagtgc aacaacaaga cctbcaacgg caccggcccc tgcaacaacg 264 0 
bgagcaccat ccagtgcacc cacggcacca agcccgbggt gagcacccag ctgcbgctga 2700 
acggcagcct ggccgaggag gagatcatca tccgcagcaa gaacctgacc gacaacgtga 2760 
agaccabcat cgbgcacctg aacgagagcg tggagabcaa cbgcacccgc cgatcggccc 2820 
actgcaacat cagccgcacc gccbggaaca agaccctgca ggaggbgggc aagaagctgg 28 80 
ccgagcactt ccccaacaag gccabcaagt tcgccaagca cagcggcggc gacctggaga 2940 
fccaccaccca cagcttcaac tgccgcggcg agtbcttcba ctgcaacacc cgatcgatcc 3000 
cctgccgcat caagcagatc atcaacatgt ggcagggcgt gggccgcgcc atgtacgccc 3060 
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cccccabcga gggcatcctg acctgccgca 
. acggcggcac cggcatgcac gacaccgaga 
acaactggcg cagcgagctg tacaagtaca 
cccccaccaa ggccaagctt accgbgcagg 
agcagagcaa ccbgctgcgc gccatcgagg 
ggggcatcaa gcagctgcag acccgcgtgc 
agchcgagga catctggaac aacabgacct 
acaccaacac cabctaccgc ctgctggagg 
aggacctgct ggccctggac aagtggcaga 
ggctgtggta catcaagatc ttcatcaaga 
tcttcgccgh gctgagcatc hgaacacgtg 
cagccatctg ttgtttgccc ctcccccgtg 
acbgbcctbb cctaataaaa tgaggaaatt 
attctggggg gtggggtggg gcaggacagc 
catgctgggg atgcggtggg ctctatgggt 
tcctgggcca gaaagaagca ggcacatccc 
tggtbcttag ttccagcccc actcatagga 
atcccacccg ctaaagtact tggagcggtc 
acctagcctc caagagtggg aagaaattaa 
agaaaatgcc tccaacatgt gaggaagtaa 
abbbaaggcc atcatggcct taatcttccg 
tcgttcggct gcggcgagcg gtabcagcbc 
aatcagggga taacgcagga aagaacatgt 
gtaaaaaggc cgcgttgctg gcgtttttcc 
aaaabcgacg ctcaagtcag aggtggcgaa 
ttccccctgg aagctccctc gtgcgctctc 
tgbccgccbb bcbcccbbcg ggaagcgtgg 
tcagttcggt gtaggtcgtt cgctccaagc 
ccgaccgctg cgccttatcc ggtaactatc 
tatcgccact ggcagcagcc acbggbaaca 
ctacagagtt cttgaagtgg tggcctaact 
tctgcgctct gctgaagcca gttaccttcg 
aacaaaccac cgctggtagc ggtggttttt 
aaaaaggabc tcaagaagat cctttgatch 
aaaactcacg ttaagggatt ttggtcatga 
htttaaabfca aaaatgaagt bbbaaabcaa 
acagtbacca atgchtaatc agtgaggcac 
ccatagbbgc cbgacbcggg gggggggggc 
ctgactcata ccaggcctga atcgccccat 
bgabgagagc bbbgbbgtag gfcggaccagt 
aacggbcbgc gttgtcggga agatgcgbga 
ttattcaaca aagccgccgt cccgtcaagt 
attaaccaat tctgattaga aaaactcatc 
atcaggatta tcaataccat abttbbgaaa 
accgaggcag ttccatagga tggcaagatc 
aacatcaata caacctatta abttccccbc 
accatgagtg acgactgaab ccggtgagaa 
ttgbbcaaca ggccagccab tacgctcgtc 
abtcabbcgb gattgcgcct gagcgagacg 
acaaacagga atcgaatgca accggcgcag 
acctgaatca ggatattctt ctaatacctg 
gagtaaccat gcatcatcag gagtacggab 
ttccgtcagc cagbbbagbc tgaccatctc 
gccatgtttc agaaacaact ctggcgcatc 
acctgattgc ccgacattah cgcgagccca 



gcaacabcac cggcctgctg ctgacccgcg 3120 
tctbccgccc cgagggcggc gacatgcgcg 3180 
aggtggtgga gatcaagccc cbgggcabcg 3240 
tgcgccagct gctgagcggc atcgtgcagc 3300 
cccagcagca catgctgcag ctgaccgbgt 3360 
tggccatcga gcgcbacctg cgcgaccagc 3420 
ggatgcagtg ggaccgcgag atcagcaact 3480 
acagccagaa ccagcaggag aagaacgagc 3540 
accbgbggac ctggbtcggc abcaccaacb 3600 
tcgtgggcgg cctgatcggc ctgcgcafcca 3660 
ggatccagat ctgctgtgcc ttcfcagttgc 3720 
ccttccttga ccctggaagg tgccactccc 3780 
gcatcgcatt gtctgagtag gtgtcattct 3840 
aa 99999 a 99 attgggaaga caabagcagg 3900 
acccaggbgc bgaagaabtg acccggbbcc 3960 
cbbcbctgtg acacaccctg bccacgcccc 4020 
cactcatagc tcaggagggc bccgcctbca 4080 
tcbccctccc bcabcagccc accaaaccaa 4140 
agcaagabag gctattaagt gcagagggag 4200 
tgagagaaat cabagaabtb baaggccatg 4260 
cttcctcgct cacbgacbcg ctgcgcbcgg 4320 
actcaaaggc ggbaatacgg tbabccacag 4380 
gagcaaaagg ccagcaaaag gccaggaacc 4440 
ataggcfcccg ccccccbgac gagcafccaca 4500 
acccgacagg acbataaaga taccaggcgt 4560 
cbgbbccgac ccbgccgctt accggabacc 4620 
cgcbbtctca bagcbcacgc tgtaggbabc 4680 
tgggcbgtgt gcacgaaccc cccgbtcagc 4740 
gbcbtgagbc caacccggta agacacgact 4800 
ggabtagcag agcgaggbab gbaggcggbg 4860 
acggcbacac bagaagaaca gtatttggba 4920 
gaaaaagagt bggbagctct tgabccggca 4980 
tbgbbbgcaa gcagcagabt acgcgcagaa 5040 
bttctacggg gtcbgacgct cagtggaacg 5100 
gattatcaaa aaggabcbbc accbagabcc 5160 
bcbaaagbat ababgagbaa acbbggbcbg 5220 
cbatcbcagc gabcbgbcba bbbcgtbcab 5280 
gcbgaggtct gccbcgbgaa gaaggtgtbg 5340 
cabccagcca gaaagbgagg gagccacggb 5400 
bggtgabtbb gaactbttgc bbtgccacgg 5460 
bctgafcccbt caacbcagca aaagbtcgab 5520 
cagcgtaabg cbcbgccagb gbbacaacca 5580 
gagcabcaaa tgaaactgca abbbatbcab 5640 
aagccgbbbc bgtaatgaag gagaaaacbc 5700 
ctggbatcgg bctgcgattc cgacbcgbcc 5760 
gbcaaaaaba aggbbabcaa gtgagaaatc 5820 
bggcaaaagc btabgcatbb cttbccagac 5880 
abcaaaabca cbcgcatcaa ccaaaccgtt 5940 
aaabacgcga bcgcbgbtaa aaggacaatb 6000 
gaacacbgcc agcgcabcaa caatattbtc 6060 
gaatgctgbb bbcccgggga bcgcagfcggb 6120 
aaaabgcbbg atggtcggaa gaggcataaa 61B0 
abcbgbaaca tcabbggcaa cgctaccbtb 6240 
gggcbbccca tacaatcgat agatbgbcgc 6300 
bbbabaccca tabaaabcag cabccabgtb 6360 
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ggaatttaat cgcggcctcg agcaagacgt ttcccgttga atatggctca taacacccct 6420 

tgtatfcactg tttabgtaag cagacagttt tattgttcat gatgatatat ttttatcttg 6480 

tgcaatgtaa catcagagat tttgagacac aacgtggctt tccccccccc cccattattg 6540 

aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta tbfcagaaaaa 6600 

taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac 6660 

cattattatc atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtc 6716 



<210> 126 
<211> 6797 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 126 . 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt. ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg fcatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgcgt ggagggcatc cagcgcaact ggaagcagtg gtggatctgg ggcatcctgg 1980 
gcttctggat ggtgatgatc tacaacgtgc gcggcaacct gtgggtgacc gtgtactacg 2040 
gcgtgcccgt gtggaaggag gccaagacca ccctgttctg cgccagcgac gccaaggcct 2100 
acgacgccga ggtgcacaac gtgtgggcca cccacgcctg cgtgcccacc gaccccaacc 2160 
cccaggagat ggtgctggag aacgtgaccg agaacttcaa catgtgggag aacgacatgg 2220 
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tggagcagat gcaccaggac atcatcagcc tgtgggacca gagcctgaag ccctgcgtga 2280 
agctgacccc cctgtgcgtg accctgcact gcagcaaccg caccatcgac bacaacaacc 2340 
gcaccgacaa catgggcggc gagabcaaga actgcagcbb caacatgacc accgaggtgc 2400 
gcgacaagcg cgagaaggtg cacgccctgt bctaccgccb ggacatcgtg cccctgaaga 2460 
acgagagcag caacaccagc ggcgactacc gccbgabcaa ctgcaacacc agcgccatca 2520 
cccaggcctg ccccaaggtg agcttcgacc ccatccccat ccacbacbgc gcccccgccg 2580 
gctacgccat ccbgaagbgc aacaacaaga ccttcaacgg caccggcccc bgcaacaacg 264 0 
bgagcaccat ccagbgcacc cacggcacca agcccgbggb gagcacccag cbgcbgcbga 2700 
acggcagcct ggccgaggag gagatcatca bccgcagcaa gaaccbgacc gacaacgbga 2760 
agaccatcat cgtgcaccbg aacgagagcg bggagabcaa cbgcacccgc cccaacaaca 2820 
acacccgcaa gagcatccgc abcggccccg gccaggcctt ctacgccacc ggcgagatca 2880 
tcggcgacat ccgccaggcc cactgcaaca bcagccgcac cgcctggaac aagaccctgc 2940 
aggaggbggg caagaagcbg gccgagcacb tccccaacaa ggccatcaag bbcgccaagc 3000 
acagcggcgg cgacctggag abcaccaccc acagcttcaa cbgccgcggc gagbbctbct 3060 
acbgcaacac ccgabcgabc ccctgccgca bcaagcagat cabcaacabg bggcagggcg 3120 
bgggccgcgc cabgtacgcc ccccccabcg agggcabcct gaccbgccgc agcaacatca 3180 
ccggcctgcb gctgacccgc gacggcggca ccggcabgca cgacaccgag atcbbccgcc 324 0 
ccgagggcgg cgacabgcgc gacaacbggc gcagcgagct gtacaagtac aaggtggbgg 33 00 
agabcaagcc ccbgggcabc gcccccacca aggccaagct taccgtgcag gtgcgccagc 33 60 
tgcbgagcgg cabcgtgcag cagcagagca accbgcbgcg cgccatcgag gcccagcagc 342 0 
acabgctgca gcbgaccgtg bggggcabca agcagcbgca gacccgcgbg ctggccatcg 34 80 
agcgcbaccb gcgcgaccag cagctcgagg acabcbggaa caacatgacc tggatgcagt 3540 
gggaccgcga gatcagcaac tacaccaaca ccatcbaccg cctgctggag gacagccaga 3600 
accagcagga gaagaacgag caggacctgc tggccctgga caagtggcag aacctgbgga 3660 
ccbggttcgg catcaccaac tggctgtggt acabcaagat cbbcatcaag atcgbgggcg 3720 
gccbgatcgg ccbgcgcabc atcbbcgccg bgcbgagcat cbgaacacgb gggabccaga 3780 
bcbgcbgbgc cbbcbagbbg ccagccatcb gtbgtbbgcc ccbcccccgb gccbbccttg 3840 
acccbggaag gbgccactcc cacbgbcctfc bccbaataaa atgaggaaat bgcabcgcab 3900 
bgtctgagba ggbgtcattc tabbcbgggg ggbggggbgg ggcaggacag caagggggag 3960 
gabtgggaag acaabagcag gcabgcbggg gabgcggbgg gcbcbabggg bacccaggbg 4020 
ctgaagaabt gacccggtbc cbcctgggcc agaaagaagc aggcacatcc cctbcbcbgb 4080 
gacacacccb gbccacgccc cbggtbctba gtbccagccc cacbcabagg acacbcabag 4140 
ctcaggaggg cbccgccbbc aabcccaccc gcbaaagtac tbggagcggt ctcbccctcc 4200 
cbcatcagcc caccaaacca aaccbagcct ccaagagtgg gaagaaatta aagcaagata 4260 
ggcfcabtaag bgcagaggga gagaaaatgc cbccaacatg tgaggaagba atgagagaaa 4320 
tcabagaabb bbaaggccat gatttaaggc cabcabggcc btaabcbtcc gcbbccbcgc 4380 
bcacbgacbc gctgcgcbcg gbcgbbcggc bgcggcgagc ggbabcagcb cacbcaaagg 4440 
cggbaabacg gttabccaca gaatcagggg abaacgcagg aaagaacabg bgagcaaaag 4500 
gccagcaaaa ggccaggaac cgbaaaaagg ccgcgbbgcb ggcgbbbtbc cabaggcbcc 4560 
gcccccctga cgagcabcac aaaaabcgac gcfccaagbca gaggbggcga aacccgacag 4 620 
gacbabaaag abaccaggcg bttccccctg gaagctcccb cgbgcgctcb ccbgttccga 4680 
cccbgccgcb baccggabac cbgbccgccb bbcbccctbc gggaagcgbg gcgcbtbcfcc 4740 
atagcbcacg cbgbaggbab ctcagbbcgg bgbaggbcgb tcgcbccaag cbgggctgbg 4 800 
bgcacgaacc ccccgbbcag cccgaccgcb gcgccbbabc cggbaactat cgbcbtgagb 4860 
ccaacccggt aagacacgac bbabcgccac bggcagcagc cactggtaac aggattagca 4 920 
gagcgaggta bgtaggcggb gcbacagagb bcbtgaagtg gbggcctaac tacggcbaca 4 980 
cbagaagaac agtabbbggb atctgcgcbc bgctgaagcc agbtacctbc ggaaaaagag 5040 
btggbagctc ttgabccggc aaacaaacca ccgctggbag cggbggbtbt bbbgtttgca 5100 
agcagcagab bacgcgcaga aaaaaaggat ctcaagaaga bccbbbgabc btbtctacgg 5160 
ggbcbgacgc bcagbggaac gaaaacbcac gbbaagggat tbbggbcabg agabtabcaa 5220 
aaaggabctb caccbagatc cbbbtaaabt aaaaabgaag tbbbaaatca abctaaagta 5280 
bababgagba aacbbggtcb gacagbtacc aafcgcbtaab cagbgaggca ccbatctcag 5340 
cgatcbgbcb abtbcgtbca bccatagttg cctgactcgg gggggggggg cgctgaggtc 5400 
tgcctcgtga agaaggbgbb gctgacbcab accaggccbg aabcgcccca bcatccagcc 5460 
agaaagbgag ggagccacgg bbgabgagag ctbtgbtgta ggbggaccag bbggbgattb 5520 
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tgaacttttg ctttgccacg gaacggtctg cgbbgtcggg aagatgcgbg abcbgatcct 5580 
tcaactcagc aaaagttcga bbbatbcaac aaagccgccg tcccgtcaag tcagcgtaat 5640 
gcbcbgccag tgttacaacc aabbaaccaa ttctgattag aaaaacbcab cgagcatcaa 5700 
atgaaactgc aattfcattca tatcaggatt atcaatacca babbtbtgaa aaagccgttt 5760 
ctgtaatgaa ggagaaaact caccgaggca gttccatagg abggcaagab ccbggtabcg 5820 
gbcbgcgabt ccgactcgtc caacatcaat acaacctatt aabbtccccb cgtcaaaaat 5880 
aaggbbatca agtgagaaat caccatgagt gacgactgaa bccggbgaga atggcaaaag 594 0 
cttatgcatt tctttccaga ctbgttcaac aggccagcca btacgcbcgb cabcaaaabc 6000 
actcgcabca accaaaccgb tattcattcg bgabbgcgcc tgagcgagac gaaatacgcg 6060 
atcgctgtta aaaggacaat tacaaacagg aatcgaatgc aaccggcgca ggaacactgc 6120 
cagcgcatca acaatabbbb cacctgaatc aggatattct bcbaabaccb ggaatgctgt 6180 
tttcccgggg atcgcagtgg tgagtaacca tgcabcatca ggagbacgga taaaatgctt 6240 
gatggtcgga agaggcataa abtccgbcag ccagtttagt ctgaccatct catctgtaac 6300 
abcabtggca acgctacctt bgccabgbbb cagaaacaac tctggcgcat cgggcttccc 6360 
atacaatcga tagattgtcg cacctgattg cccgacatba bcgcgagccc abtbabaccc 6420 
atataaatca gcatccatgt bggaatbtaa tcgcggcctc gagcaagacg bbtcccgbbg 6480 
aababggcbc ataacacccc ttgtattact gbtbabgtaa gcagacagtb ttattgttca 6540 
bgatgatata tttttatctt gtgcaatgta acatcagaga ttttgagaca caacgtggcb 6600 
ttcccccccc ccccattatt gaagcattta tcagggttat bgtctcatga gcggabacab 6660 
abtbgaabgb abbbagaaaa abaaacaaab aggggbbccg cgcacabbbc cccgaaaagb 6720 
gccaccbgac gbcbaagaaa ccabtabtat cabgacabba acctataaaa ataggcgbab 6780 
cacgaggccc bbbcgbc 6797 



<210> 127 
<211> 6718 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<4O0> 127 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbtgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggtg 120 
bbggcgggbg bcggggctgg cbbaacbabg cggcabcaga gcagabbgta cbgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
ctabtggcca bbgcabacgt bgbabccaba bcabaatabg tacabbbaba bbggctcatg 300 
tccaacabba ccgccabgbb gacabtgabb atbgactagb tabtaabagb aabcaabbac 360 
ggggbcabba gbbcabagcc cabababgga gtbccgcgbb acataacbba cggtaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg tcaabaabga cgbabgbbcc 480 
cabagbaacg ccaabaggga cbbtccabbg acgbcaabgg gbggagbabb bacggtaaac 540 
bgcccacbbg gcagbacabc aagtgbabca tabgccaagb acgcccccta btgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbbabggg acbbtccbac 660 
btggcagtac abcbacgtat bagtcabcgc babbaccabg gbgatgcggb bbbggcagba 720 
cabcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagtcbcc accccabbga 780 
cgbcaabggg agbbtgbbbb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb atataagcag 900 
agctcgbbba gbgaaccgbc agatcgccbg gagacgccab ccacgcbgbb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcatbg gaacgcggab 1020 
bccccgbgcc aagagbgacg baagbaccgc cbatagacbc bataggcaca ccccttbggc 1080 
bcbbatgcab gcbabacbgt btbbggcbbg gggccbabac acccccgcbb ccbbabgcba 1140 
baggbgahgg babagcbbag ccbabaggbg bgggbtatbg accabbatbg accactcccc 1200 
babbggbgac gabacbbbcc abbactaabc cabaacabgg ctcbbbgcca caactabcbc 1260 
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tattggctat atgccaatac 
ggabggggtc ccatttatta 
cgcagttttt atbaaacaba 
catgggctct tctccggtag 
agcggcbcab ggtcgctcgg 
agcacaabgc ccaccaccac 
gaaaabgagc gtggagattg 
gcagaagaag atgcaggcag 
gttgcggtgc bgtbaacggb 
cgcgccacca gacahaatag 
tgcagtcacc gtcgtcgaca 
ccatgcgtgt bcgbggbabc 
gtttctggafc gabcatcabc 
gtgttccggt bbggaccgac 
acgaccgtga agttcacaac 
cgcaggaaat cgtbctggaa 
ttgaccagat gcacgaagac 
aactgacccc gctgtgcgtt 
acaccaccac cgaaatccgt 
acatcgttct gctgaaagaa 
actgcaacgc ttccaccatc 
tccactactg cgctccggct 
gtaaaggtcc gtgcaacaac 
tttccaccca gctgctgctg 
aaaacctgac cgacaacgbb 
tttgcacccg tccgaacaac 
tctacgctac cggtgacatc 
ccaaatggaa cgaaaccctg 
acaaaaccat caaattcgct 
tcaactgccg bggbgaabbc 
ccgaagacga aaccatcacc 
gbgttggtcg bgctatgbac 
bcaccggbcb gctgctggtt 
gbccgggtgg tggtaacatg 
tcgaactgaa accgctgggb 
agctgcbgbc cbccatcgbb 
agcacatgct gcagctgacc 
bcgaacgtba cctgaaagac 
gggaccgbga aatctccaac 
cccagcagga aaaaaacgaa 
cctggttcga catctccaac 
gccagccatc tgttgtttgc 
ccactgtcct bbccbaabaa 
ctattctggg gggtggggbg 
ggcatgctgg ggatgcggtg 
cctcctgggc cagaaagaag 
cctggttctt agttccagcc 
caatcccacc cgctaaagta 
aaacctagcc tccaagagtg 
agagaaaatg cctccaacat 
tgatttaagg ccabcabggc 
ggtcgttcgg ctgcggcgag 
agaatcaggg gataacgcag 
ccgtaaaaag gccgcgttgc 
caaaaatcga cgctcaagtc 



tctgtccttc agagacbgac 
bbbacaaabb cacatataca 
gcgtgggatc bccacgcgaa 
cggcggagct bccacabccg 
cagctccttg ctcctaacag 
cagtgtgccg cacaaggccg 
ggctcgcacg gctgacgcag 
ctgagttgtt gtattctgat 
ggagggcagb gtagtctgag 
cbgacagacb aacagactgt 
cgbgbgabca gatatcgcgg 
ccgcgtaact ggccgcagtg 
bgccgbgbbg bbggbaacab 
gctaaaacca cccbgbbctg 
gtttgggcta cccacgcttg 
aacgbbaccg aaaacttcaa 
atcatctccc bgbgggacca 
accctgcact gcaccaacgc 
gacaaaaaac agcagggbba 
aaccgtaaca actccaacaa 
acccaggctt gcccgaaagt 
ggbtacgcba bcctgaaabg 
gtbtccaccg bbcagbgcac 
aacggbtccc bggcbgaaaa 
aaaaccatca bcgttcaccb 
aacacccgta aatccabgcg 
atcggbgaca bccgbcaggc 
aaacgbgbta aagaaaaact 
ccgbccbccg gbggbgaccb 
bbcbacbgca acaccacccg 
cbgccgtgcc gbatcaaaca 
gcbccgccga bcgcbggtaa 
cgbgacggbg gbgaagacaa 
aaagacaact ggcgttccga 
atcgcbccga ccggbgcbaa 
cagcagcagb ccaaccbgcb 
gbbbggggba tcaaacagct 
cagcagcbcg agatctggaa 
bacaccgaca ccatctaccg 
aaagacctgc tggcbcbgga 
bggcbgbggt gaggatccag 
ccctcccccg bgccttcctb 
aabgaggaaa bbgcatcgca 
gggcaggaca gcaaggggga 
ggctctabgg gbacccaggb 
caggcacabc cccttcbcbg 
ccacbcabag gacacbcaba 
ctbggagcgg bctcbccctc 
ggaagaaabt aaagcaagab 
gtgaggaagt aabgagagaa 
ctbaabcbbc cgcbbccbcg 
cggbabcagc bcacbcaaag 
gaaagaacat gbgagcaaaa 
bggcgbbbbt ccabaggctc 
agaggbggcg aaacccgaca 



acggacbcbg babttbbaca 1320 
acaacgccgb cccccgbgcc 13 80 
bcbcgggbac gbgbtccgga 1440 
agcccbggtc ccabgcctcc 1500 
bggaggccag acbbaggcac 1560 
bggcggbagg gbabgbgbcb 1620 
abggaagacb baaggcagcg 1680 
aagagtcaga ggtaactccc 1740 
cagbacbcgb tgcbgccgcg 1800 
bccbbtccab gggbcbbtbc 1860 
ccgcbcbaga gatabcgcca- 1920 
gtggatgbgg ggtabcctgg 1980 
gtgggtbacc gbtbactacg 2040 
cgctbccgac accaaagccb 2100 
cgbbccgacc gacccgaacc 2160 
cabgtggaaa aacgacatgg 2220 
gtcccbgaaa ccgbgcgbba 2280 
tagccgbaac tgcbccbtca 2340 
cgcbctgbbc taccgtccgg 2400 
ctccgaabac atccbgatca 2460 
baactbcgac ccgatcccga 2520 
caacaacaaa accbbcbccg 2580 
ccatggbabc aaaccggtbg 2640 
agaaabcabc atccgttccg 2700 
gaacaaabcc gbtgaaatcg 2760 
tabcggtccg ggtcagaccb 2820 
btacbgcaac abcbccggbb 2880 
gcaggaaaac bacaacaaca 2940 
ggaaabcacc acccacbccb 3000 
tcbgbbcaac aacaacgcba 3060 
gabcabcaac abgbggcagg 3120 
cabcaccbgc aaabccaaca 3180 
caaaaccgaa gaaabcbbcc 3240 
acbgtacaaa bacaaagbba 3300 
gcbbaccgbb caggcbcgbc 3360 
gcgbgcbabc gaagcbcagc 3420 
gcagacccgb gbbcbggcba 34 80 
caacabgacc bggabggaab 3 540 
bcbgcbggaa gacbcccaga 3600 
cbccbggaaa aaccbgbggb 3660 
atcbgcbgbg cctbcbagbb 3720 
gacccbggaa ggtgccacbc 3780 
btgbcbgagb aggbgbcabb 3840 
ggabbgggaa gacaabagca 3900 
gcbgaagaab bgacccggbb 3960 
bgacacaccc bgbccacgcc 4020 
gcbcaggagg gctccgccbb 4080 
ccbcabcagc ccaccaaacc 4140 
aggcbabbaa gbgcagaggg 4200 
atcabagaab bbbaaggcca 4260 
cbcacbgacb cgcbgcgcbc 4320 
gcggbaabac ggbbabccac 4380 
ggccagcaaa aggccaggaa 4440 
cgcccccctg acgagcatca 4500 
ggacbabaaa gataccaggc 4560 
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gtttccccct ggaagctccc bcgbgcgcbc tcctgttccg accctgccgc bbaccggaba 4620 
ccbgtccgcc tttctccctt cgggaagcgb ggcgctttct cabagcbcac gcbgtaggba 4680 
tctcagttcg gtgtagghcg bbcgcbccaa gcbgggctgb gtgcacgaac cccccgbbca 4740 
gcccgaccgc tgcgccttat ccggtaacta bcgtctbgag tccaacccgg taagacacga 4800 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 4860 
tgctacagag bbcbbgaagb ggtggcctaa ctacggctac actagaagaa cagtatttgg 4920 
tatctgcgct cbgcbgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 4980 
caaacaaacc accgctggta gcggtggttt bbbbgbbbgc aagcagcaga ttacgcgcag 5040 
aaaaaaagga tctcaagaag abccbbbgat cttttctacg gggtctgacg ctcagtggaa 5100 
cgaaaactca cgttaaggga bbtbggtcat gagattatca aaaaggatct tcacctagat 5160 
ccbbtbaaat taaaaatgaa gbbbbaaabc aatctaaagt atatatgagt aaacttggbc 5220 
bgacagbbac caabgcbbaa bcagbgaggc acctatctca gcgatctgtc tatttcgttc 5280 
atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg aagaaggtgt 5340 
tgctgactca baccaggccb gaatcgcccc atcatccagc cagaaagtga gggagccacg 5400 
gttgatgaga gcbtbgtbgb aggtggacca gttggtgatt ttgaactttt gctttgccac 5460 
ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag caaaagttcg 5520 
atbbabtcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca gtgttacaac 5580 
caattaacca attctgatta gaaaaactca tcgagcatca aabgaaacbg caatttattc 5640 
atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga aggagaaaac 5700 
tcaccgaggc agttccatag gabggcaaga tcctggtatc ggtctgcgat tccgactcgt 5760 
ccaacatcaa tacaacctat taabbbcccc bcgbcaaaaa taaggttatc aagtgagaaa 5820 
tcaccatgag tgacgactga atccggtgag aatggcaaaa gcbbabgcab btcbbbccag 5880 
acbbgbtcaa caggccagcc abbacgcbcg bcatcaaaab cacbcgcatc aaccaaaccg 5940 
tbabbcabbc gbgabbgcgc cbgagcgaga cgaaabacgc gabcgcbgtb aaaaggacaa 6000 
ttacaaacag gaatcgaabg caaccggcgc aggaacactg ccagcgcatc aacaababbb 6060 
bcaccbgaab caggababtc bbctaatacc bggaabgcbg bbbtcccggg gabcgcagbg 6120 
gtgagbaacc abgcabcatc aggagbacgg ataaaabgct tgabggtcgg aagaggcaba 6180 
aattccgtca gccagbtbag bcbgaccatc bcabcbgbaa catcabtggc aacgcbaccb 6240 
ttgccatgtb bcagaaacaa ctcbggcgca bcgggcbbcc cabacaabcg abagabbgbc 6300 
gcaccbgatb gcccgacabb atcgcgagcc cabbtabacc cababaaatc agcabccabg 6360 
btggaabbba abcgcggccb cgagcaagac gbbbcccgbb gaababggcb cabaacaccc 6420 
ctbgbatbac bgbbtabgba agcagacagt tbfcatbgtbc atgabgabat abbbbbabcb 6480 
bgbgcaabgb aacabcagag abtbbgagac acaacgtggc btbccccccc cccccatbab 6540 
bgaagcabbb abcagggbba btgtcbcabg agcggabaca tabbbgaatg babbbagaaa 6600 
aabaaacaaa taggggbbcc gcgcacabbb ccccgaaaag bgccaccbga cgbcbaagaa 6660 
accatbatba bcabgacabb aaccbabaaa aabaggcgta bcacgaggcc cbbbcgtc 6718 



<210> 128 
<211> 6586 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 128 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 12 0 
btggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgba cbgagagbgc 180 
accababgcg gtgbgaaaba ccgcacagab gcgtaaggag aaaabaccgc abcagatbgg 240 
cbabbggcca bbgcabacgb bgbabccaba bcataababg bacabbbaba bbggcbcabg 300 
bccaacabba ccgccabgbb gacabbgabb abbgacbagt babbaabagb aabcaabbac 360 
ggggbcatba gbbcabagcc cabatabgga gbbccgcgbb acabaacbta cggbaaabgg 420 
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cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 

catagtaacg ccaataggga ctttccatfcg acgtcaatgg gfcggagtatt tacggtaaac 540 

bgcccacttg gcagtacatc aagtgtabca tatgccaagt acgcccccta ttgacgtcaa 600 

tgacggtaaa tggcccgcct ggcattatgc ccagbacatg accttatggg acbbbccbac 660 

ttggcagtac abcbacgbat tagtcabcgc babbaccabg gtgatgcggt tttggcagta 720 

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgbaacaa 840 

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 

tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctbbggc 1080 

bcttabgcab gctatactgt ttttggcttg gggcctatac acccccgctt ccbbafcgcba 1140 

taggtgatgg tatagcttag cctataggtg bgggttattg accattattg accactcccc 1200 

tattggtgac gatactfctcc attactaatc cabaacabgg ctcbtbgcca caacbabctc 1260 

tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 

ggatggggtc ccabbtatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt atbaaacata gcgtgggabc tccacgcgaa tctcgggtac gtgtbccgga 1440 

cabgggcbcb tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg bggcggbagg gtatgtgtct 1620 

gaaaabgagc gtggagattg ggctcgcacg gctgacgcag abggaagacb taaggcagcg 1680 

gcagaagaag abgcaggcag cbgagbbgtb gtabtcbgab aagagtcaga ggtaactccc 174 0 

gttgcggtgc bgbtaacggb ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacabaabag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 

bgcagbcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgcbcbaga gatatcgcca 1920 

ccatgcgbgt bcgtggtatc ccgcgtaact ggccgcagtg gbggabgbgg ggtatcctgg 1980 

gtttctggat gatcatcatc tgccgtgttg ttggbaacat gbgggbbacc gtttactacg 2040 

gtgttccggt bbggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 

acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 

cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 2220 

ttgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgtta 2280 

aactgacccc gctgtgcgtt accctgcact gcaccaacgc tagcatcaac tgcaacgctt 2340 

ccaccatcac ccaggcttgc ccgaaagtta actbcgaccc gatcccgatc cactactgcg 2400 

ctccggcbgg btacgcbabc cbgaaabgca acaacaaaac cttcbccggt aaaggtccgb 2460 

gcaacaacgb ttccaccgbb cagbgcaccc atggbatcaa accggtbgbt bccacccagc 2520 

tgcbgcbgaa cggbtcccbg gctgaaaaag aaatcabcat ccgttccgaa aaccbgaccg 2580 

acaacgbbaa aaccabcabc gbbcacctga acaaatccgb bgaaabcgtt tgcacccgbc 2640 

cgaacaacaa cacccgbaaa bccabgcgba tcggbccggg bcagaccbbc bacgcbaccg 2700 

gtgacabcab cggbgacabc cgbcaggctt acbgcaacab cbccggtbcc aaabggaacg 2760 

aaacccbgaa acgbgbbaaa gaaaaacbgc aggaaaacba caacaacaac aaaaccabca 2820 
aatbcgctcc gbccbccggb ggbgaccbgg aaatcaccac . ccactccbtc aactgccgbg 2 880 

gtgaabtctb ctacbgcaac accacccgtc bgttcaacaa caacgcbacc gaagacgaaa 2940 

ccatcacccb gccgtgccgb abcaaacaga bcatcaacab gtggcagggt gtbggbcgbg 3 000 

cbatgbacgc bccgccgabc gcbggtaaca bcacctgcaa abccaacatc accggbcbgc 3060 

tgcbggbtcg bgacggbggb gaagacaaca aaaccgaaga aabctbccgt ccgggbggbg 3120 

gtaacabgaa agacaacbgg cgbbccgaac bgbacaaaba caaagtbatc gaactgaaac 3180 

cgcbgggtab cgcbccgacc ggtgcbaagc bbaccgtbca ggcbcgbcag cbgctgbccb 3240 
ccatcgbtca gcagcagbcc aacctgctgc gbgcbatcga agcbcagcag cacatgcbgc 3300 

agcbgaccgb bbggggbatc aaacagctgc agacccgbgb bcbggcbabc gaacgbbacc 3360 

bgaaagacca gcagctcgag abctggaaca acabgaccbg gatggaabgg gaccgbgaaa 3420 
bcbccaacba caccgacacc abcbaccgtc bgcbggaaga. ctcccagacc cagcaggaaa 3480 

aaaacgaaaa agacctgcbg gctcbggact cctggaaaaa cctgtggbcc bggbbcgaca 3540 

tctccaacbg gcbgtggbga ggabccagat cbgctgbgcc btctagtbgc cagccabcbg 3600 

btgbbtgccc cbcccccgbg ccttccbtga cccbggaagg bgccactccc acbgtcctbb 3660 

cctaabaaaa bgaggaaabb gcabcgcabt gbcbgagbag gbgbcaCbcb abbctggggg 3720 
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gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg 3780 
atgcggtggg cbcbabgggb acccaggtgc bgaagaabbg acccggttcc bccbgggcca 3840 
gaaagaagca ggcacatccc cttctctgtg acacaccctg bccacgcccc tggttcttag 3900 
ttccagcccc actcatagga cactcatagc tcaggagggc tccgccttca atcccacccg 3960 
ctaaagtact tggagcggtc tctccctccc bcabcagccc accaaaccaa acctagcctc 4020 
caagagbggg aagaaabbaa agcaagatag gctabbaagb gcagagggag agaaaatgcc 4080 
tccaacatgt gaggaagtaa bgagagaaab catagaabbb taaggccatg atttaaggcc 4140 
abcabggcct taatcttccg cttcctcgct cactgactcg ctgcgctcgg bcgbbcggcb 4200 
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 4260 
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 432 0 
cgcgttgctg gcgtttttcc abaggcbccg ccccccbgac gagcatcaca aaaatcgacg 4380 
ctcaagtcag aggtggcgaa acccgacagg actataaaga baccaggcgt bbcccccbgg 4440 
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggabacc tgtccgcctt 4 500 
tctcccttcg ggaagcgtgg cgctbtcbca bagctcacgc bgbaggbabc bcagbbcggb 4560 
gbaggbcgbb cgctccaagc bgggctgbgb gcacgaaccc cccgtbcagc ccgaccgcbg 4 62 0 
cgcctbabcc ggbaacbabc gbctbgagtc caacccggba agacacgacb tabcgccacb 4 680 
ggcagcagcc actggbaaca ggattagcag agcgaggtat gbaggcggbg cbacagagbb 4 740 
ctbgaagtgg tggccbaacb acggcbacac bagaagaaca gbatttggba bcbgcgcbcb 4 800 
gctgaagcca gtbacctbcg gaaaaagagb tggbagctcb bgabccggca aacaaaccac 4 860 
cgctggtagc ggtggbbbtt btgbbbgcaa gcagcagabb acgcgcagaa aaaaaggabc 4 920 
tcaagaagab ccbbbgatcb bbbcbacggg gbcbgacgcb cagbggaacg aaaacbcacg 4980 
bbaagggabb bbggbcabga gabbatcaaa aaggabcbbc accbagatcc bttbaaabba 5 04 0 
aaaabgaagb bbbaaabcaa bcbaaagbab abatgagbaa acbbggbcbg acagtbacca 5100 
abgcbbaabc agbgaggcac cbabctcagc gabcbgtcba btbcgbbcab ccabagbbgc 5160 
cbgacbcggg gggggggggc gcbgaggbcb gccbcgbgaa gaaggbgbbg cbgacbcaba 5220 
ccaggccbga abcgccccab cabccagcca gaaagbgagg gagccacggb bgabgagagc 5280 
tbbgbbgbag gbggaccagt bggbgabbbb gaacbtbtgc bbbgccacgg aacggbctgc 5340 
gbbgbcggga agatgcgtga bcbgabccbb caacbcagca aaagtbcgab tbabbcaaca 5400 
aagccgccgb cccgbcaagb cagcgtaabg cbcbgccagb gttacaacca abbaaccaat 546 0 
bcbgabbaga aaaacbcabc gagcatcaaa bgaaacbgca abttatbcab abcaggabba 5520 
bcaabaccab abtbbbgaaa aagccgbtbc bgtaabgaag gagaaaacbc accgaggcag 5580 
tbccabagga bggcaagabc cbggbabcgg bcbgcgabbc cgacbcgbcc aacabcaaba 5640 
caaccbabba abtbccccbc gbcaaaaaba aggbbatcaa gbgagaaabc accabgagbg 5700 
acgacbgaab ccggbgagaa bggcaaaagc bbabgcabbt cbbtccagac btgbbcaaca 5760 
ggccagccab bacgcbcgtc abcaaaabca ctcgcabcaa ccaaaccgbb atbcabbcgb 5820 
gabbgcgccb gagcgagacg aaabacgcga bcgcbgbbaa aaggacaabb acaaacagga 5880 
abcgaabgca accggcgcag gaacacbgcc agcgcabcaa caatabtbbc accbgaabca 5940 
ggababbcbb cbaabaccbg gaabgcbgbb bbcccgggga bcgcagbggb gagbaaccab 6000 
gcabcabcag gagbacggab aaaabgcbbg abggbcggaa gaggcabaaa btccgtcagc 6060 
cagbbbagbc bgaccabcbc abcbgbaaca bcabbggcaa cgcbacctbt gccatgbbbc 6120 
agaaacaacb cbggcgcabc gggcbbccca bacaabcgab agabbgbcgc accbgatbgc 6180 
ccgacabbab cgcgagccca bbbabaccca babaaabcag cabccabgbb ggaabbbaab 6240 
cgcggccbcg agcaagacgb bbcccgbbga ababggcbca baacaccccb bgbabbacbg 6300 
ttbabgbaag cagacagbbb babbgbbcab gabgababab tbbbabcbbg bgcaabgbaa 6360 
cabcagagab bbbgagacac aacgbggcbb bccccccccc cccatbabbg aagcabbbab 642 0 
cagggbbatb gbcbcatgag cggabacaba bbbgaabgba bbbagaaaaa baaacaaaba 6480 
ggggbbccgc gcacabtbcc ccgaaaagbg ccaccbgacg tcbaagaaac cabtabbabc 6540 
abgacabbaa ccbataaaaa baggcgbabc acgaggccct bbcgbc 6586 



<210> 129 
<211> 6505 
<212> DNA 

<213> Arbificial Sequence 
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<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 129 

tcgcgcgttt cggtgabgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gbaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
tbggcgggbg tcggggctgg cttaactatg cggcatcaga gcagattgta cbgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgbatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gtbcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga cbttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagbgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc babtaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaab gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgt.tta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tabaggcaca cccctttggc 1080 
tcttatgcat gcbatacbgb ttttggcbtg gggcctatac acccccgctt ccttabgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
battggtgac gabacttbcc attactaatc cataacatgg ctctttgcca caacbatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg batbtbtaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttbbt abtaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgbtccgga 144 0 
cabgggctct tct<*cggbag cggcggagcb tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccbtg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggbagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag abgcaggcag ctgagttgtt gtattctgat aagagtcaga ggbaactccc 1740 
gttgcggbgc tgbtaacggt ggagggcagt gbagtctgag cagtactcgt tgcbgccgcg 1800 
cgcgccacca gacabaatag ctgacagact aacagactgt tcctttccat gggtcbtttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gababcgcgg ccgctctaga gabatcgcca 1920 
ccatgcgbgt tcgtggbatc ccgcgtaacb ggccgcagtg gbggabgtgg ggtatcctgg 1980 
gtttctggab gatcatcatc tgccgtgttg tbggbaacat gbgggtbacc gtttactacg 2040 
gtgbtccggb ttggaccgac gctaaaacca ccctgbbctg cgctbccgac accaaagccb 2100 
acgaccgtga agbtcacaac gtttgggcta cccacgcbbg cgttccgacc gacccgaacc 2160 
cgcaggaaab cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacabgg 2220 
ttgaccagab gcacgaagac abcatctccc tgbgggacca gbccctgaaa ccgbgcgbta 2280 
aactgacccc gctgtgcgtt accctgcact gcaccaacgc tagcatcaac tgcaacgctt 2340 
ccaccatcac ccaggcbtgc ccgaaagtta acttcgaccc gatcccgatc cacbactgcg 24 0 0 
cbccggcbgg bbacgcbatc ctgaaatgca acaacaaaac cbtctccggt aaaggtccgb 2460 
gcaacaacgb ttccaccgtt cagtgcaccc atggtabcaa accggbtgtt tccacccagc 2520 
bgcbgcbgaa cggbbccctg gctgaaaaag aaatcabcat ccgfctccgaa aacctgaccg 2580 
acaacgttaa aaccatcatc gbbcacctga acaaatccgb tgaaatcgtb bgcacccgtg 2640 
ctagcgctta ctgcaacatc bccggttcca aatggaacga aaccctgaaa cgbgttaaag 2700 
aaaaacbgca ggaaaacbac aacaacaaca aaaccatcaa atbcgctccg tccbccggtg 2760 
gtgacctgga aatcaccacc cactccttca actgccgtgg tgaattcttc tactgcaaca 2820 
ccacccgbcb gtbcaacaac aacgctaccg aagacgaaac catcaccctg ccgbgccgba 2880 
tcaaacagab catcaacatg tggcagggtg ttggtcgtgc batgtacgcfc ccgccgabcg 294 0 
ctggfcaacab caccfcgcaaa tccaacatca ccggtctgct gcbggttcgt gacggtggtg 3000 
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aagacaacaa aaccgaagaa atcttccgtc cgggtggtgg taacatgaaa gacaactggc 3060 
gbbccgaacb gtacaaatac aaagttatcg aactgaaacc gcbgggbabc gctccgaccg 3120 
gtgctaagct baccgbbcag gctcgtcagc tgctgtcctc catcgttcag cagcagbcca 3180 
acctgctgcg bgcbabcgaa gctcagcagc acatgctgca gctgaccgtt tggggtatca 3240 
aacagctgca gacccgtgtt ctggctatcg aacgbbaccb gaaagaccag cagcbcgaga 3300 
tctggaacaa catgacctgg abggaabggg accgtgaaat ctccaactac accgacacca 3360 
bcbaccgbcb gcbggaagac bcccagaccc agcaggaaaa aaacgaaaaa gaccbgcbgg 3420 
cbcbggacbq cbggaaaaac cbgtggbccb ggtbcgacat cbccaacbgg cbgbggbgag 3480 
gabccagatc tgcbgbgccb bcbagbbgcc agccabcbgt bgbttgcccc bcccccgbgc 3540 
ctbccbtgac ccbggaaggb gccactccca cbgbccbbbc cbaabaaaab gaggaaabbg 3600 
cabcgcabbg bcbgagbagg bgbcabbcba bbcbgggggg bggggbgggg caggacagca 3 660 
agggggagga tbgggaagac aabagcaggc abgcbgggga bgcggtgggc bctatgggba 3 72 0 
cccaggbgcb gaagaabtga cccggbbcct cctgggccag aaagaagcag gcacabcccc 3 780 
btcbcbgbga cacacccbgt ccacgccccb ggtbcbbagt bccagcccca cbcabaggac 3 840 
actcabagcb caggagggct ccgccbbcaa bcccacccgc baaagtacbb ggagcggbct 3900 
ctcccbcccb cabcagccca ccaaaccaaa ccbagccbcc aagagtggga agaaabbaaa 3960 
gcaagabagg cbatbaagbg cagagggaga gaaaabgccb ccaacabgbg aggaagbaat 4 020 
gagagaaabc abagaatbbt aaggccabga bbtaaggcca bcatggccbt aatcbtccgc 4 080 
btcctcgcbc actgacbcgc bgcgcbcggb cgtbcggcbg cggcgagcgg batcagcbca 4140 
cbcaaaggcg gbaabacggt babccacaga abcaggggat aacgcaggaa agaacabgbg 4200 
agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgtbgcbgg cgtbbtbcca 4 260 
baggcbccgc cccccbgacg agcabcacaa aaabcgacgc bcaagbcaga ggtggcgaaa 4 320 
cccgacagga cbabaaagab accaggcgbb bcccccbgga agcbcccbcg bgcgcbcbcc 4 3 80 
bgbbccgacc cbgccgcbba ccggabaccb gbccgccbbt cbcccbtcgg gaagcgbggc 4440 
gcbbbcbcab agcbcacgcb gbaggbabcb cagbbcggbg baggbcgbbc gctccaagcb 4500 
gggcbgbgbg cacgaacccc ccgtbcagcc cgaccgcbgc gcctbatccg gtaacbatcg 4560 
bcbbgagbcc aacccggtaa gacacgactb abcgccacbg gcagcagcca ctggbaacag 4 620 
gatbagcaga gcgaggbabg baggcggbgc bacagagbbc bbgaagbggb ggccbaacba 4 680 
cggcbacacb agaagaacag babbbggbab cbgcgcbcbg cbgaagccag bbaccbbcgg 4740 
aaaaagagbb ggtagcbcbt gabccggcaa acaaaccacc gcbggbagcg gbggfcbbtbb 4 800 
bgbbbgcaag cagcagabba cgcgcagaaa aaaaggabct caagaagatc cbbbgabcbb 4 860 
bbcbacgggg bctgacgcbc agbggaacga aaacbcacgt baagggabbb bggbcabgag 4920 
atbabcaaaa aggabcbbca ccbagabccb bbtaaabbaa aaabgaagbb bbaaabcaab 4980 
cbaaagbaba babgagbaaa cbbggbcbga cagbbaccaa bgcbbaabca gbgaggcacc 5040 
babcbcagcg abcbgbcbab bbcgbbcabc cabagtbgcc bgacbcgggg ggggggggcg 5100 
cbgaggbcbg cctcgbgaag aaggbgbbgc bgacbcabac caggccbgaa bcgccccabc 5160 
atccagccag aaagbgaggg agccacggtb gatgagagct bbgtbgbagg bggaccagbb 5220 
ggbgabbbbg aacbbbbgcb bbgccacgga acggbcbgcg bbgtcgggaa gabgcgbgab 5280 
cbgatccbbc aacbcagcaa aagbbcgabb babbcaacaa agccgccgbc ccgbcaagbc 5340 
agcgbaabgc bcbgccagbg bbacaaccaa bbaaccaabb cbgabbagaa aaacbcatcg 54 00 
agcatcaaab gaaacbgcaa bbbabbcaba bcaggatbab caabaccaba bbbtbgaaaa 5460 
agccgbbbcb gbaabgaagg agaaaacbca ccgaggcagb bccabaggab ggcaagatcc 5520 
bggbabcggb cbgcgabtcc gacbcgbcca acabcaatac aaccbabbaa bbbccccbcg 5580 
bcaaaaabaa ggbbabcaag bgagaaabca ccabgagtga cgacbgaatc cggbgagaab 5640 
ggcaaaagcb babgcabbbc bbbccagacb bgbbcaacag gccagccabb acgcbcgbca 5700 
bcaaaabcac bcgcabcaac caaaccgbba bbcabbcgbg abbgcgccbg agcgagacga 5760 
aabacgcgab cgcbgbbaaa aggacaabba caaacaggaa bcgaabgcaa ccggcgcagg 5B20 
aacacbgcca gcgcabcaac aababbtbca ccbgaabcag gababbcbbc baabaccbgg 5B80 
aabgcbgbbb bcccggggab cgcagbggbg agbaaccabg cabcabcagg agbacggaba 594 0 
aaabgcbbga bggbcggaag aggcabaaab bccgbcagcc agbbbagbcb gaccabcbca 6000 
bcbgbaacab cabbggcaac gcbacctbbg ccabgbbtca gaaacaactc bggcgcabcg 6060 
ggcbtcccab acaabcgaba gabbgbcgca ccbgabbgcc cgacabbatc gcgagcccab 6120 
tbabaxrccab abaaabcagc abccabgbbg gaatbbaabc gcggccbcga gcaagacgbb 6180 
bcccgbbgaa babggcbcab aacaccccbb gbabbactgt bbabgbaagc agacagbbtb 6240 
abbgtbcabg abgabababb bbbabctbgb gcaabgbaac abcagagatb bbgagacaca 6300 
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acgtggcttt cccccccccc ccattattga agcatttatc agggttattg tctcatgagc 6360 

ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 6420 

cgaaaagtgc cacctgacgt ctaagaaacc attattatca tgacattaac ctataaaaat 64 80 
aggcgtatca cgaggccctt tcgtc 6505 



<210> 130 
<211> 6466 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 130 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 60 0 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 72 0 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggtatcctgg 1980 
gtttctggat gatcatcatc tgccgtgctg ttggtaacat gtgggttacc gtttactacg 2040 
gtgttccggt ttggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 2220 
ttgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgtta 2280 
aactgacccc gctgtgcgtt accctgcact gcaccaacgc tagcatcaac tgcaacgctt 2340 
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ccaccatcac ccaggcttgc ccgaaagbba acbbcgaccc gabcccgabc cactactgcg 2400 

ctccggctgg ttacgctatc ctgaaatgca acaacaaaac cttctccggt aaaggtccgt 2460 

gcaacaacgt ttccaccgtt cagbgcaccc atggtatcaa accggttgtt tccacccagc 2520 

tgctgctgaa cggttccctg gctgaaaaag aaatcatcat ccgttccgaa aaccbgaccg 2580 

acaacgbbaa aaccatcatc gttcacctga acaaatccgt bgaaabcgbb tgcacccgtg 2640 

cbagcgcbba ctgcaacatc tccggttcca aatggaacga aaccctgaaa cgbgbbaaag 2700 

aaaaactgca ggaaaactac aacaacaaca aaaccatcaa attcgctccg tcctccggtg 2760 

gtgacctgga aabcaccacc cacbccbbca actgccgtgg tgaattcttc tactgcaaca 2820 

ccgctagcct gccgtgccgt abcaaacaga tcatcaacat gtggcagggt gtbggbcgbg 2880 

cbabgbacgc tccgccgatc gctggbaaca tcacctgcaa atccaacatc accggtctgc 2940 

tgctggttcg bgacggtggb gaagacaaca aaaccgaaga aatcttccgt ccgggtggtg 3000 

gtaacatgaa agacaactgg cgttccgaac tgtacaaata caaagbbabc gaactgaaac 3060 

cgcbgggbab cgctccgacc ggtgctaagc bbaccgbbca ggctcgtcag ctgctgtcct 3120 

ccatcgttca gcagcagtcc aacctgctgc gbgcbabcga agctcagcag cacatgctgc 3180 

agctgaccgt ttggggtatc aaacagctgc agacccgtgt tctggctatc gaacgbtacc 3240 

tgaaagacca gcagctcgag abcbggaaca acabgaccbg gatggaatgg gaccgbgaaa 3300 

tctccaacta caccgacacc abcbaccgbc tgctggaaga ctcccagacc cagcaggaaa 3360 

aaaacgaaaa agacctgctg gctctggact cctggaaaaa cctgtggtcc tggttcgaca 3420 

tctccaactg gctgtggtga ggatccagat cbgcbgbgcc bbcbagbtgc cagccabctg 3480 

bbgtbtgccc ctcccccgbg ccttccttga ccctggaagg tgccactccc actgtccbtt 3540 

ccbaabaaaa bgaggaaatt gcatcgcabb gbcbgagbag gbgbcabtcb abtcbggggg 3600 

gbggggbggg gcaggacagc aagggggagg atbgggaaga caabagcagg cabgctgggg 3660 

abgcggbggg cbcbatgggt acccaggbgc tgaagaatbg acccggbbcc tccbgggcca 3720 

gaaagaagca ggcacabccc cbbcbcbgbg acacacccbg bccacgcccc tggbtcbbag 3780 

bbccagcccc actcatagga cacbcabagc tcaggagggc bccgccbbca abcccacccg 3 840 

cbaaagbact bggagcggtc bcbcccbccc tcabcagccc accaaaccaa accbagccbc 3900 

caagagbggg aagaaabbaa agcaagatag gctabbaagb gcagagggag agaaaabgcc 3960 

bccaacabgb gaggaagbaa bgagagaaat cabagaabbb baaggccatg abbtaaggcc 4020 

abcabggccb taabcbbccg cbbccbcgct cacbgacbcg cbgcgcbcgg bcgtbcggcb 4080 

gcggcgagcg gtabcagcbc acbcaaaggc ggbaatacgg ttabccacag aatcagggga 4140 

baacgcagga aagaacabgb gagcaaaagg ccagcaaaag gccaggaacc gbaaaaaggc 4200 

cgcgtbgcbg gcgbbbbtcc abaggcbccg ccccccbgac gagcabcaca aaaabcgacg 4260 

cbcaagbcag aggbggcgaa acccgacagg actabaaaga baccaggcgt btcccccbgg 4320 

aagctcccbc gtgcgcbcbc cbgbbccgac cctgccgcbt accggabacc tgbccgccbb 4380 

bctcccbbcg ggaagcgtgg cgcbbbcbca tagcbcacgc bgbaggbabc bcagtbcggb 4440 

gbaggbcgbb cgcbccaagc bgggctgbgt gcacgaaccc cccgbbcagc ccgaccgcbg 4500 

cgccbbabcc ggbaacbabc gtcbbgagbc caacccggba agacacgacb batcgccacb 4560 

ggcagcagcc acbggbaaca ggabbagcag agcgaggtab gbaggcggbg cbacagagbb 4620 

cbbgaagbgg bggccbaacb acggcbacac tagaagaaca gbabbbggba tcbgcgcbcb 4680 

gcbgaagcca gbtaccbtcg gaaaaagagb tggbagcbcb bgabccggca aacaaaccac 4740 

cgcbggbagc ggtggbbtbb bbgbbbgcaa gcagcagabb acgcgcagaa aaaaaggabc 4800 

bcaagaagab ccbbbgatcb bbbcbacggg gbcbgacgcb cagbggaacg aaaactcacg 4860 

bbaagggatb bbggbcabga gabbabcaaa aaggabcbbc accbagabcc tbtbaaabba 4920 

aaaabgaagb bbbaaabcaa bcbaaagtab ababgagbaa acbtggtcbg acagtbacca 4980 

abgcbbaabc agbgaggcac ctabcbcagc gabcbgbcba btbcgbtcab ccabagbbgc 5040 

ctgacbcggg gggggggggc gcbgaggbcb gccbcgbgaa gaaggbgbbg cbgacbcaba 5100 

ccaggccbga abcgccccab cabccagcca gaaagbgagg gagccacggt bgabgagagc 5160 

bbbgtbgbag gbggaccagb bggbgabtbb gaacbbbbgc bbbgccacgg aacggtcbgc 5220 

gbbgbcggga agabgcgbga bcbgabccbb caacbcagca aaagbbcgab bbabtcaaca 5280 

aagccgccgb cccgbcaagb cagcgbaabg cbcbgccagt gbbacaacca abbaaccaab 5340 

bctgabbaga aaaacbcabc gagcabcaaa bgaaactgca abbbabbcab abcaggabba 5400 

bcaabaccab abbbbtgaaa aagccgbbbc bgbaabgaag gagaaaacbc accgaggcag 5460 

bbccabagga bggcaagabc ctggbabcgg bcbgcgabbc cgacbcgtcc aacabcaaba 5520 

caaccbabba abtbccccbc gtcaaaaaba aggbbabcaa gbgagaaabc accatgagbg 5580 

acgacbgaat ccggbgagaa bggcaaaagc tbabgcabbb cbbbccagac tbgbtcaaca 5640 
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ggccagccat tacgctcgtc atcaaaatca ctcgcatcaa ccaaaccgtt attcattcgt 5700 
gattgcgcct gagcgagacg aaatacgcga tcgctgttaa aaggacaatt acaaacagga 5760 
atcgaatgca accggcgcag gaacactgcc agcgcatcaa caatattttc accbgaabca 5820 
ggabatbctb ctaatacctg gaatgctgtt ttcccgggga bcgcagtggt gagtaaccat 5880 
gcatcatcag gagtacggat aaaabgcttg atggbcggaa gaggcataaa ttccgtcagc 594 0 
cagbbtagfcc tgaccatcbc abcbgtaaca tcabtggcaa cgctacctbt gccatgbfctc 6000 
agaaacaact cbggcgcatc gggcttccca tacaabcgat agabtgbcgc accbgabtgc 6060 
ccgacabtab cgcgagccca tttataccca tabaaabcag catccabgbt ggaatttaab 6120 
cgcggcctcg agcaagacgb btcccgtbga abatggctca taacaccccb bgbatbacbg 6180 
bbtabgbaag cagacagbbb tattgbbcat gabgababab btbbatcbbg bgcaabgbaa 6240 
cabcagagab ttbgagacac aacgbggcbt bccccccccc cccattabbg aagcabbbab 6300 
cagggttabb gtctcatgag cggabacaba bbtgaabgta tttagaaaaa taaacaaaba 6360 
ggggtbccgc gcacabtbcc ccgaaaagbg ccaccbgacg tctaagaaac cabtabbabc 6420 
abgacattaa ccbataaaaa baggcgbabc acgaggcccb btcgbc 6466 



<210> 131 
<211> 6547 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 131 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggbca 60 
cagcttgtcb gtaagcggat gccgggagca gacaagcccg tcagggcgcg bcagcgggtg 120 
btggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatabgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagabtgg 240 
cbabtggcca ttgcatacgt tgtatccata tcataatatg tacatbtaba tbggctcatg 300 
bccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaatbac 360 
ggggbcatta gttcatagcc catatabgga gbtccgcgbb acataactta cggtaaabgg 420 
cccgcctggc bgaccgccca acgacccccg cccattgacg tcaataatga cgtatgtbcc 480 
catagbaacg ccaabaggga cbtbccabtg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtabca tatgccaagt acgcccccta ttgacgbcaa 600 
bgacggbaaa tggcccgccb ggcattabgc ccagtacabg accttatggg actttccbac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccabg gbgabgcggt ttbggcagta 720 
catcaabggg cgtggatagc ggbtbgacbc acggggabbt ccaagtctcc accccabbga 780 
cgtcaabggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggbag gcgtgtacgg bgggaggtcb atabaagcag 900 
agctcgttta gtgaaccgbc agabcgcctg gagacgccab ccacgctgtb ttgaccbcca 960 
bagaagacac cgggaccgab ccagcctccg cggccgggaa cggtgcabbg gaacgcggat 1020 
tccccgtgcc aagagbgacg baagtaccgc ctatagactc bataggcaca ccccttbggc 1080 
bctbabgcat gctatactgt ttttggcttg gggcctabac acccccgctt ccttatgcta 1140 
baggtgatgg tatagctbag cctabaggbg tgggbtattg accatbattg accactcccc 1200 
tattggtgac gatactttcc attactaabc cataacatgg ctcbttgcca caacbabctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg batbttbaca 132 0 
ggatggggbc ccatttatba bttacaaabt cacatataca acaacgccgt cccccgbgcc 1380 
cgcagbbttt attaaacaba gcgbgggatc bccacgcgaa bcbcgggbac gtgbtccgga 1440 
cabgggctct tctccggbag cggcggagct bccacatccg agccctggbc ccabgccbcc 1500 
agcggctcab ggtcgctcgg cagcbccttg ctcctaacag tggaggccag actbaggcac 1560 
agcacaatgc ccaccaccac cagbgbgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagatbg ggcbcgcacg gcbgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagbbgtb gbattctgat aagagbcaga ggbaactccc 1740 



- 302 - 



WO 02/32943 PCT/US01/25721 

gttgcggtgc tgbtaacggb ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagacbgb tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggbabc ccgcgbaacb ggccgcagtg gtggatgtgg ggtatcctgg 1980 
gtttctggat gatcatcatc bgccgbgbbg ttggtaacat gtgggttacc gbttacbacg 2040 
gtgtbccggt ttggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacabgg 2220 
ttgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgbgcgbba 2280 
aactgacccc gcbgtgcgtb accctgcact gcaccaacgc tagcatcaac tgcaacgcth 2340 
ccaccatcac ccaggcttgc ccgaaagtta acttcgaccc gatcccgatc cactactgcg 2400 
ctccggctgg ttacgcbatc cbgaaabgca acaacaaaac cbbctccggb aaaggtccgfc 2460 
gcaacaacgt ttccaccgtt cagtgcaccc abggbabcaa accggttgtt tccacccagc 2520 
tgctgctgaa cggtbccctg gcbgaaaaag aaahcatcat ccgttccgaa aacctgaccg 2580 
acaacgttaa aaccatcatc gtbcacctga acaaatccgt tgaaatcgtt tgcacccgtc 2640 
cgaacaacaa cacccgtaaa bccabgcgta tcggtccggg tcagaccttc tacgctaccg 2700 
gtgacatcat cggtgacatc cgtcaggctt actgcaacat ctccggttcc aaatggaacg 2760 
aaaccctgaa acgbgbtaaa gaaaaactgc aggaaaacta caacaacaac aaaaccatca 2820 
aattcgctcc gtcctccggt ggbgaccbgg aaatcaccac ccactccttc aactgccgtg 2880 
gbgaabbctb cbactgcaac accgcbagcc bgccgbgccg tatcaaacag atcatcaaca 294 0 
tgtggcaggg tgtbggtcgt gcbatgtacg cbccgccgab cgcbggtaac atcaccbgca 3000 
aabccaacab caccggbctg cbgctggttc gbgacggtgg bgaagacaac aaaaccgaag 3060 
aaabcbtccg bccgggbggb ggbaacatga aagacaactg gcgbbccgaa cbgtacaaab 3120 
acaaagbbab cgaacbgaaa ccgcbgggta bcgcbccgac cggbgctaag cbtaccgbbc 3180 
aggctcgbca gcbgcbgbcc bccatcgttc agcagcagtc caaccfcgcbg cgtgctabcg 324 0 
aagcbcagca gcacabgctg cagcbgaccg bttggggtat caaacagcbg cagacccgbg 3300 
btctggcbab cgaacgbbac cbgaaagacc agcagcbcga gabctggaac aacatgacct 3360 
ggabggaatg ggaccgbgaa abcbccaacb acaccgacac catctaccgb ctgctggaag 3420 
actcccagac ccagcaggaa aaaaacgaaa aagaccbgcb ggcbcbggac tcctggaaaa 34 80 
accbgbggbc ctggbbcgac abcbccaacb ggctgbggbg aggatccaga tctgcbgbgc 354 0 
ctbcbagbtg ccagccabcb gtbgbttgcc cctcccccgb gccbtcctbg accctggaag 3600 
gtgccacbcc cacbgbcctt bccbaataaa atgaggaaab bgcatcgcat tgtctgagba 3660 
ggbgbcabbc babbcbgggg ggbggggbgg ggcaggacag caagggggag gatbgggaag 3720 
acaatagcag gcabgcbggg gabgcggbgg gcbcbabggg bacccaggbg cbgaagaabb 3780 
gacccggbtc cbccbgggcc agaaagaagc aggcacatcc ccbbcbcbgt gacacaccct 3 840 
gtccacgccc cbggbbcbta gbfcccagccc cactcabagg acactcabag ctcaggaggg 3900 
ctccgccbtc aabcccaccc gcbaaagbac bbggagcggb cbcbcccbcc ctcatcagcc 3960 
caccaaacca aaccbagccb ccaagagbgg gaagaaabba aagcaagaba ggcbabbaag 4020 
bgcagaggga gagaaaabgc cbccaacatg bgaggaagba abgagagaaa bcatagaabb 4080 
btaaggccab gabbbaaggc cabcabggcc bbaabcbbcc gcbbcctcgc tcacbgacbc 4140 
gctgcgcbcg gbcgbbcggc bgcggcgagc ggbabcagcb cacbcaaagg cggtaabacg 4200 
gttabccaca gaabcagggg abaacgcagg aaagaacabg bgagcaaaag gccagcaaaa 4260 
ggccaggaac cgbaaaaagg ccgcgbtgcb ggcgbbbtbc cabaggcbcc gccccccbga 4320 
cgagcabcac aaaaabcgac gcbcaagbca gaggbggcga aacccgacag gactataaag 4380 
ataccaggcg bbbccccctg gaagcbcccb cgbgcgctcb ccbgbtccga cccbgccgcb 4440 
baccggabac cbgbccgccb bbcbcccbbc gggaagcgbg gcgcbttctc abagcbcacg 4500 
ctgbaggbab cbcagbbcgg bgbaggtcgb bcgcbccaag cbgggctgtg tgcacgaacc 4560 
ccccgbbcag cccgaccgcb gcgccbtabc cggbaactab cgbctbgagb ccaacccggb 4 620 
aagacacgac bbabcgccac bggcagcagc cacbggbaac aggatbagca gagcgaggba 4680 
bgbaggcggb gcbacagagb bcbtgaagbg gbggccbaac bacggctaca ctagaagaac 4740 
agtabbbggb abcbgcgctc bgcbgaagcc agbtaccttc ggaaaaagag bbggbagcbc 4800 
btgabccggc aaacaaacca ccgcbggbag cggbggbbbb btbgtbtgca agcagcagab 4860 
bacgcgcaga aaaaaaggab cbcaagaaga tccbbbgabc bbbbcbacgg ggtcbgacgc 4920 
bcagbggaac gaaaacbcac gbbaagggab bbbggbcabg agabtatcaa aaaggabcbb 4980 
cacctagatc cbtbbaaabb aaaaabgaag bbbbaaatca abcbaaagba batabgagba 5040 
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aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct 5100 

atttcgttca tccatagttg ccfcgactcgg gggggggggg cgctgaggtc tgcctcgtga 5160 

agaaggtgtt gctgactcat accaggcctg aatcgcccca tcatccagcc agaaagtgag 5220 

ggagccacgg ttgatgagag ctttgttgta ggtggaccag ttggtgattt tgaacttttg 5280 

ctttgccacg gaacggtctg cgttgtcggg aagatgcgtg atctgatcct tcaactcagc 5340 

aaaagttcga tttattcaac aaagccgccg tcccgtcaag tcagcgtaat gctctgccag 5400 

tgttacaacc aattaaccaa ttctgattag aaaaactcat cgagcatcaa atgaaactgc 5460 

aatttattca tatcaggatt atcaatacca tatttttgaa aaagccgttt ctgtaatgaa 5520 

ggagaaaact caccgaggca gttccatagg atggcaagat cctggtatcg gtctgcgatt 5580 

ccgactcgtc caacatcaat acaacctatt aatttcccct cgtcaaaaat aaggttatca 5640 

agtgagaaat caccatgagt gacgactgaa tccggtgaga atggcaaaag cttatgcatt 5700 

tctttccaga cttgttcaac aggccagcca ttacgctcgt catcaaaatc actcgcatca 5760 

accaaaccgt tattcattcg tgattgcgcc tgagcgagac gaaatacgcg atcgctgtta 5820 

aaaggacaat tacaaacagg aatcgaatgc aaccggcgca ggaacactgc cagcgcatca 5880 

acaatatttt cacctgaatc aggatattct tctaatacct ggaatgctgt tttcccgggg 5940 

atcgcagtgg tgagtaacca tgcatcatca ggagtacgga taaaatgctt gatggtcgga 6000 

agaggcataa attccgtcag ccagtttagt ctgaccatct catctgtaac atcattggca 6060 

acgctacctt tgccatgtht cagaaacaac tctggcgcat cgggcttccc atacaatcga 6120 

tagattgtcg cacctgattg cccgacatta tcgcgagccc atttataccc atataaatca 6180 

gcatccatgt tggaatttaa tcgcggcctc gagcaagacg tttcccgttg aatatggctc 6240 

ataacacccc ttgtattact gtttatgtaa gcagacagtt ttattghtca tgatgatata 6300 

tttttatctt gtgcaatgta acatcagaga ttttgagaca caacgtggct ttcccccccc 6360 

ccccattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 6420 

atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgac 6480 

gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat cacgaggccc 6540 

tttcgtc 6547 



<210> 132 
<211> 6637 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/e containing HIV genes 
<400> 132 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtfctgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agafccgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggac 102 0 
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tccccgtgcc aagagtgacg taagbaccgc ctabagactc babaggcaca cccctbbggc 1080 
bcbbatgcat gcbabacbgb ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg bgggbtabbg accattattg accactcccc 1200 
babbggbgac gabacbbtcc abbacbaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgh cccccgtgcc 1380 
cgcagttttt atbaaacaba gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct bcbccggbag cggcggagct tccacatccg agccctggtc ccabgccbcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gbggagabbg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggbaacbccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 18 00 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggbcbbbbc 1860 
tgcagtcacc gbcgbcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggbatccbgg 1980 
gtttcfcggat gatcatcatc tgccgtgttg bbggbaacab gtgggttacc gtttactacg 2040 
gbgbbccggb bbggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgbbctggaa aacgttaccg aaaacttcaa cabgbggaaa aacgacatgg 2220 
ttgaccagat gcacgaagac atcabcbccc tgtgggacca gtccctgaaa ccgbgcgbba 2280 
aactgacccc gcbgbgcgbt accctgcact gcaccaacgc tagccgtaac tgctccttca 2340 
acaccaccac cgaaatccgt gacaaaaaac agcagggtta cgctctgttc baccgbccgg 2400 
acatcgttct gctgaaagaa aaccgtaaca actccaacaa ctccgaatac atccbgabca 2460 
actgcaacgc bbccaccatc acccaggctt gcccgaaagt taacbtcgac ccgatcccga 2520 
bccacbacbg cgctccggcb ggttacgcta bcctgaaatg caacaacaaa accbbctccg 2580 
gbaaaggtcc gbgcaacaac gtbbccaccg bbcagbgcac ccatggbatc aaaccggbtg 2640 
tttccaccca gcbgcbgcbg aacggtbccc bggctgaaaa agaaabcatc abccgtbccg 2700 
aaaaccbgac cgacaacgtt aaaaccabca tcgbbcaccb gaacaaabcc gtbgaaabcg 2760 
tttgcacccg tgcbagcgcb bacbgcaaca bcbccggtbc caaabggaac gaaacccbga 2820 
aacgbgbbaa agaaaaactg caggaaaact acaacaacaa caaaaccatc aaattcgcbc 2880 
cgbccbccgg tggbgacctg gaaatcacca cccactccbb caacbgccgb ggbgaatbcb 2940 
bcbacbgcaa caccacccgb cbgtbcaaca acaacgctac cgaagacgaa accabcaccc 3000 
bgccgbgccg tatcaaacag abcabcaaca bgbggcaggg bgbtggbcgb gctabgbacg 3060 
cbccgccgat cgctggbaac abcaccbgca aabccaacat caccggbcbg ctgcbggbtc 3120 
gtgacggbgg tgaagacaac aaaaccgaag aaabcbbccg bccgggbggt ggbaacabga 318 0 
aagacaactg gcgtbccgaa ctgbacaaat acaaagbbat cgaacbgaaa ccgcbgggba 3240 
bcgcbccgac cggbgcbaag ctbaccgbbc aggcbcgbca gcbgcbgbcc bccabcgbbc 33 00 
agcagcagbc caaccbgcbg cgbgctabcg aagctcagca gcacabgcbg cagctgaccg 33 60 
tbbggggbab caaacagcbg cagacccgbg bbcbggctat cgaacgbbac cbgaaagacc 3420 
agcagcbcga gabcbggaac aacabgaccb ggabggaabg ggaccgtgaa abctccaacb 3480 
acaccgacac cabcbaccgt ctgcbggaag actcccagac ccagcaggaa aaaaacgaaa 3540 
aagaccbgcb ggcbcbggac bccbggaaaa accbgbggbc cbggbbcgac abcbccaacb 3600 
ggcbgbggtg aggabccaga bcbgcbgbgc cbbcbagbbg ccagccabcb gtbgbbbgcc 3660 
ccbcccccgb gccbbccbbg acccbggaag gbgccacbcc cactgbccbb bcctaabaaa 3720 
abgaggaaab bgcabcgcat bgbcbgagba ggbgbcabbc tabtctgggg ggtggggbgg 3780 
ggcaggacag caagggggag gabbgggaag acaabagcag gcatgcbggg gabgcggtgg 3840 
gcbcbabggg bacccaggbg ctgaagaabb gacccggbbc cbcctgggcc agaaagaagc 3900 
aggcacabcc cctbcbcbgt gacacacccb gbccacgccc cbggtbctba gbbccagccc 3960 
cacbcabagg acacbcabag cbcaggaggg cbccgccbbc aabcccaccc gcbaaagbac 4020 
tbggagcggb cbcbcccbcc cbcabcagcc caccaaacca aaccbagcct ccaagagbgg 4080 
gaagaaabba aagcaagaba ggctabbaag bgcagaggga gagaaaabgc cbccaacabg 414 0 
tgaggaagba abgagagaaa tcatagaabb bbaaggccab gabtbaaggc cabcabggcc 4200 
tbaabcbbcc gcttccbcgc tcacbgacbc gcbgcgcbcg gbcgbtcggc bgcggcgagc 4260 
ggtabcagcb cacbcaaagg cggbaabacg gbbafcccaca gaatcagggg ataacgcagg 4320 
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aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct 43 80 

ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca 4440 

gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct 4500 

cgtgcgctct cctgthccga ccctgccgct taccggatac ctgtccgcct ttctcccttc 4560 

gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt 4620 

tcgctccaag ctgggcbgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc 4680 

cggbaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc 4740 

cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg 4800 

gtggcctaac tacggctaca ctagaagaac agtatttggt atctgcgctc tgctgaagcc 4860 

agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag 4920 

cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga 4980 

tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat 5040 

tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag 5100 

ttttaaatca afcctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat 5160 

cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagthg cctgactcgg 5220 

9999999999 cgctgaggtc tgcctcgtga agaaggtgtt gctgactcat accaggcctg 5280 

aatcgcccca tcatccagcc agaaagtgag ggagccacgg ttgatgagag ctttgttgta 5340 

ggtggaccag ttggtgattt tgaacttttg ctttgccacg gaacggtctg cgtcgtcggg 5400 

aagatgcgtg atctgatcct tcaactcagc aaaagttcga ttfcattcaac aaagccgccg 5460 

tcccgtcaag tcagcgtaat gctctgccag tgttacaacc aattaaccaa ttctgattag 5520 

aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt atcaatacca 5580 

tatttttgaa aaagccgttt ctgtaatgaa ggagaaaact caccgaggca gttccatagg 5640 

atggcaagah cctggtatcg gtctgcgatt ccgactcgtc caacatcaat acaacctatt 5700 

aatttcccct cgtcaaaaat aaggttatca agtgagaaat caccatgagt gacgactgaa 5760 

tccggtgaga atggcaaaag cttatgcatt tctttccaga cttgttcaac aggccagcca 5820 

ttacgctcgt catcaaaatc actcgcatca accaaaccgt tattcattcg tgattgcgcc 5880 

tgagcgagac gaaatacgcg atcgctgtta aaaggacaat tacaaacagg aatcgaatgc 5940 

aaccggcgca ggaacactgc cagcgcatca acaatatttt cacctgaatc aggatattct 6000 

tctaatacct ggaatgctgt tttcccgggg atcgcagtgg tgagtaacca tgcatcatca 6060 

ggagtacgga taaaatgctt gatggtcgga agaggcataa attccgtcag ccagtttagt 6120 

ctgaccatct catctgtaac atcattggca acgctacctt tgccatgttt cagaaacaac 6180 

tctggcgcat cgggcttccc atacaatcga tagattgtcg cacctgattg cccgacatta 6240 

tcgcgagccc atttataccc atataaatca gcatccatgt tggaatttaa tcgcggcctc 6300 

gagcaagacg tttcccgttg aatatggctc ataacacccc ttgtattact gtttatgtaa 6360 

gcagacagtt ttattgttca tgatgatata tttttatctt gtgcaatgta acafccagaga 6420 

bttfcgagaca caacgfcggct ttcccccccc ccccattatt gaagcattta tcagggttat 6480 

tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 6540 

cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta 6600 

acctataaaa ataggcgtat cacgaggccc tttcgtc 6637 



<210> 133 
<211> 6598 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 133 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
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cbatbggcca bbgcabacgb bgbabccaba bcabaababg bacabtbaba ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt abtgactagt tattaatagt aatcaattac 360 
ggggtcatta gbbcabagcc cabababgga gttccgcgtt acabaacbba cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga cbbbccabbg acgtcaatgg gbggagbabb tacggtaaac 540 
tgcccacttg gcagtacatc aagbgbabca tatgccaagt acgcccccta btgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagbacabg accbbabggg actttcctac 660 
ttggcagtac abcbacgbab tagtcatcgc tattaccatg gtgatgcggt bbbggcagba 720 
cabcaabggg cgtggatagc ggtbbgacbc acggggattt ccaagfcctcc accccabbga 780 
cgtcaatggg agtttgtttfc ggcaccaaaa tcaacgggac tttccaaaat gbcgbaacaa 840 
ctccgcccca ttgacgcaaa bgggcggbag gcgbgbacgg tgggaggtct atataagcag 900 
agctcgttta gbgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacchcca 960 
tagaagacac cgggaccgat ccagccbccg cggccgggaa cggbgcatbg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca ccccbbbggc 1080 
tcttatgcat gcbabacbgb bbbbggcbtg gggcctatac acccccgctt ccbtabgcba 1140 
taggtgatgg. tatagcttag cctataggtg tgggttattg accatbabbg accactcccc 1200 
tattggtgac gatactttcc abtacbaabc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg battbbbaca 1320 
ggatggggtc ccatbbabba tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt abbaaacaba gcgtgggatc tccacgcgaa tchcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagch tccacatccg agcccbggbc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg cbccbaacag bggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc ghggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggbaacbccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgh bgcbgccgcg 1800 
cgcgccacca gacataatag cbgacagact aacagactgt tcctttccat gggbcbbbbc I860 
tgcagfccacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgtaact ggccgcagbg gtggafcgtgg ggtatcctgg 1980 
gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggttacc gtttactacg 2040 
gtghtccggt fctggaccgac gctaaaacca ccctgttchg cgcttccgac accaaagcct 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 222 0 
ttgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgtfca 2280 
aactgacccc gctgtgcgtt accctgcact gcaccaacgc tagccgtaac tgctcchtca 2340 
acaccaccac cgaaatccgt gacaaaaaac agcagggtta cgctctgttc taccgtccgg 2400 
acatcgttct gctgaaagaa aaccgtaaca actccaacaa cbccgaatac atcctgatca 2460 
actgcaacgc ttccaccatc acccaggctt gcccgaaagt taacttcgac ccgatcccga 2520 
tccactactg cgctccggct ggttacgcta fccctgaaatg caacaacaaa accttctccg 2580 
gtaaaggtcc gtgcaacaac gtttccaccg ttcagtgcac ccatggtatc aaaccggttg 2640 
tttccaccca gctgctgctg aacggtbccc bggctgaaaa agaaatcatc abccgttccg 2700 
aaaacctgac cgacaacgbt aaaaccatca tcgttcaccb gaacaaatcc gbbgaaabcg 2760 
bbbgcacccg bgcbagcgct bacbgcaaca bctccggtbc caaatggaac gaaacccbga 2820 
aacgtgtbaa agaaaaacbg caggaaaact acaacaacaa caaaaccabc aaattcgctc 2880 
cgtccbccgg tggtgaccbg gaaabcacca cccacbccbt caacbgccgb ggbgaabbct 2940 
tcbactgcaa caccgcbagc cbgccgbgcc gbabcaaaca gabcabcaac abgbggcagg 3000 
gbgbtggbcg bgcbabgbac gcbccgccga bcgcbggbaa cabcacctgc aaabccaaca 3060 
bcaccggtcb gcbgcbggbb cgbgacggtg gbgaagacaa caaaaccgaa gaaatcbbcc 3120 
gbccgggbgg bggbaacabg aaagacaacb ggcgtbccga acbgbacaaa tacaaagbba 3180 
bcgaacbgaa accgcbgggt abcgctccga ccggtgcbaa gcbtaccgbb caggcbcgbc 324 0 
agcbgcbgbc cbccabcgbb cagcagcagb ccaaccbgcb gcgbgcbabc gaagcbcagc 3300 
agcacabgcb gcagcbgacc gtbbggggba bcaaacagcb gcagacccgb gtbcbggcba 3360 
bcgaacgbba ccbgaaagac cagcagcbcg agabcbggaa caacabgacc bggabggaat 3420 
gggaccgbga aabcbccaac bacaccgaca ccatcbaccg bcbgcbggaa gacbcccaga 3480 
cccagcagga aaaaaacgaa aaagaccbgc tggcbcbgga cbcctggaaa aaccbgbggt 3540 
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cctggttcga catctccaac tggcbgtggb 
gccagccatc bgbbgtttgc ccctcccccg 
ccacbgbcct ttcctaataa aatgaggaaa 
ctattctggg gggtggggtg gggcaggaca 
ggcatgctgg ggatgcggtg ggctctatgg 
cctcctgggc cagaaagaag caggcacatc 
ccbggbbctt agtbccagcc ccactcatag 
caatcccacc cgctaaagta cbbggagcgg 
aaacctagcc tccaagagtg ggaagaaatt 
agagaaaatg cctccaacat gtgaggaagt 
tgatttaagg ccatcatggc cttaatcttc 
ggtcgttcgg ctgcggcgag cggtatcagc 
agaatcaggg gataacgcag gaaagaacat 
ccgtaaaaag gccgcgbbgc tggcgttttt 
caaaaatcga cgctcaagtc agaggtggcg 
gbbbcccccb ggaagctccc tcgtgcgctc 
ccbgbccgcc tttctccctt cgggaagcgt 
tctcagttcg gbgtaggbcg ttcgctccaa 
gcccgaccgc bgcgccttab ccggtaacta 
cbbabcgcca ctggcagcag ccactggtaa 
tgcbacagag ttcttgaagt ggtggcctaa 
babcbgcgct ctgctgaagc cagttacctt 
caaacaaacc accgctggta gcggbggbbb 
aaaaaaagga tctcaagaag atcctttgat 
cgaaaacbca cgttaaggga ttttggtcat 
ccfctttaaat baaaaabgaa gbbttaaabc 
tgacagbtac caatgcttaa bcagtgaggc 
atccatagtt gcctgactcg gggggggggg 

bgcbgacbca taccaggcct gaabcgcccc 
gttgatgaga gcbbbgttgb aggtggacca 
ggaacggtct gcgbbgbcgg gaagatgcgt 
atttattcaa caaagccgcc gtcccgtcaa 
caabbaacca attctgatta gaaaaactca 
atafccaggat tatcaatacc atatttttga 
bcaccgaggc agttccatag gatggcaaga 
ccaacatcaa tacaacctat taatttcccc 
tcaccatgag tgacgactga atccggtgag 
acttgttcaa caggccagcc attacgctcg 
ttattcattc gtgattgcgc ctgagcgaga 
ttacaaacag gaahcgaatg caaccggcgc 
tcacctgaat caggatattc bbcbaatacc 
gtgagtaacc atgcatcatc aggagtacgg 
aattccgtca gccagbbbag bcbgaccatc 
bbgccabgbt tcagaaacaa cbcbggcgca 
gcacctgatt gcccgacatt atcgcgagcc 
ttggaattta atcgcggcct cgagcaagac 
ctbgbabbac bgbbbabgba agcagacagt 
bgbgcaabgt aacatcagag abttbgagac 
tgaagcattt atcagggbba ttgtctcatg 
aataaacaaa taggggttcc gcgcacattt 
accattatta tcatgacatt aacctataaa 



gaggatccag atctgctgtg ccttctagtt 3600 
tgccttccfct gaccctggaa ggtgccactc 3660 
ttgcatcgca ttgtctgagt aggtgtcatt 3720 
gcaaggggga ggattgggaa gacaatagca 3780 
gtacccaggt gctgaagaat tgacccggtt 3840 
cccttctctg tgacacaccc tgtccacgcc 3900 
gacactcata gctcaggagg gctccgcctt 3960 
tctctccctc cctcatcagc ccaccaaacc 4020 
aaagcaagat aggctattaa gtgcagaggg 4 080 
aatgagagaa atcatagaat tttaaggcca 4140 
cgcttcctcg ctcactgact cgctgcgctc 4200 
tcactcaaag gcggtaatac ggttatccac 4260 
gtgagcaaaa ggccagcaaa aggccaggaa 4 32 0 
ccataggctc cgcccccctg acgagcatca 4380 
aaacccgaca ggactataaa gataccaggc 4440 
tcctgttccg accctgccgc ttaccggata 4500 
ggcgcttbct catagctcac gctgtaggta 4560 
gctgggctgt gtgcacgaac cccccgtbca 4620 
tcgtcttgag tccaacccgg taagacacga 4680 
caggattagc agagcgaggt atgtaggcgg 4740 
ctacggcfcac actagaagaa cagtatttgg 4800 
cggaaaaaga gttggtagct cttgatccgg 4860 
ttbfcgtttgc aagcagcaga btacgcgcag 4920 
ctbbtcbacg gggtcbgacg ctcagbggaa 4980 
gagabbabca aaaaggatcb bcaccbagab 5040 
aabcbaaagt ababatgagb aaacbbggbc 5100 
accbabctca gcgatcbgbp battbcgttc 5160 
gcgcbgaggb cbgccbcgbg aagaaggbgb 5220 
atcabccagc cagaaagbga gggagccacg 5280 
gtbggbgatt bbgaacbtbt gcbtbgccac 5340 
gabcbgabcc bbcaacbcag caaaagbbcg 5400 
gtcagcgbaa bgcbcbgcca gbgbbacaac 5460 
tcgagcabca aabgaaacbg caabbtattc 5520 
aaaagccgtb bctgbaabga aggagaaaac 55 80 
bccbggbatc ggtcbgcgab tccgacbcgb 5640 
bcgbcaaaaa taaggbbabc aagbgagaaa 5700 
aabggcaaaa gcbbabgcab bbcbbbccag 5760 
bcabcaaaab cacbcgcabc aaccaaaccg 5820 
cgaaabacgc gabcgctgbb aaaaggacaa 5880 
aggaacacbg ccagcgcabc aacaababbb 5940 
bggaatgctg bbbtcccggg gatcgcagbg 6000 
abaaaabgcb bgabggtcgg aagaggcaba 6060 
tcatctgbaa cabcabbggc aacgctaccb 6120 
tcgggcbbcc catacaabcg abagabbgtc 6180 
cabbbabacc cababaaabc agcatccabg 6240 
gtbtcccgbb gaatatggcb cataacaccc 6300 
bbbabbgbbc atgabgatat abtbtbatcb 6360 
acaacgbggc btbccccccc cccccabbab 6420 
agcggabaca batbbgaabg babbtagaaa 6480 
ccccgaaaag tgccaccbga cgbctaagaa 6540 
aataggcgba bcacgaggcc ctbbcgbc 6598 



<210> 134 
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<211> 6679 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 134 

tcgcgcgttt cggfcgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggtatcctgg 19B0 
gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggttacc gtttactacg 2040 
gtgttccggt ttggaccgac gctaaaacca ccctgttcfcg cgcttccgac accaaagcct 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 2220 
ttgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgtta 22 80 
aactgacccc gctgtgcgtt accctgcact gcaccaacgc tagccgtaac tgctccttca 2340 
acaccaccac cgaaatccgt gacaaaaaac agcagggtta cgctctgttc taccgtccgg 24 00 
acatcgttct gctgaaagaa aaccgtaaca actccaacaa ctccgaatac atcctgatca 24 60 
actgcaacgc ttccaccatc acccaggctt gcccgaaagt taacttcgac ccgatcccga 2520 
tccactactg cgctccggct ggttacgcta tcctgaaatg caacaacaaa accttctccg 2580 
gtaaaggtcc gtgcaacaac gtttccaccg ttcagtgcac ccatggtatc aaaccggttg 2640 
tttccaccca gctgctgctg aacggttccc tggctgaaaa agaaatcatc atccgttccg 2700 
aaaacctgac cgacaacgtt aaaaccatca tcgttcacct gaacaaatcc gttgaaatcg 2760 
tttgcacccg tccgaacaac aacacccgta aatccatgcg tatcggtccg ggtcagacct 2820 
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bcbacgcbac cggtgacatc atcggtgaca bccgbcaggc btacbgcaac atcbccggtt 2880 

ccaaatggaa cgaaaccctg aaacgtgtha aagaaaaact gcaggaaaac tacaacaaca 2940 

acaaaaccat caaattcgct ccgbccbccg gtggtgacct ggaaatcacc acccactcct 3000 

tcaactgccg tggtgaattc ttctactgca acaccgctag cctgccgtgc cgtatcaaac 3060 

agatcabcaa catgtggcag ggtgttggtc gtgctatgta cgctccgccg atcgctggta 3120 

acatcacctg caaatccaac abcaccggbc tgctgctggt bcgbgacggt ggtgaagaca 3180 

acaaaaccga agaaatcttc cgtccgggbg gtggtaacat gaaagacaac bggcgbbccg 324 0 

aactgtacaa atacaaagtt abcgaacbga aaccgcbggg batcgctccg accggtgcta 3300 

agcttaccgt tcaggctcgt cagctgctgt ccbccabcgb bcagcagcag tccaacctgc 3360 

tgcgtgctat cgaagctcag cagcacatgc bgcagcbgac cgtttggggt atcaaacagc 3420 

tgcagacccg tgttctggct atcgaacgtt acctgaaaga ccagcagctc gagabcbgga 3480 

acaacabgac ctggabggaa bgggaccgtg aaabctccaa cbacaccgac accatcbacc 3540 

gtcbgcbgga agactcccag acccagcagg aaaaaaacga aaaagaccbg ctggcbctgg 3600 

acbccbggaa aaaccbgbgg bccbggbbcg acabcbccaa cbggcbgtgg bgaggabcca 3660 

gabctgcbgb gccbbctagb bgccagccab cbgbbgtbtg cccctccccc gtgccbbcct 3720 

tgacccbgga aggtgccacb cccacbgtcc bbtccbaata aaabgaggaa atbgcabcgc 3780 

atbgtcbgag baggbgtcab tctabbctgg ggggbggggt ggggcaggac agcaaggggg 384 0 

aggattggga agacaatagc aggcabgctg gggatgcggb gggcbctabg ggtacccagg 390 0 

tgctgaagaa bbgacccggb bccbccbggg ccagaaagaa gcaggcacab ccccbtcbct 3960 

gbgacacacc cbgbccacgc cccbggbbcb bagbtccagc cccacbcaba ggacacbcab 4020 

agcbcaggag ggcbccgccb bcaabcccac ccgctaaagt acbtggagcg gbcbctcccb 40 BO 

ccctcabcag cccaccaaac caaaccbagc cbccaagagt gggaagaaab baaagcaaga 4140 

baggctabta agbgcagagg gagagaaaab gccbccaaca bgtgaggaag baatgagaga 42 00 

aabcatagaa tbbbaaggcc abgabbbaag gccatcabgg cctbaabcbt ccgcbbccbc 4260 

gctcacbgac bcgcbgcgcb cggbcgbbcg gctgcggcga gcggbabcag cbcacbcaaa 432 0 

ggcggtaaba cggbbabcca cagaabcagg ggabaacgca ggaaagaaca bgbgagcaaa 4380 

aggccagcaa aaggccagga accgbaaaaa ggccgcgbbg cbggcgbtbb bccabaggcb 444 0 

ccgccccccb gacgagcabc acaaaaabcg acgcbcaagb cagaggbggc gaaacccgac 4500 

aggactabaa agabaccagg cgbbbccccc bggaagcbcc cbcgtgcgcb ctccbgbbcc 4560 

gaccctgccg cbbaccggab accbgtccgc ctbbcbcccb bcgggaagcg bggcgctbtc 4620 

tcatagcbca cgcbgbaggb atcbcagbbc ggbgtaggbc gbbcgcbcca agcbgggcbg 4680 

bgbgcacgaa ccccccgbbc agcccgaccg cbgcgccbba bccggtaacb atcgbcbbga 4740 

gbccaacccg gbaagacacg actbabcgcc acbggcagca gccacbggba acaggatbag 4800 

cagagcgagg babgbaggcg gbgcbacaga gtbcbbgaag bggbggccba acbacggcba 4860 

cacbagaaga acagbabbbg gbabcfcgcgc bcbgcbgaag ccagbtaccb bcggaaaaag 4920 

agbbggbagc bcbbgabccg gcaaacaaac caccgcbggb agcggbggbt btbbbgtbbg 4980 

caagcagcag abbacgcgca gaaaaaaagg abcbcaagaa gabccbbtga bcbbbbcbac 5040 

ggggbcbgac gcbcagbgga acgaaaacbc acgbbaaggg abbbbggbca bgagabbabc 5100 

aaaaaggabc bbcaccbaga bccbbtbaaa tbaaaaabga agbbbbaaab caabctaaag 5160 

bababatgag baaacbbggb cbgacagbta ccaabgcbba atcagtgagg caccbatcbc 5220 

agcgabctgb cbabtbcgtb catccabagb bgccbgacbc gggggggggg ggcgcbgagg 52 80 

bcbgccbcgb gaagaaggbg tbgctgactc abaccaggcc tgaabcgccc catcabccag 5340 

ccagaaagbg agggagccac ggbbgabgag agcbbbgbbg taggbggacc agbbggbgab 5400 

bbbgaacbtb bgcbbbgcca cggaacggtc bgcgbbgbcg ggaagabgcg bgabcbgabc 5460 

cbbcaacbca gcaaaagbbc gabbbabbca acaaagccgc cgbcccgbca agbcagcgta 5520 

abgctctgcc agbgbbacaa ccaabbaacc aabbcbgabb agaaaaacbc abcgagcatc 5580 

aaabgaaacb gcaabbbabt cababcagga bbabcaabac cababtbbbg aaaaagccgb 5640 

bbcbgbaabg aaggagaaaa cbcaccgagg cagbbccaba ggabggcaag abccbggbab 5700 

cggbcbgcga bbccgacbcg bccaacabca abacaaccba bbaabtbccc cbcgbcaaaa 5760 

abaaggtbab caagbgagaa abcaccabga gbgacgacbg aabccggbga gaabggcaaa 582 0 

agcbbatgca bbbcbbbcca gacbbgbbca acaggccagc cabbacgcbc gbcabcaaaa 5880 

bcacbcgcab caaccaaacc gbbabtcabb cgbgabbgcg ccbgagcgag acgaaabacg 5940 

cgabcgcbgb baaaaggaca abbacaaaca ggaatcgaat gcaaccggcg caggaacact 6000 

gccagcgcab caacaababt bbcaccbgaa bcaggababb cbbcbaabac cbggaabgcb 6060 

gtbbbcccgg ggabcgcagb ggbgagbaac cabgcabcab caggagbacg gabaaaabgc 6120 
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ttgabggtcg gaagaggcat aaattccgtc agccagttta gtctgaccat ctcatctgta 6180 

acatcatbgg caacgctacc tttgccatgt ttcagaaaca actctggcgc atcgggcttc 6240 

ccatacaatc gatagattgt cgcacctgat bgcccgacab batcgcgagc ccabttatac 6300 

ccatataaat cagcatccat gttggaabtt aatcgcggcc tcgagcaaga cgtbtcccgb 6360 

tgaatatggc tcataacacc ccbbgbabta ctgtttabgt aagcagacag bbtbatbgtb 6420 

catgatgata tatbbbbabc ttgtgcaatg taacatcaga gattttgaga cacaacgtgg 6480 

ctttcccccc ccccccatta ttgaagcatt tatcagggtt attgtctcat gagcggatac 6540 

atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatb tccccgaaaa 6600 

gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt 6660 

atcacgaggc ccfcfctcgtc 6679 



<210> 135 
<211> 6646 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 135 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagcaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcbtg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatfca tttacaaatt cacatataca acaacgccgt cccccgtgcc 13 80 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 14 4 0 
catgggctct tctccggtag cggcggagct tccacabccg agcccbggbc ccabgcctcc 1500 
agcggctcat ggtcgctcgg cagctcctbg ctccbaacag bggaggccag acbbaggcac 1560 
agcacaatgc ccaccaccac cagtgbgccg cacaaggccg bggcggtagg gtatgtgtcb 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagtbgbt gbattctgat aagagtcaga ggtaacbccc 1740 
gttgcggtgc bgttaacggt ggagggcagt gtagtcbgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctfctccat gggbcbbttc 1860 
tgcagbcacc gtcgtcgaca cgtgtgatca gabafccgcgg ccgctctaga gafcatcgcca 1920 
ccabgcgtgt bcgtggtatc ccgcgtaacb ggccgcagtg gbggatgtgg ggtatcctgg 1980 
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gtttctggat gatcatcatc tgccgtgttg 
gtgttccggt ttggaccgac gctaaaacca 
acgaccgtga agttcacaac gtttgggcta 
cgcaggaaat cgttctggaa aacgttaccg 
ttgaccagat gcacgaagac atcatctccc 
aactgacccc gctgtgcgtt accctgcact 
ccaacgacat gaacaaagaa atccgtaact 
ccaccatcac ccaggcttgc ccgaaagtta 
ctccggctgg ttacgctatc ctgaaatgca 
gcaacaacgt ttccaccgtt cagtgcaccc 
tgctgctgaa cggttccctg gctgaaaaag 
acaacgttaa aaccatcatc gttcacctga 
cgaacaacaa cacccgtaaa tccatgcgta 
gtgacatcat cggtgacatc cgtcaggctt 
aaaccctgaa acgtgttaaa gaaaaactgc 
aattcgctcc gtcctccggt ggtgacctgg 
gtgaattctt ctactgcaac accacccgtc 
ccatcaccct gccgtgccgt atcaaacaga 
ctatgtacgc tccgccgatc gctggtaaca 
tgctggttcg tgacggtggt gaagacaaca 
gtaacatgaa agacaactgg cgttccgaac 
cgctgggtat cgctccgacc ggtgctaagc 
ccatcgttca gcagcagtcc aacctgctgc 
agctgaccgt ttggggtatc aaacagctgc 
tgaaagacca gcagctcgag atctggaaca 
tctccaacta caccgacacc atctaccgtc 
aaaacgaaaa agacctgctg gctctggact 
tctccaactg gctgtggtga ggatccagat 
ttgtttgccc ctcccccgtg ccttccttga 
cctaataaaa tgaggaaatt gcatcgcatt 
gtggggtggg gcaggacagc aagggggagg 
atgcggtggg ctctatgggt acccaggtgc 
gaaagaagca ggcacatccc cttctctgtg 
ttccagcccc actcatagga cactcatagc 
ctaaagtact tggagcggtc" tctccctccc 
caagagtggg aagaaattaa agcaagatag 
tccaacatgt gaggaagfcaa tgagagaaat 
atcatggcct taatcttccg cttcctcgct 
gcggcgagcg gtatcagctc actcaaaggc 
taacgcagga aagaacatgt gagcaaaagg 
cgcgttgctg gcgtttttcc ataggctccg 
ctcaagtcag aggtggcgaa acccgacagg 
aagctccctc gtgcgctctc ctgttccgac 
tctcccttcg ggaagcgtgg cgctttctca 
gtaggtcgtt cgctccaagc tgggctgtgt 
cgccttatcc ggtaactatc gtcttgagtc 
ggcagcagcc actggtaaca ggattagcag 
cttgaagtgg tggcctaact acggctacac 
gctgaagcca gttaccttcg gaaaaagagt 
cgctggtagc ggtggttttt ttgtttgcaa 
tcaagaagat cctttgatct tttctacggg 
ttaagggatt ttggtcatga gattatcaaa 
aaaatgaagt tttaaatcaa tcbaaagtat 
atgcttaatc agtgaggcac ctatctcagc 
ctgactcggg gggggggggc gctgaggtct 



ttggtaacat gtgggttacc gtttacfcacg 2040 
ccctgttctg cgcttccgac accaaagcct 2100 
cccacgcttg cgttccgacc gacccgaacc 2160 
aaaacttcaa catgtggaaa aacgacatgg 2220 
tgtgggacca gtccctgaaa ccgtgcgtta 2280 
gcaccaacgc taccttcaaa aacaacgtta 2340 
gctccttcgc atgcatcaac tgcaacgctt 2400 
acttcgaccc gatcccgafcc cactactgcg 2460 
acaacaaaac cttctccggt aaaggtccgt 2520 
atggtatcaa accggttgtt tccacccagc 2580 
aaatcatcah ccgttccgaa aacctgaccg 2640 
acaaatccgt fcgaaatcgtt tgcacccgtc 2700 
tcggtccggg tcagaccttc tacgctaccg 2760 
actgcaacat ctccggttcc aaatggaacg 2820 
aggaaaacta caacaacaac aaaaccatca 2880 
aaatcaccac ccactccttc aactgccgtg 2940 
tgttcaacaa caacgctacc gaagacgaaa 3000 
tcatcaacat gtggcagggt gttggtcgtg 3060 
tcacctgcaa atccaacatc accggtctgc 3120 
aaaccgaaga aatcttccgt ccgggtggtg 3180 
tgtacaaata caaagttatc gaactgaaac 3240 
ttaccgttca ggctcgtcag ctgctgtcct 3300 
gtgctatcga agctcagcag cacatgctgc 3360 
agacccgtgt tctggctatc gaacgttacc 3420 
acatgacctg gatggaatgg gaccgtgaaa 3480 
tgctggaaga ctcccagacc cagcaggaaa 3540 
cctggaaaaa cctgtggtcc tggttcgaca 3600 
ctgctgtgcc ttctagttgc cagccatctg 3660 
ccctggaagg tgccactccc actgtccttt 3720 
gtctgagtag gtgtcattct attctggggg 3780 
attgggaaga caatagcagg catgctgggg 384 0 
tgaagaattg acccggttcc tcctgggcca 3900 
acacaccctg tccacgcccc tggttcttag 3960 
tcaggagggc tccgccttca atcccacccg 4020 
tcatcagccc accaaaccaa acchagcctc 4080 
gctattaagt gcagagggag agaaaatgcc 4140 
catagaattt taaggccatg atttaaggcc 4200 
cactgachcg ctgcgctcgg tcgttcggct 4260 
ggtaatacgg ttatccacag aatcagggga 4320 
ccagcaaaag gccaggaacc gtaaaaaggc 4380 
cccccctgac gagcafccaca aaaatcgacg 4440 
actataaaga taccaggcgt ttccccctgg 4500 
cctgccgctt accggatacc tgtccgcctt 4560 
tagctcacgc tgtaggtatc tcagttcggt 4620 
gcacgaaccc cccgttcagc ccgaccgctg 4680 
caacccggta agacacgact tatcgccact 4740 
agcgaggtat gtaggcggtg ctacagagtt 4800 
tagaagaaca gtatttggta tctgcgctct 4860 
tggtagctct tgatccggca aacaaaccac 4920 
gcagcagatt acgcgcagaa aaaaaggatc 4980 
gtctgacgct cagtggaacg aaaacfccacg 5040 
aaggatcttc acctagatcc fcttfcaaatta 5100 
atatgagtaa acttggtctg acagtbacca 5160 
gatctgtcta tttcgttcat ccatagttgc 5220 
gcctcgtgaa gaaggtgtfcg ctgactcata 5280 
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ccaggcctga abcgccccab catccagcca gaaagtgagg gagccacggt tgatgagagc 5340 
tttgttgtag gbggaccagb bggbgabbbb gaacbbbbgc tbbgccacgg aacggtctgc 5400 
gbbgbcggga agatgcgtga tctgatcctt caactcagca aaagbbcgab ttattcaaca 5460 
aagccgccgt cccgtcaagt cagcgtaatg ctctgccagt gbbacaacca afctaaccaat 5520 
tctgattaga aaaactcatc gagcatcaaa bgaaactgca atbbabtcat atcaggatta 5580 
tcaataccat abbbtbgaaa aagccgtttc tgtaatgaag gagaaaactc accgaggcag 5640 
ttccatagga tggcaagatc cbggbatcgg tctgcgattc cgactcgtcc aacatcaaba 5700 
caacctatta abbbccccbc gtcaaaaata aggbbatcaa gtgagaaatc accatgagtg 5760 
acgactgaat ccggtgagaa tggcaaaagc ttatgcattt ctttccagac bbgbtcaaca 5820 
ggccagccab bacgcbcgbc atcaaaabca cbcgcatcaa ccaaaccgbt abbcattcgb 5880 
gatbgcgccb gagcgagacg aaabacgcga bcgcbgtbaa aaggacaabt acaaacagga 594 0 
atcgaatgca accggcgcag gaacacbgcc agcgcatcaa caatatbtbc accbgaabca 6000 
ggabatbcbb cbaabaccbg gaabgctgbt btcccgggga bcgcagbggb gagbaaccab 6060 
gcabcabcag gagbacggab aaaabgcbbg abggbcggaa gaggcabaaa bbccgbcagc 6120 
cagbbbagbc tgaccabcbc abctgbaaca bcabbggcaa cgcbaccbtb gccabgbbbc 6180 
agaaacaact cbggcgcabc gggcbtccca bacaabcgat agabbgtcgc acctgabbgc 624 0 
ccgacabbat cgcgagccca bbbabaccca babaaabcag catccatgtt ggaabtbaab 6300 
cgcggccbcg agcaagacgt ttcccgbbga ababggctca baacacccct tgtabtacbg 6360 
fcbbabgtaag cagacagbtt battgbbcab gabgababab bbtbabcbbg bgcaabgbaa 6420 
cabcagagab bbbgagacac aacgtggcbb bccccccccc cccabbabbg aagcabbbab 6480 
cagggbbabb gbcbcabgag cggabacaba bbbgaabgba bbtagaaaaa baaacaaafca 6540 
ggggbbccgc gcacabbbcc ccgaaaagtg ccaccbgacg bctaagaaac cabbabtabc 6600 
abgacabtaa ccbataaaaa taggcgbabc acgaggcccb bbcgtc 664 6 



<210> 136 
<211> 6565 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVRi012x/s containing HIV genes 
<400> 136 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagctcccg gagacggtca 60 
cagcbbgbcb gtaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 12 0 
bbggcgggbg bcggggcbgg cbbaactabg cggcabcaga gcagattgba ctgagagtgc 180 
accabatgcg gtgbgaaata ccgcacagab gcgtaaggag aaaataccgc atcagabtgg 24 0 
cbabtggcca btgcabacgt bgbabccaba bcataababg bacatttaba tbggcbcabg 300 
bccaacabba ccgccabgbb gacabbgatb abbgacbagb babbaabagb aabcaabbac 360 
ggggbcabba gtbcabagcc catatatgga gbbccgcgtt acabaacbta cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaataabga cgbabgbbcc 480 
cabagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagtabb tacggbaaac 540 
bgcccactbg gcagbacabc aagbgtatca fcabgccaagt acgcccccba btgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbatgc ccagbacabg accbtabggg actbbccbac 660 
btggcagbac atctacgbab bagbcatcgc tabbaccabg gtgatgcggb bbtggcagba 720 
cabcaabggg cgbggabagc ggbbbgactc acggggabbb ccaagtcbcc accccabbga 780 
cgbcaabggg agbbtgbbbb ggcaccaaaa tcaacgggac bbbccaaaab gbcgtaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggtcb ababaagcag 900 
agcbcgtbba gbgaaccgtc agatcgcctg gagacgccab ccacgcbgtb bbgaccbcca 960 
tagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcatbg gaacgcggab 1020 
tccccgbgcc aagagbgacg taagbaccgc cbabagacbc babaggcaca ccccbbbggc 1080 
bcbbabgcab gcbabacbgb btbbggcbbg gggccbabac acccccgctb ccbbabgcba 1140 
baggtgatgg babagcbbag ccbabaggbg bgggbbabbg accabbabtg accacbcccc 1200 



- 313 - 



WO 02/32943 PCT/US01/25721 

tatbggbgac gabacbbbcc attactaatc cataacatgg ctctttgcca caacbabcbc 1260 
tattggcbat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccabbbabba tttacaaatt cacatataca acaacgccgb cccccgtgcc 1380 
cgcagttttt abbaaacaba gcgtgggatc tccacgcgaa bcbcgggfcac gbgbtccgga 1440 
catgggcbct tcbccggbag cggcggagct tccacatccg agccctggtc ccabgccbcc 1500 
agcggctcat ggbcgctcgg cagctccttg ctcctaacag tggaggccag acbbaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgbgbcb 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgbt gbabbcbgab aagagtcaga ggtaactccc 1740 
gbtgcggbgc tgttaacggt ggagggcagt gtagtctgag cagbacbcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagacbgb tcctttccah gggtcttbtc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gababcgcgg ccgctctaga gatatcgcca 1920 
ccabgcgbgb tcgtggtatc ccgcgtaact ggccgcagtg gbggabgbgg ggtatcctgg 1980 
gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggttacc gtttactacg 2040 
gtgttccggt ttggaccgac gctaaaacca cccbgtbcbg cgcttccgac accaaagcct 2100 
acgaccgtga agtbcacaac gtttgggcta cccacgctfcg cgbbccgacc gacccgaacc 2160 
cgcaggaaab cgtbctggaa aacgbbaccg ' aaaacttcaa cabgbggaaa aacgacabgg 2 22 0 
btgaccagat gcacgaagac abcabcbccc tgbgggacca gbcccbgaaa ccgbgcgbba 22 8 0 
aacbgacccc gcbgtgcgbb acccbgcacb gcaccaacgc taccbbcaaa aacaacgbba 2340 
ccaacgacab gaacaaagaa abccgbaacb gcbcctbcgc abgcabcaac tgcaacgcbb 24 0 0 
ccaccatcac ccaggcbtgc ccgaaagtba acbtcgaccc gabcccgabc cactacbgcg 2460 
cbccggctgg tbacgcbatc cbgaaatgca acaacaaaac cbbcbccggt aaaggtccgb 2520 
gcaacaacgb ttccaccgtt cagbgcaccc abggbabcaa accggbtgbb tccacccagc 2580 
bgctgctgaa cggbbccctg gctgaaaaag. aaabcabcat ccgbbccgaa aaccbgaccg 2640 
acaacgtbaa aaccabcabc gbbcacctga acaaabccgb bgaaabcgbt tgcacccgbg 2700 
ctagcgctta cbgcaacatc bccggbbcca aatggaacga aacccbgaaa cgbgbbaaag 2760 
aaaaacbgca ggaaaacbac aacaacaaca aaaccabcaa abbcgcbccg tcctccggbg 2820 
gtgaccbgga aatcaccacc cacbcctbca actgccgbgg bgaatbctbc tacbgcaaca 2 88 0 
ccacccgbct gbbcaacaac aacgctaccg aagacgaaac cabcaccctg ccgtgccgba 2940 
bcaaacagab catcaacabg bggcagggtg bbggbcgbgc babgbacgcb ccgccgabcg 3000 
cbggbaacab caccbgcaaa bccaacabca ccggbcbgcb gcbggttcgb gacggbggbg 306 0 
aagacaacaa aaccgaagaa abcbbccgbc cgggtggbgg baacabgaaa gacaacbggc 312 0 
gbbccgaacb gbacaaabac aaagbbabcg aacbgaaacc gctgggbatc gcbccgaccg 3180 
gbgcbaagcb baccgtbcag gctcgbcagc tgcbgbccbc cabcgbbcag cagcagbcca 3240 
accbgcbgcg bgcbatcgaa gcbcagcagc acabgcbgca gcbgaccgbb bggggbabca 3300 
aacagcbgca gacccgbgbb cbggcbabcg aacgbbaccb gaaagaccag cagctcgaga 3360 
bcbggaacaa cabgaccbgg abggaabggg accgbgaaab cbccaacbac accgacacca 3420 
bcbaccgbcb gcbggaagac bcccagaccc agcaggaaaa aaacgaaaaa gacctgcbgg 34 8 0 
ctcbggacbc cbggaaaaac cbgbggtccb ggbbcgacab cbccaacbgg cbgbggbgag 354 0 
gabccagabc bgcbgtgccb bcbagbbgcc agccabctgb bgbbbgcccc bcccccgbgc 3600 
cbbccbbgac ccbggaaggb gccactccca cbgtccbtbc cbaabaaaab gaggaaabbg 3660 
cabcgcabtg tcbgagbagg bgbcabtcba bbcbgggggg bggggbgggg caggacagca 3720 
agggggagga bbgggaagac aabagcaggc abgcbgggga bgcggbgggc bctabgggba 3780 
cccaggbgcb gaagaabbga cccggbtccb cctgggccag aaagaagcag gcacabcccc 3 84 0 
ttcbcbgbga cacacccbgt ccacgccccb ggtbcbbagt tccagcccca cbcabaggac 3900 
acbcabagct caggagggcb ccgccttcaa bcccacccgc taaagbacbb ggagcggbcb 3960 
ctcccbccct cabcagccca ccaaaccaaa cctagcctcc aagagbggga agaaabbaaa 4020 
gcaagabagg cbabbaagbg cagagggaga gaaaabgccb ccaacatgtg aggaagbaab 4080 
gagagaaabc abagaabbbb aaggccabga bbtaaggcca bcabggccbb aatcbbccgc 414 0 
ttccbcgctc acbgacbcgc bgcgctcggb cgtbcggcbg cggcgagcgg batcagcbca 4200 
ctcaaaggcg gbaabacggb babccacaga abcaggggab aacgcaggaa agaacabgbg 4260 
agcaaaaggc cagcaaaagg ccaggaaccg baaaaaggcc gcgbtgcbgg cgbbttbcca 432 0 
baggcbccgc cccccbgacg agcabcacaa aaabcgacgc bcaagbcaga ggbggcgaaa 43 80 
cccgacagga cbabaaagab accaggcgbb bcccccbgga agcbcccbcg bgcgctcbcc 444 0 
bgbtccgacc cbgccgcbba ccggabacct gtccgcctbb ctcccbtcgg gaagcgtggc 4500 
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gctttctcat agctcacgcb gtaggbatct cagttcggtg taggtcgttc gctccaagct 4560 
gggctgtgtg cacgaacccc ccgtbcagcc cgaccgcbgc gccttatccg gtaactatcg 4620 
bcbbgagbcc aacccggtaa gacacgactt abcgccactg gcagcagcca ctggtaacag 4680 
gattagcaga gcgaggtabg taggcggtgc fcacagagttc bbgaagbggb ggccbaacba 4740 
cggctacact agaagaacag tafcttggtat cbgcgcbctg ctgaagccag ttaccttcgg 4800 
aaaaagagtt ggtagctctfc gatccggcaa acaaaccacc gctggbagcg gbggbbbbbb 4860 
bgbbbgcaag cagcagabba cgcgcagaaa aaaaggatct caagaagatc cbbtgabcbb 4920 
ttctacgggg tctgacgctc agtggaacga aaacbcacgb baagggatbb tggtcatgag 4980 
attatcaaaa aggabcbbca cctagatcct tttaaatbaa aaatgaagtt ttaaatcaat 5040 
ctaaagtata tatgagtaaa cbtggbctga cagttaccaa tgcttaatca gtgaggcacc 5100 
batctcagcg atctgtctat ttcgttcatc catagttgcc bgactcgggg ggggggggcg 5160 
cbgaggbcbg cctcgtgaag aaggtgttgc tgacbcatac caggcctgaa tcgccccatc 5220 
abccagccag aaagtgaggg agccacggbb gatgagagct btgbtgbagg tggaccagtt 5280 
ggbgabbbbg aactbtbgct bbgccacgga acggbctgcg tbgbcgggaa gabgcgbgab 5340 
ctgabccbtc aactcagcaa aagbtcgabt babbcaacaa agccgccgbc ccgtcaagtc 54 00 
agcgtaatgc bcbgccagtg bbacaaccaa tbaaccaabb ctgatbagaa aaacbcabcg 5460 
agcatcaaab gaaacbgcaa tttatbcata bcaggatbab caataccata btbbbgaaaa 5520 
agccgbbbcb gbaabgaagg agaaaacbca ccgaggcagb bccataggat ggcaagabcc 5580 
tggtatcggt ctgcgabtcc gactcgbcca acabcaabac aaccbatbaa btbccccbcg 5640 
bcaaaaabaa ggbbabcaag tgagaaabca ccabgagbga cgacbgaabc cggtgagaat 57 00 
ggcaaaagcb babgcabbbc bbbccagacb tgbbcaacag gccagccabb acgcbcgbca 5760 
tcaaaabcac bcgcabcaac caaaccgbta bbcabbcgbg abbgcgccbg agcgagacga 5820 
aabacgcgab cgctgttaaa aggacaabba caaacaggaa tcgaatgcaa ccggcgcagg 5880 
aacacbgcca gcgcatcaac aatabbbbca cctgaatcag gababbcbbc baabaccbgg 5940 
aabgcbgbbb bcccggggab cgcagbggbg agtaaccabg cabcabcagg agbacggaba 6000 
aaatgcbbga bggbcggaag aggcataaab bccgbcagcc agtbbagtcb gaccabcbca 6060 
bcbgbaacab cabbggcaac gcbaccbbbg ccabgbbbca gaaacaacbc bggcgcatcg 6120 
ggcbbcccab acaatcgaba gabbgbcgca ccbgabbgcc cgacatbabc gcgagcccab 6180 
bbabacccat ataaabcagc atccabgtbg gaabbbaatc gcggccbcga gcaagacgbb 6240 
bcccgbtgaa batggcbcat aacaccccbb gbatbacbgb btabgtaagc agacagtbbt 63 0 0 
atbgbbcabg abgatababb bbbabcbbgb gcaabgbaac abcagagabb bbgagacaca 6360 
acgbggcbtb cccccccccc ccabbabbga agcabtbatc agggbbatbg bcbcabgagc 6420 
ggatacabab bbgaabgbat bbagaaaaab aaacaaabag gggbbccgcg cacabbbccc 64 80 
cgaaaagbgc cacctgacgt cbaagaaacc abbabtabca bgacatbaac cbabaaaaab 6540 
aggcgtabca cgaggcccbb bcgbc 6565 



<210> 137 
<211> 6526 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 137 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagctbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
tbggcgggtg tcggggcbgg cbbaacbabg cggcabcaga gcagabbgba cbgagagbgc 180 
accatabgcg gtgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
cbabbggcca bbgcatacgb bgtatccaba bcabaababg bacabbbaba bbggcbcabg 300 
bccaacabba ccgccabgbb gacabbgabb abbgacbagb batbaabagt aabcaabbac 360 
ggggtcabta gbbcabagcc catababgga gbbccgcgbb acataactba cggbaaabgg 420 
cccgccbggc tgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 480 
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cabagbaacg ccaabaggga cbtbccabbg acgtcaatgg gtggagtatt tacggtaaac 540 
bgcccacbbg gcagtacatc aagbgbabca babgccaagb acgcccccta bbgacgbcaa 600 
tgacggbaaa tggcccgcct ggcattafcgc ccagtacatg accttabggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgbggabagc ggbbbgacbc acggggattfc ccaagtctcc accccahbga 780 
cgbcaabggg agtttgtttt ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
ctccgcccca bbgacgcaaa tgggcggtag gcgbgbacgg tgggaggtct atabaagcag 900 
agctcgttta gtgaaccgbc agatcgcctg gagacgccab ccacgctgbb bbgacctcca 960 
bagaagacac cgggaccgab ccagcctccg cggccgggaa cggbgcabbg gaacgcggab 102 0 
bccccgbgcc aagagbgacg baagtaccgc cbabagacbc babaggcaca ccccbbbggc 1080 
bctbabgcab gcbabacbgb bbbbggcbbg gggccbatac acccccgctb cctbatgcba 1140 
baggtgabgg babagcbbag ccbabaggbg tgggbbatbg accabbabbg accacbcccc 1200 
babbggbgac gabacbbbcc abbacbaabc cataacabgg ctcbtbgcca caacbatcbc 1260 
babbggcbat atgccaabac tcbgbccbbc agagacbgac acggacbcbg babbbbbaca 1320 
ggabggggbc ccabbbabta bbtacaaabb cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbtbbt abbaaacaba gcgbgggabc bccacgcgaa bcbcgggbac gbgbbccgga 1440 
cabgggcbct bcbccggbag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
agcggctcab ggbcgcbcgg cagcbcctbg cbccbaacag bggaggccag actbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gtabgbgbcb 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gtabbcbgab aagagbcaga ggbaacbccc 1740 
gbbgcggbgc bgbbaacggb ggagggcagb gbagbcbgag cagbacbcgt bgcbgccgcg 1800 
cgcgccacca gacabaabag cbgacagact aacagacbgb tccbbbccat gggbctttbc 1860 
bgcagbcacc gbcgbcgaca cgbgbgabca gababcgcgg ccgcbcbaga gababcgcca 1920 
ccabgcgbgb bcgbggtabc ccgcgbaacb ggccgcagtg gbggabgbgg ggbabcctgg 1980 
gtbbctggab gabcabcabc tgccgbgbbg tbggbaacab gbgggbbacc gbbbacbacg 2040 
gbgbbccggb btggaccgac gcbaaaacca ccctgbbcbg cgctbccgac accaaagccb 2100 
acgaccgbga agbbcacaac gbbbgggcba cccacgcbbg cgtbccgacc gacccgaacc 2160 
cgcaggaaab cgbbcbggaa aacgtbaccg aaaacbbcaa cabgbggaaa aacgacabgg 2220 
bbgaccagab gcacgaagac abcabcbccc bgbgggacca gbcccbgaaa ccgbgcgtba 2280 
aacbgacccc gcbgbgcgbt acccbgcacb gcaccaacgc baccbbcaaa aacaacgtba 2340 
ccaacgacab gaacaaagaa atccgbaacb gcbcctbcgc abgcabcaac bgcaacgcbb 2400 
ccaccabcac ccaggcbbgc ccgaaagbba acbbcgaccc gabcccgabc cacbacbgcg 2460 
cbccggcbgg bbacgcbatc cbgaaabgca acaacaaaac cbbcbccggb aaaggbccgb 2520 
gcaacaacgb bbccaccgbb cagbgcaccc abggbabcaa accggtbgbb bccacccagc 2580 
tgcbgcbgaa cggbbcccbg gcbgaaaaag aaabcatcab ccgtbccgaa aaccbgaccg 2640 
acaacgbbaa aaccabcatc gbbcacctga acaaabccgb bgaaabcgbb bgcacccgtg 2700 
cbagcgcbba cbgcaacabc tccggbbcca aabggaacga aacccbgaaa cgbgbbaaag 2760 
aaaaacbgca ggaaaacbac aacaacaaca aaaccabcaa abtcgcbccg tcctccggtg 2820 
gbgaccbgga aabcaccacc cacbccbbca acbgccgbgg bgaabbcbbc bacbgcaaca 2880 
ccgcbagcct gccgbgccgb abcaaacaga tcabcaacat gbggcagggb gbbggbcgbg 2940 
cbatgbacgc bccgccgabc gcbggbaaca bcaccbgcaa abccaacabc accggbcbgc 3000 
bgcbggbbcg bgacggbggb gaagacaaca aaaccgaaga aabcbbccgt ccgggfcggbg 3060 
gbaacabgaa agacaacbgg cgtbccgaac bgbacaaata caaagbbabc gaacbgaaac 3120 
cgcbgggbab cgcbccgacc ggbgcbaagc bbaccgbbca ggctcgbcag cbgcbgbccb 3180 
ccabcgbbca gcagcagbcc aacctgcbgc gbgcbabcga agcbcagcag cacatgcbgc 324 0 
agcbgaccgb tbggggbabc aaacagcbgc agacccgbgb bcbggcbabc gaacgbbacc 3300 
bgaaagacca gcagcbcgag abcbggaaca acabgaccbg gabggaabgg gaccgbgaaa 3360 
bctccaacba caccgacacc abcbaccgbc bgcbggaaga cbcccagacc cagcaggaaa 3420 
aaaacgaaaa agaccbgcbg gcbcbggacb ccbggaaaaa ccbgbggbcc bggbbcgaca 3480 
bcbccaacbg gcbgbggbga ggabccagab cbgcbgbgcc bbcbagtbgc cagccabcbg 3540 
bbgbbbgccc cbcccccgtg ccbbccbbga cccbggaagg bgccactccc actgbccbbb 3600 
ccbaabaaaa bgaggaaatb gcabcgcabb gbcbgagbag gbgbcabbcb abbcbggggg 3660 
gbggggbggg gcaggacagc aagggggagg abbgggaaga caabagcagg catgcbgggg 3720 
abgcggbggg cbcbabgggb acccaggbgc bgaagaabbg acccggbbcc bccbgggcca 3780 
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gaaagaagca ggcacatccc cbbctcbgtg acacaccctg tccacgcccc tggttcttag 3 840 
ttccagcccc acbcabagga cactcabagc tcaggagggc tccgcctbca atcccacccg 3 900 
ctaaagtact tggagcggtc tctccctccc tcatcagccc accaaaccaa acctagcctc 3960 
caagagbggg aagaaattaa agcaagatag gcbatbaagb gcagagggag agaaaatgcc 4 020 
tccaacatgt gaggaagtaa tgagagaaat catagaattt taaggccatg atttaaggcc 40 80 
atcatggcct taatcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 4140 
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aabcagggga 4200 
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 4260 
cgcgttgctg gcgtttttcc abaggcbccg cccccctgac gagcatcaca aaaatcgacg 4320 
ctcaagtcag aggtggcgaa acccgacagg acbataaaga taccaggcgt ttccccctgg 43 80 
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 4440 
bctcccbbcg ggaagcgtgg cgctttctca bagctcacgc tgtaggtatc bcagbbcggb 4500 
gtaggtcgtt cgctccaagc tgggcbgbgb gcacgaaccc cccgttcagc ccgaccgctg 4560 
cgccbbatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact 4620 
ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 4680 
cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta bcbgcgcbcb 4740 
gctgaagcca gbbaccbbcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 4 800 
cgctggtagc ggtggttttt ttgtttgcaa gcagcagabt acgcgcagaa aaaaaggatc 4860 
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 4 920 
ttaagggatt bbggbcabga gattatcaaa aaggatctbc acctagatcc btbtaaabba 4980 
aaaabgaagb bbbaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 504 0 
abgcbbaabc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 5100 
ctgactcggg gggggggggc gcbgaggbcb gcctcgtgaa gaaggtgttg ctgactcata 5160 
ccaggcctga atcgccccat catccagcca gaaagtgagg gagccacggt bgabgagagc 5220 
tttgttgtag gbggaccagt tggtgatttt gaacttbbgc tttgccacgg aacggtctgc 5280 
gttgtcggga agatgcgtga bcbgatcctt caactcagca aaagttcgat bbabtcaaca 5340 
aagccgccgt cccgtcaagt cagcgtaatg ctctgccagt gttacaacca attaaccaat 54 00 
bcbgabtaga aaaactcatc gagcatcaaa tgaaactgca abbbabbcab atcaggatta 5460 
bcaabaccab atttttgaaa aagccgtttc tgtaatgaag gagaaaactc accgaggcag 552 0 
ttccatagga tggcaagatc ctggtatcgg tctgcgattc cgactcgtcc aacatcaata 5580 
caacctatta atttcccctc gtcaaaaata aggttatcaa gbgagaaabc accatgagtg 5640 
acgactgaat ccggtgagaa tggcaaaagc bbabgcatbb cbbbccagac ttgttcaaca 5700 
ggccagccat tacgctcgtc afccaaaatca ctcgcatcaa ccaaaccgfct attcattcgt 5760 
gattgcgcct gagcgagacg aaatacgcga tcgctgttaa aaggacaatt acaaacagga 5820 
atcgaatgca accggcgcag gaacactgcc agcgcatcaa caatattttc acctgaatca 58 80 
ggatattctt ctaatacctg gaatgctgtt ttcccgggga tcgcagtggt gagtaaccat 5940 
gcafccatcag gagtacggat aaaatgcttg atggtcggaa gaggcataaa ttccgtcagc 6000 
cagtttagtc tgaccatctc atctgtaaca tcattggcaa cgctaccttt gccatgtttc 6060 
agaaacaact ctggcgcatc gggcttccca tacaatcgat agattgtcgc acctgattgc 6120 
ccgacattat cgcgagccca tttataccca tataaatcag catccatgtt ggaatttaat 6180 
cgcggcctcg agcaagacgt tbcccgttga abatggcbca taacacccct bgtatbacbg 6240 
bbtabgbaag cagacagbbb babbgbbcab gabgatabab tbtbabcbbg bgcaabgbaa 6300 
catcagagab bbbgagacac aacgbggcbb tccccccccc cccabbabbg aagcabbbab 6360 
cagggbbabb gbcbcatgag cggabacaba bbbgaabgba bbbagaaaaa baaacaaaba 6420 
g999tbccgc gcacabbbcc ccgaaaagbg ccaccbgacg bcbaagaaac cabbabbabc 64 80 
abgacatbaa ccbabaaaaa baggcgbabc acgaggcccb bbcgbc 6526 



<210> 138 
<211> 6607 
<212> DNA 

<213> Arbificial Sequence 
<220> 
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<223> plasmid pVR1012x/s containing HIV genes 

<400> 138 

tcgcgcgttt cggtgabgac ggtgaaaacc tctgacacat gcagctcccg gagacggbca 60 
cagctbgbct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagbgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgbaaggag aaaataccgc atcagatbgg 240 
cbattggcca ttgcatacgt tgtatccata tcabaabatg tacatttata btggcbcatg 300 
tccaacatta ccgccatgtt gacattgatt abbgacbagt tattaatagt aatcaatbac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaabgg 420 
cccgcctggc bgaccgccca acgacccccg cccattgacg tcaataatga cgtatgtbcc 480 
catagtaacg ccaataggga cttbccattg acgbcaabgg gbggagtatt tacggbaaac 540 
tgcccacbtg gcagbacatc aagtgtabca tatgccaagt acgcccccta tbgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattabgc ccagtacabg accttabggg actttccbac 660 
ttggcagbac atctacgtat bagtcatcgc tabbaccatg gtgatgcggt bttggcagta 720 
catcaatggg cgtggatagc ggbttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgbgtacgg tgggaggtct atataagcag 900 
agctcgttba gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt btgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgbgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 114 0 
taggtgatgg tatagcttag cctataggtg tgggttabtg accabtattg accactcccc 1200 
tatbggtgac gatactttcc attacbaatc cataacatgg ctctttgcca caactatcbc 1260 
fcattggctat abgccaatac tctgtccttc agagacbgac acggactctg tattttfcaca 1320 
ggabggggtc ccatttabta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt abtaaacata gcgbgggatc tccacgcgaa tctcgggtac gtgttccgga 14 4 0 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg bggcggtagg gtabgtgtcb 1620 
gaaaatgagc gtggagatfcg ggctcgcacg gctgacgcag atggaagact baaggcagcg 1680 
gcagaagaag abgcaggcag ctgagttgtt gtabtcbgat aagagtcaga ggtaactccc 174 0 
gttgcggbgc tgttaacggt ggagggcagt gtagtcfcgag cagtactcgb tgctgccgcg 1800 
cgcgccacca gacataabag ctgacagact aacagactgt tcctttccab gggtctttbc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgtaacb ggccgcagtg gtggatgtgg ggtatcctgg 1980 
gbttctggat gatcabcabc bgccgtgttg ttggtaacat gtgggfctacc gtttactacg 204 0 
gtgttccggt ttggaccgac gctaaaacca cccbgttctg cgcfctccgac accaaagcct 2100 
acgaccgtga agttcacaac gtbtgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 2220 
ttgaccagat gcacgaagac atcabcbccc tgtgggacca gtccctgaaa ccgtgcgtta 2280 
aactgacccc gctgtgcgbt accctgcacb gcaccaacgc taccttcaaa aacaacgtta 2340 
ccaacgacab gaacaaagaa atccgtaacb gctccttcgc atgcatcaac bgcaacgcfct 2400 
ccaccatcac ccaggcttgc ccgaaagtta actbcgaccc gatcccgatc cactactgcg 24 60 
ctccggcbgg ttacgctabc ctgaaatgca acaacaaaac cttctccggt aaaggbccgt 2520 
gcaacaacgt btccaccgtt cagtgcaccc abggbatcaa accggttgtb tccacccagc 2580 
tgctgctgaa cggttccctg gctgaaaaag aaabcatcat ccgttccgaa aacctgaccg 264 0 
acaacgtbaa aaccatcatc gttcacctga acaaatccgt tgaaatcgtt tgcacccgtc 2700 
cgaacaacaa cacccgtaaa tccatgcgba tcggtccggg bcagaccttc tacgctaccg 2760 
gtgacatcat cggtgacatc cgtcaggctt acbgcaacat ctccggtbcc aaatggaacg 2820 
aaaccctgaa acgtgtbaaa gaaaaacbgc aggaaaacta caacaacaac aaaaccatca 2880 
aatbcgcbcc gtcctccggt ggtgacctgg aaatcaccac ccactccbtc aactgccgtg 2940 
gtgaabtctt ctactgcaac accgctagcc tgccgtgccg bafccaaacag atcatcaaca 3000 
tgtggcaggg tgttggtcgt gcbatgtacg ctccgccgat cgctggtaac atcacctgca 3 060 
aatccaacab caccggtctg cbgctggttc gtgacggtgg bgaagacaac aaaaccgaag 3120 
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aaatcttccg tccgggtggt ggtaacatga aagacaactg gcgttccgaa ctgtacaaat 3180 

acaaagttat cgaactgaaa ccgctgggta tcgctccgac cggtgctaag cttaccgttc 3240 

aggctcgtca gctgctgtcc tccatcgttc agcagcagtc caacctgchg cgtgctatcg 3300 

aagctcagca gcacatgctg cagctgaccg tttggggtat caaacagctg cagacccgtg 3360 

ttctggctat cgaacgttac ctgaaagacc agcagctcga gabctggaac aacatgacct 3420 

ggatggaatg ggaccgtgaa atctccaact acaccgacac catctaccgt ctgctggaag 34 80 

actcccagac ccagcaggaa aaaaacgaaa aagacctgct ggctctggac tcctggaaaa 3540 

acctgtggtc ctggttcgac atctccaact ggctgtggtg aggatccaga tctgctgtgc 3600 

cttctagttg ccagccatct gfctgtttgcc cctcccccgt gccttccttg accctggaag 3660 

gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat tgtctgagta 3720 

ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag gattgggaag 37 80 

acaatagcag gcatgctggg gatgcggtgg gctctatggg tacccaggtg ctgaagaatt 3840 

gacccggttc ctcctgggcc agaaagaagc aggcacatcc ccttctctgt gacacaccct 3900 

gtccacgccc ctggttctta gttccagccc cactcatagg acactcatag ctcaggaggg 3960 

ctccgccttc aatcccaccc gctaaagtac ttggagcggt ctctccctcc ctcatcagcc 4020 

caccaaacca aacctagcct ccaagagtgg gaagaaatta aagcaagata ggctattaag 4080 

tgcagaggga gagaaaatgc ctccaacatg tgaggaagta atgagagaaa tcatagaatt 4140 

ttaaggccat gatttaaggc catcatggcc ttaatcttcc gcttcctcgc tcactgactc 4200 

gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg 4260 

gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa 4320 

ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga 4 380 

cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag 4440 

ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct 4500 

taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg 4560 

ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc 4620 

ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt 4680 

aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta 4740 

tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaagaac 4800 

agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc 4860 

ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat 4920 

tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc 4 980 

tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt 5040 

cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta 5100 

aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct 5160 
atttcgttca' tccatagttg cctgactcgg gggggggggg cgctgaggtc tgcctcgtga 5220 

agaaggtgtt gctgactcat accaggcctg aatcgcccca tcatccagcc agaaagtgag 5280 

ggagccacgg ttgatgagag ctttgttgta ggtggaccag ttggtgattt tgaacttttg 5340 

ctttgccacg gaacggtctg cgttgtcggg aagatgcgtg atctgatcct tcaactcagc 5400 

aaaagttcga tttattcaac aaagccgccg tcccgtcaag tcagcgtaat gctctgccag 5460 

tgttacaacc aattaaccaa ttctgattag aaaaactcat cgagcatcaa atgaaactgc 5520 

aatttattca tatcaggatt atcaatacca tatttttgaa aaagccgttt ctgtaatgaa 5580 

ggagaaaact caccgaggca gttccatagg atggcaagat cctggtatcg gtctgcgatc 5640 

ccgactcgtc caacatcaat acaacctatt aatttcccct cgtcaaaaat aaggttatca 5700 

agtgagaaat caccatgagt gacgactgaa tccggtgaga atggcaaaag cttatgcatt 5760 

tctttccaga cttgttcaac aggccagcca ttacgctcgt catcaaaatc actcgcatca 5820 

accaaaccgt tattcattcg tgattgcgcc tgagcgagac gaaatacgcg atcgctgtta 5880 

aaaggacaat tacaaacagg aatcgaatgc aaccggcgca ggaacactgc cagcgcatca 594 0 

acaatatttt cacctgaatc aggatattct tctaatacct ggaatgctgt tttcccgggg 6000 

atcgcagtgg tgagtaacca tgcatcatca ggagtacgga taaaatgctt gatggtcgga 6060 

agaggcataa attccgtcag ccagtttagt ctgaccatct catctgtaac atcattggca 6120 

acgctacctt tgccatgttt cagaaacaac tctggcgcat cgggcttccc atacaatcga 6180 

tagattgtcg cacctgattg cccgacatta tcgcgagccc atthataccc atataaatca 6240 

gcatccatgt tggaatttaa tcgcggcctc gagcaagacg tttcccgttg aatatggctc 6300 

ataacacccc ttgtattact gtttatgtaa gcagacagtt ttattgttca tgatgatata 6360 

tttttatctt gtgcaatgta acatcagaga ttttgagaca caacgtggct ttcccccccc 6420 
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ccccattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 6480 

atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgac 6540 

gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat cacgaggccc 6600 
tttcgtc 6607 



<210> 139 
<211> 6676 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 139 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 72 0 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggtatcctgg 1980 
gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggttacc gtttactacg 2040 
gtgttccggt ttggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 2220 
ttgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgtta 2280 
aactgacccc gctgtgcgtt accctgcact gcaccaacgc taccttcaaa aacaacgtta 2340 
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ccaacgacat gaacaaagaa atccgtaact gcbccbbcaa caccaccacc gaaatccgtg 2400 
acaaaaaaca gcagggfctac gcbcbgbbcb accgtccgga cabcgbbcbg ctgaaagaaa 2460 
accgtaacaa ctccaacaac tccgaataca tcctgatcaa ctgcaacgct tccaccatca 2520 
cccaggcttg cccgaaagbb aacttcgacc cgatcccgat ccacbacbgc gctccggctg 2580 
gttacgctat cctgaaatgc aacaacaaaa ccttctccgg taaaggtccg tgcaacaacg 2640- 
tttccaccgt tcagtgcacc catggtatca aaccggttgt ttccacccag ctgctgctga 2700 
acggttccct ggctgaaaaa gaaabcabca bccgttccga aaaccbgacc gacaacgtta 2760 
aaaccatcat cgttcacctg aacaaatccg bbgaaabcgt bbgcacccgb gctagcgcth 2820 
actgcaacat ctccggttcc aaabggaacg aaaccctgaa acgtgttaaa gaaaaactgc 2880 
aggaaaacta caacaacaac aaaaccatca aabbcgcbcc gtcctccggt ggtgacctgg 2940 
aaatcaccac ccactccttc aactgccgtg gtgaattctt ctactgcaac accacccgtc 3000 
tgtbcaacaa caacgctacc gaagacgaaa ccatcaccct gccgbgccgb atcaaacaga 3060 
bcabcaacab gtggcagggt gbbggbcgbg cbabgbacgc tccgccgatc gctggbaaca 3120 
bcacctgcaa atccaacabc accggbctgc bgcbggbbcg bgacggbggb gaagacaaca 3180' 
aaaccgaaga aatcbbccgb ccgggbggbg gbaacabgaa agacaacbgg cgttccgaac 3240 
bgbacaaaba caaagttatc gaacbgaaac cgcbgggbat cgcbccgacc ggbgcbaagc 3300 
bbaccgbbca ggcbcgbcag cbgctgbccb ccabcgbbca gcagcagtcc aaccbgcbgc 3360 
gbgctatcga agcbcagcag cacabgctgc agcbgaccgt bbggggbatc aaacagcbgc 3420 
agacccgbgt bcbggctabc gaacgttacc bgaaagacca gcagctcgag atctggaaca 34 80 
acatgaccbg gabggaatgg gaccgtgaaa bcbccaacba caccgacacc atctaccgbc 3540 
tgcbggaaga cbcccagacc cagcaggaaa aaaacgaaaa agacctgcbg gcbcbggacb 3600 
ccbggaaaaa ccbgbggbcc bggtbcgaca bcbccaacbg gcbgbggbga ggabccagab 3660 
cbgcbgbgcc bbcbagbtgc cagccabcbg bbgbbtgccc ctcccccgbg ccbtccbbga 3720 
cccbggaagg bgccacbccc acbgbccbbb ccbaataaaa tgaggaaabb gcatcgcabb 3780 
gbcbgagtag gtgtcabtcb abbcbggggg gtggggbggg gcaggacagc aagggggagg 3 84 0 
abbgggaaga caatagcagg cabgctgggg atgcggtggg ctcbatgggb acccaggbgc 3900 
tgaagaatbg acccggbbcc bcctgggcca gaaagaagca ggcacabccc cttcbcbgbg 3960 
acacacccbg bccacgcccc bggbtcbbag bbccagcccc acbcatagga cactcatagc 4020 
tcaggagggc bccgcctbca abcccacccg ctaaagtacb bggagcggbc bcbccctccc 4 080 
bcabcagccc accaaaccaa accbagccbc caagagtggg aagaaabbaa agcaagabag 414 0 
gctabbaagb gcagagggag agaaaabgcc tccaacabgb gaggaagbaa bgagagaaab 4200 
cabagaatbb baaggccabg abbbaaggcc atcabggcct baabctbccg cbtccbcgcb 4260 
cacbgactcg cbgcgctcgg bcgbbcggcb gcggcgagcg gbabcagcbc acbcaaaggc 4320 
ggtaabacgg bbabccacag aabcagggga baacgcagga aagaacabgb gagcaaaagg 43 80 
ccagcaaaag gccaggaacc gbaaaaaggc cgcgbbgcbg gcgbbttbcc ataggcbccg 4440 
cccccctgac gagcabcaca aaaabcgacg ctcaagtcag aggbggcgaa acccgacagg 4500 
acbabaaaga baccaggcgb tbccccctgg aagcbcccbc gbgcgctcbc ctgtbccgac 4560 
ccbgccgcbb accggatacc tgtccgcctb bcbccctbcg ggaagcgbgg cgctbtcbca 4620 
bagcbcacgc bgbaggbabc bcagbbcggb gtaggbcgbb cgcbccaagc tgggcbgbgb 4680 
gcacgaaccc cccgbtcagc ccgaccgcbg cgccbbabcc ggbaacbabc gtcbbgagbc 4740 
caacccggba agacacgacb babcgccacb ggcagcagcc acbggtaaca ggatbagcag 4 800 
agcgaggtab gbaggcggbg ctacagagbt ctbgaagbgg bggcctaacb acggcbacac 4 860 
tagaagaaca gbabbbggta bcbgcgctcb gcbgaagcca gbbaccbbcg gaaaaagagb 4920 
bggbagctcb bgabccggca aacaaaccac cgcbggbagc ggbggbbbbb bbgbbbgcaa 4980 
gcagcagabb acgcgcagaa aaaaaggatc bcaagaagab ccbbbgatcb bbbcbacggg 5040 
gbcbgacgcb cagbggaacg aaaacbcacg bbaagggabb bbggbcatga gabbabcaaa 5100 
aaggabctbc accbagabcc bbbtaaatba aaaabgaagb bbbaaabcaa bcbaaagbat 5160 
ababgagtaa acbbggbcbg acagbbacca atgctbaatc agbgaggcac cbatcbcagc 5220 
gabcbgbcba bbbcgbtcat ccabagbtgc ctgacbcggg gggggggggc gcbgaggbcb 5280 
gccbcgbgaa gaaggbgbbg cbgacbcaba ccaggccbga abcgccccab cabccagcca 5340 
gaaagbgagg gagccacggt bgabgagagc btbgbbgbag gbggaccagb bggtgabbbb 5400 
gaacbbbbgc bbbgccacgg aacggbcbgc gbbgtcggga agabgcgtga bcbgabccbb 5460 
caacbcagca aaagbbcgat bbatbcaaca aagccgccgb cccgbcaagb cagcgbaatg 5520 
cbcbgccagb gbbacaacca abbaaccaab bcbgabbaga aaaacbcabc gagcabcaaa 5580 
tgaaacbgca abbbabbcab abcaggatba bcaabaccab abbbbbgaaa aagccgbbbc 5640 
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tgtaatgaag gagaaaacbc accgaggcag ttccatagga tggcaagatc ctggtatcgg 5700 
tctgcgattc cgactcgtcc aacatcaata caacctatta atfctcccctc gtcaaaaata 5760 
aggttatcaa gtgagaaatc accatgagtg acgactgaat ccggfcgagaa tggcaaaagc 5820 
ttatgcattt ctttccagac ttgttcaaca ggccagccat tacgctcgtc atcaaaatca 5880 
ctcgcatcaa ccaaaccgtt attcattcgt gattgcgcct gagcgagacg aaatacgcga 5940 
tcgctgttaa aaggacaatt acaaacagga atcgaatgca accggcgcag gaacactgcc 6000 
agcgcatcaa caatattttc acctgaatca ggatattctfc ctaatacctg gaatgctgtt 6060 
ttcccgggga tcgcagtggt gagtaaccat gcatcatcag gagtacggat aaaatgcttg 6120 
atggtcggaa gaggcataaa ttccgtcagc cagtttagtc tgaccatctc atctgtaaca 6180 
bcattggcaa cgctaccttt gccatgtttc agaaacaact ctggcgcatc gggcttccca 6240 
tacaatcgat agattgtcgc acctgattgc ccgacattat cgcgagccca tttataccca 63 00 
tataaatcag catccatgtt ggaatttaat cgcggcctcg agcaagacgt ttcccgttga 63 60 
atatggctca taacacccct tgtattactg tttatgtaag cagacagttt tattgttcat 6420 
gatgatatat ttttatcttg tgcaatgtaa catcagagat tttgagacac aacgtggctt 6480 
bccccccccc cccattattg aagcatttat cagggttatt gtctcatgag cggatacata 6540 
tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg 6600 
ccacctgacg tctaagaaac cattattatc atgacattaa cctataaaaa taggcgtatc 6660 
acgaggccct ttcgtc 6676 



<210> 140 
<211> 6637 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/6 containing HIV genes 
<400> 140 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 72 0 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 78 0 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
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agcggctcat ggtcgcbcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac caghgtgccg cacaaggccg tggcggbagg gtatgtgtct 1620 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag cbgagtbgbb gtabtctgat aagagtcaga ggbaacbccc 1740 

gttgcggtgc bgbbaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacataatag cbgacagacfc aacagactgt bcctbbccab gggtcbbbbc 1860 

bgcagbcacc gtcgtcgaca cgtgbgatrca gatatcgcgg ccgctctaga gatatcgcca 1920 

ccabgcgtgb bcgtggbabc ccgcgbaact ggccgcagtg gbggabgbgg ggbabccbgg 1980 

gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggttacc gtbtacbacg 2040 

gtgbtccggt bbggaccgac gctaaaacca cccbgbbcbg cgcbbccgac accaaagcct 2100 

acgaccgbga agtbcacaac gbbtgggcba cccacgcbbg cgbbccgacc gacccgaacc 2160 

cgcaggaaab cgtbctggaa aacgbbaccg aaaacbbcaa cabgbggaaa aacgacafcgg 2220 

tbgaccagab gcacgaagac abcabcbccc bgbgggacca gbccctgaaa ccgbgcgtta 2280 

aacbgacccc gctgtgcgbb acccbgcact gcaccaacgc baccbbcaaa aacaacgbba 2340 

ccaacgacab gaacaaagaa abccgbaacb gcbccbbcaa caccaccacc gaaabccgtg 2400 

acaaaaaaca gcagggbtac gcbcbgbtcb accgtccgga cabcgttcbg cbgaaagaaa 24 60 

accgfcaacaa cbccaacaac bccgaabaca bccbgabcaa cbgcaacgcb bccaccabca 2520 

cccaggctbg cccgaaagbt aacbtcgacc cgafccccgab ccacbacbgc gctccggcbg 2580 

gbbacgcbab ccbgaaabgc aacaacaaaa ccbbcbccgg baaaggbccg bgcaacaacg 2640 

bbbccaccgb bcagbgcacc cabggbabca aaccggtbgb bbccacccag cbgcbgcbga 2700 

acggbbcccb ggcbgaaaaa gaaabcabca bccgbbccga aaaccbgacc gacaacgbba 2760 

aaaccabcab cgbtcaccbg aacaaabccg ttgaaatcgb tbgcacccgt gcbagcgcbb 2820 

acbgcaacab cbccggbtcc aaabggaacg aaaccctgaa acgbgbbaaa gaaaaactgc 2880 

aggaaaacta caacaacaac aaaaccatca aabbcgcbcc gbcctccggt ggbgaccbgg 2940 

aaabcaccac ccacbccbbc aacbgccgbg gtgaabbcbb cbacbgcaac accgcbagcc 3000 

bgccgbgccg babcaaacag abcabcaaca bgbggcaggg bgbbggbcgt gcbabgbacg 3060 

cbccgccgab cgcbggbaac abcaccbgca aabccaacab caccggbcbg cbgcbggbbc 3120 

gbgacggbgg bgaagacaac aaaaccgaag aaabcbbccg bccgggbggt ggbaacabga 3180 

aagacaacbg gcgbbccgaa cbgbacaaab acaaagbbat cgaacbgaaa ccgcbgggba 3240 

bcgcbccgac cggbgcbaag cbtaccgbbc aggcbcgbca gcbgcbgtcc bccabcgbbc 3300 

agcagcagbc caaccbgcbg cgbgcbabcg aagcbcagca gcacabgcbg cagcbgaccg 3360 

bbbggggbab caaacagcbg cagacccgbg btcbggcbab cgaacgbbac cbgaaagacc 3420 

agcagcbcga gabctggaac aacabgaccb ggabggaabg ggaccgbgaa abctccaacb 34 80 

acaccgacac catctaccgb cbgcbggaag acbcccagac ccagcaggaa aaaaacgaaa 354 0 

aagacctgcb ggctcbggac bcctggaaaa accbgbggbc cbggbbcgac abcbccaacb 3600 

ggcbgbggbg aggatccaga bcbgcbgbgc cbbcbagbbg ccagccabcb gbbgbbbgcc 3660 

ccbcccccgb gccbbccbbg acccbggaag gtgccacbcc cacbgbccbb bccbaabaaa 3720 

abgaggaaab bgcabcgcab bgbcbgagba ggbgbcabbc babbcbgggg ggbggggbgg 37 80 

ggcaggacag caagggggag gabbgggaag acaabagcag gcabgcbggg gabgcggtgg 3840 

gctcbabggg bacccaggbg cbgaagaabb gacccggbbc cbccbgggcc agaaagaagc 3900 

aggcacatcc ccbtcbcbgb gacacacccb gbccacgccc cbggtbcbba gbbccagccc 3960 

cacbcabagg acacbcabag cbcaggaggg ctccgccbbc aabcccaccc gcbaaagbac 4020 

bbggagcggb cbcbcccbcc cbcabcagcc caccaaacca aaccbagccb ccaagagbgg 40 BO 

gaagaaabba aagcaagaba ggcbabbaag bgcagaggga gagaaaabgc cbccaacatg 4140 

bgaggaagba abgagagaaa bcabagaabb bbaaggccab gabbbaaggc cabcabggcc 4200 

ttaatcbtcc gcbbccbcgc bcacbgacbc gcbgcgcbcg gbcgbbcggc bgcggcgagc 4260 

ggbabcagcb cacbcaaagg cggbaabacg gbbabccaca gaabcagggg abaacgcagg 4320 

aaagaacabg bgagcaaaag gccagcaaaa ggccaggaac cgbaaaaagg ccgcgbbgcb 43 80 

ggcgbbbtbc cabaggcbcc gccccccbga cgagcabcac aaaaabcgac gcbcaagtca 4440 

gaggtggcga aacccgacag gacbabaaag abaccaggcg bbbccccctg gaagcbcccb 4500 

cgbgcgctcb ccbgbbccga cccbgccgct baccggabac cbgbccgccb ttcbccctbc 4560 

gggaagcgbg gcgcbbbcbc abagcbcacg cbgtaggbab cbcagbbcgg tgbaggbcgt 4620 

bcgcbccaag cbgggcbgbg bgcacgaacc ccccgbbcag cccgaccgcb gcgccbtabc 4680 

cggbaacfcab cgbcbbgagb ccaacccggb aagacacgac btabcgccac bggcagcagc 4740 

cacbggbaac aggabbagca gagcgaggta bgtaggcggb gcbacagagb tcbbgaagtg 4800 
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gtggcctaac tacggctaca ctagaagaac 
agttaccttc ggaaaaagag tbggbagctc 
cggtggtttt tttgtttgca agcagcagat 
bcctbtgabc btbbcbacgg ggtctgacgc 
tbbggbcatg agatbabcaa aaaggahctt 
tbttaaatca atcfcaaagta tatabgagba 
cagtgaggca cctabctcag cgatctgtct 
gggggggggg cgctgaggtc tgcctcgtga 
aatcgcccca tcatccagcc agaaagtgag 
ggtggaccag bbggbgattb tgaactbtbg 
aagatgcgtg abctgatcct tcaactcagc 
tcccgtcaag tcagcgtaat gctctgccag 
aaaaactcat cgagcatcaa atgaaactgc 
tabbbbtgaa aaagccgttt ctgtaatgaa 
atggcaagat cctggtatcg gtctgcgatt 
aabbbcccct cgtcaaaaat aaggttatca 
tccggtgaga atggcaaaag cttatgcatt 
Ctacgctcgt catcaaaatc actcgcatca 
tgagcgagac gaaatacgcg atcgctgtta 
aaccggcgca ggaacactgc cagcgcabca 
tctaatacct ggaatgctgt tttcccgggg 
ggagtacgga taaaabgcbb gabggtcgga 
ctgaccabcb cabctgtaac atcattggca 
bctggcgcab cgggctbccc abacaabcga 
bcgcgagccc abbbataccc abataaabca 
gagcaagacg bbtcccgbbg aatabggctc 
gcagacagbb bbabtgbbca bgabgababa 
bbbbgagaca caacgbggcb tbcccccccc 
bgbctcatga gcggabacab abbbgaabgb 
cgcacabbbc cccgaaaagb gccaccbgac 
acctabaaaa abaggcgbab cacgaggccc 



agbabbtggb abcbgcgcbc bgctgaagcc 4860 
btgabccggc aaacaaacca ccgctggbag 4 920 
bacgcgcaga aaaaaaggab cbcaagaaga 4980 
bcagbggaac gaaaacbcac gtbaagggab 5040 
caccbagabc cbbbtaaatb 'aaaaatgaag 5100 
aacbbggbcb gacagbbacc aabgcbtaab 5160 
abttcgtbca bccabagbbg ccbgactcgg 5220 
agaaggbgbb gctgacbcat accaggccbg 5280 
ggagccacgg bbgatgagag cbbtgbbgba 5340 
cbbtgccacg gaacggbctg cgbtgtcggg 5400 
aaaagbtcga bbtatbcaac aaagccgccg 5460 
tgttacaacc aatbaaccaa ttctgatbag 5520 
aabbbabbca batcaggatb abcaatacca 5580 
ggagaaaact caccgaggca gbtccatagg 5640 
ccgacbcgbc caacabcaat acaacctatb 5700 
agtgagaaab caccatgagt gacgacbgaa 5760 
tcbbbccaga cbtgtbcaac aggccagcca 5820 
accaaaccgb tabbcabbcg bgattgcgcc 5880 
aaaggacaab tacaaacagg aabcgaatgc 5940 
acaatabbtb caccbgaatc aggabattcb 6000 
abcgcagbgg bgagbaacca tgcabcatca 6060 
agaggcataa atbccgbcag ccagtbbagb 6120 
acgctaccbt tgccabgbbb cagaaacaac 6180 
tagabbgbcg caccbgabtg cccgacatba 6240 
gcabccatgb bggaattbaa tcgcggcctc 6300 
abaacacccc btgbattacb gtbbabgtaa 6360 
bbtbbabcbt gtgcaatgba acabcagaga 6420 
ccccabtabb gaagcatbba bcagggbbab 6480 
abbbagaaaa abaaacaaab aggggbbccg 6540 
gbcbaagaaa ccabbabbat catgacabba 6600 
bbbcgbc 6637 



<210> 141 
<211> 6718 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 141 

bcgcgcgbbb cggtgabgac ggtgaaaacc tcbgacacab gcagcbcccg gagacggtca 60 
cagcbbgbcb gtaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 120 
btgSJcgggtg tc gggg c ^gg cbbaacbatg cggcabcaga gcagatbgba cbgagagbgc 180 
accababgcg gtgbgaaaba ccgcacagab gcgtaaggag aaaataccgc abcagatbgg 240 
cbatbggcca bbgcabacgt bgtabccaba bcataatabg tacabtbaba bbggctcabg 300 
bccaacabba ccgccabgbt gacabtgabb abbgacbagb babbaabagb aabcaatbac 360 
ggggbcabta gtbcabagcc cabatabgga gtbccgcgtb acabaactba cggtaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaatga cgbabgbbcc 480 
cabagtaacg ccaabaggga cbbbccabtg acgtcaabgg gtggagbabb tacggtaaac 540 
tgcccactbg gcagbacabc aagbgbabca babgccaagb acgcccccba btgacgtcaa 600 
bgacggbaaa bggcccgccb ggcattabgc ccagbacatg accbtabggg actbbcctac 660 
bbggcagtac atctacgbab tagbcabcgc battaccatg gbgabgcggb bbbggcagba 720 
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catcaatggg cgtggatagc ggbbbgacbc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa bgggcggbag gcgtgtacgg tgggaggtct ababaagcag 900 
agcbcgbbba gtgaaccgbc agabcgccbg gagacgccat ccacgctgbb ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggbgcabbg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctbtggc 1080 
tcttatgcafc gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tabagcbbag cctataggtg tgggttattg accattattg accactcccc 12 00 
tattggtgac gatactttcc attactaatc cataacatgg cbcbbbgcca caacfcatctc 1260 
tattggctat atgccaabac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccabttabba tttacaaatt cacabataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa hctcgggtac gtgttccgga 1440 
catgggctct fcctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag bggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg bggcggbagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggcbcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagbctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctbtccat gggtcttttc 1860 
bgcagbcacc gtcgfccgaca cgtgtgatca gatahcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgbgb tcgtggtatc ccgcgtaact ggccgcagtg gbggatgbgg ggtabccbgg 1980 
gtbtcbggab gabcatcatc bgccgbgbbg bbggbaacab gtgggbbacc gbttacbacg 2040 
gbgttccggt ttggaccgac gcbaaaacca cccbgtbcbg cgcbtccgac accaaagccb 2100 
acgaccgtga agbbcacaac gbbbgggcta cccacgcbbg cgtbccgacc gacccgaacc 2160 
cgcaggaaab cgbtcbggaa aacgtbaccg aaaacttcaa catgbggaaa aacgacabgg 2220 
tbgaccagab gcacgaagac atcabctccc tgtgggacca gtccctgaaa ccgtgcgbba 2280 
aacbgacccc gctgbgcgbb acccbgcacb gcaccaacgc taccbtcaaa aacaacgbba 2340 
ccaacgacab gaacaaagaa abccgbaacb gcb'cctbcaa caccaccacc gaaatccgbg 2400 
acaaaaaaca gcagggbtac gcbcbgbbcb accgtccgga cabcgbbcbg cbgaaagaaa 2460 
accgtaacaa ctccaacaac bccgaataca tccbgatcaa ctgcaacgct tccaccabca 2520 
cccaggctbg cccgaaagtt aacttcgacc cgatcccgab ccactactgc gcbccggcbg 2580 
gbbacgctat ccbgaaabgc aacaacaaaa cctbcbccgg taaaggbccg bgcaacaacg 2640 
bbtccaccgt tcagtgcacc catggbatca aaccggtbgt bbccacccag ctgcbgcbga 2700 
acggbtcccb ggcbgaaaaa gaaabcabca bccgbbccga aaaccbgacc gacaacgbba 2760 
aaaccabcab cgbtcaccbg aacaaabccg ttgaaabcgt ttgcacccgt ccgaacaaca 2820 
acacccgtaa atccatgcgb abcggtccgg gbcagaccbt cbacgcbacc ggbgacabca 2880 
tcggbgacab ccgbcaggcb bacbgcaaca bctccggbbc caaatggaac gaaacccbga 294 0 
aacgtgbbaa agaaaaacbg caggaaaact acaacaacaa caaaaccabc aaabbcgctc 3000 
cgbccbccgg tggbgaccbg gaaatcacca cccacbccbt caacbgccgb ggtgaabbct 3060 
bcbacbgcaa caccgctagc cbgccgtgcc gtabcaaaca gabcatcaac abgtggcagg 3120 
gtgbbggtcg tgcbabgbac gcbccgccga bcgcbggbaa cabcaccbgc aaabccaaca 3180 
bcaccggbcb gctgctggbb cgtgacggtg gbgaagacaa fcaaaaccgaa gaaabctbcc 3240 
gtccgggbgg bggtaacabg aaagacaacb ggcgbbccga acbgtacaaa bacaaagtta 3300 
bcgaacbgaa accgctgggt abcgcbccga ccggtgcbaa gcbbaccgbb caggcbcgbc 3360 
agctgcbgtc ctccatcgbt cagcagcagb ccaaccbgct gcgbgcbabc gaagcbcagc 3420 
agcacatgcb gcagcbgacc gbtbggggta bcaaacagcb gcagacccgb gbbcbggcba 3480 
tcgaacgtta ccbgaaagac cagcagcbcg agabcbggaa caacabgacc bggabggaab 3540 
gggaccgtga aabctccaac bacaccgaca ccatctaccg tcbgcbggaa gacbcccaga 3600 
cccagcagga aaaaaacgaa aaagaccbgc bggcbcbgga cbcctggaaa aaccbgbggt 3660 
ccbggbbcga cabctccaac tggcbgbggb gaggatccag abctgcbgbg ccbbcbagtb 3720 
gccagccabc tgbbgbbbgc ccctcccccg bgccttccbt gaccctggaa ggbgccacbc 3780 
ccactgbccb ttccbaabaa aabgaggaaa tbgcatcgca bbgtctgagb aggbgbcatb 3840 
cbabtcbggg gggbggggbg gggcaggaca gcaaggggga ggatbgggaa gacaabagca 3900 
ggcatgctgg ggabgcggtg ggctcbatgg gtacccaggb gcbgaagaab bgacccggbb 3960 
ccbccbgggc cagaaagaag caggcacatc cccbbcbcbg tgacacaccc tgbccacgcc 4020 
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ccbggtbcbb agtfcccagcc ccactcatag gacactcata gctcaggagg gcbccgccbb 4080 
caatcccacc cgctaaagta cttggagcgg tcbcbcccbc cctcatcagc ccaccaaacc 4140 
aaacctagcc tccaagagtg ggaagaaatt aaagcaagat aggcbabtaa gtgcagaggg 4200 
agagaaaatg ccbccaacab gtgaggaagt aatgagagaa atcabagaab tttaaggcca 4260 
bgabbbaagg ccatcatggc cbbaabcbbc cgcbbccbcg cbcacbgacb cgctgcgcbc 4320 
ggbcgbbcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggbbabccac 43 80 
agaatcaggg gabaacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 4440 
ccgtaaaaag gccgcgtbgc tggcgttttt ccabaggctc cgcccccctg acgagcatca 4500 
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 4560 
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc bbaccggata 4620 
cctgtccgcc bbbcbcccbb cgggaagcgt ggcgctbbcb cabagcbcac gctgtaggta 4680 
tctcagttcg gbgbaggbcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 4740 
gcccgaccgc bgcgccbbab ccggtaacta tcgtcttgag tccaacccgg taagacacga 4800 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggb atgtaggcgg 4860 
tgcbacagag bbcbbgaagb ggtggcctaa ctacggctac actagaagaa cagtatttgg 4920 
tatctgcgct ctgchgaagc cagttacctt cggaaaaaga gbbggbagcb cbbgabccgg 4 980 
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 5040 
aaaaaaagga bcbcaagaag abcctbbgat cttttctacg gggbcbgacg ctcagtggaa 5100 
cgaaaactca cgttaaggga ttttggtcat gagabtabca aaaaggatct tcacctagat 5160 
ccbbtbaaab taaaaatgaa gttttaaatc aatctaaagfc atatatgagb aaacttggtc 5220 
bgacagbbac caabgctbaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 5280 
atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg aagaaggtgt 534 0 
tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga gggagccacg 5400 
gttgatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt gctttgccac 5460 
ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc htcaacfccag caaaagttcg 5520 
atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca gtgtfcacaac 5580 
caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg caatttattc 5640 
atatcaggat tatcaatacc atatttttga aaaagccgtb tctgtaatga aggagaaaac 5700 
tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat tccgactcgt 5760 
ccaacatcaa tacaacctat taatttcccc bcgtcaaaaa baaggbbabc aagbgagaaa 5820 
tcaccabgag bgacgactga abccggbgag aabggcaaaa gcbtabgcab btcbbbccag 5880 
acbtgtbcaa caggccagcc abtacgctcg bcabcaaaat cacbcgcabc aaccaaaccg 5940 
bbatbcatbc gbgabbgcgc cbgagcgaga cgaaabacgc gabcgcbgbb aaaaggacaa 6000 
bbacaaacag gaatcgaabg caaccggcgc aggaacacbg ccagcgcabc aacaababbb 6060 
bcaccbgaab caggabatbc bbcbaatacc bggaabgctg bbbtcccggg gatcgcagtg 6120 
gbgagbaacc atgcabcabc aggagtacgg ataaaabgct bgatggbcgg aagaggcaba 6180 
aabtccgbca gccagtbtag bcbgaccabc bcabcbgbaa catcabbggc aacgctaccb 6240 
tbgccabgtb bcagaaacaa cbctggcgca bcgggctbcc cabacaabcg abagabbgbc 6300 
gcaccbgatb gcccgacabb abcgcgagcc catbtatacc cababaaatc agcabccabg 6360 
bbggaabbta atcgcggcct cgagcaagac gbbbcccgbb gaababggcb cabaacaccc 6420 
ctbgbabbac bgtbbatgba agcagacagb bbbatbgbbc atgabgabab abbbbbabcb 6480 
tgbgcaabgb aacabcagag abbtbgagac acaacgbggc bbbccccccc cccccatbab 6540 
tgaagcabbb abcagggbba bbgbctcabg agcggabaca babbbgaabg babbbagaaa 6600 
aabaaacaaa baggggbbcc gcgcacabbb ccccgaaaag bgccaccbga cgbcbaagaa 6660 
accabbabba bcabgacabb aacctabaaa aabaggcgba bcacgaggcc cbbbcgtc 6718 



<210> 142 
<211> 6790 
<212>. DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
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<400> 142 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cbtaacbabg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctatfcggcca ttgcatacgt bgtatccaba tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacabbgatb attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc cababatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccahtgacg tcaataatga cgbabgbbcc 4 80 
catagtaacg ccaataggga cbbbccatbg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta tbgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
tbggcagbac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat ghcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgbgtacgg tgggaggtct atataagcag 900 
agcbcgbbba gtgaaccgtc agatcgcctg gagacgccat ccacgcbgtb bbgaccbcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccbtabgcba 114 0 
taggtgatgg babagctbag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tabbggctab atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tbbacaaabb cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tcbcgggbac gtgttccgga 1440 
cabgggctcb tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gbabtctgab aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgb bgctgccgcg 1800 
cgcgccacca gacataatag cbgacagacb aacagacbgb bccbbbccab gggtctbtbc 1860 
bgcagbcacc gbcgtcgaca cgbgbgatca gabatcgcgg ccgcbctaga gababcgcca 1920 
ccabgcgtgt bcgtggbatc ccgcgbaacb ggccgcagbg gtggabgbgg ggtabcctgg 1980 
gtbbcbggat gabcatcabc tgccgbgttg bbggtaacab gbgggttacc gbbtacbacg 2040 
gtgbbccggt tbggaccgac gcbaaaacca cccbgttcbg cgcbbccgac accaaagcct 2100 
acgaccgtga agttcacaac gbbbgggcba cccacgctbg cgtbccgacc gacccgaacc 2160 
cgcaggaaat cgbtcbggaa aacgbbaccg aaaactbcaa cabgbggaaa aacgacabgg 2220 
ttgaccagab gcacgaagac abcabctccc tgtgggacca gtcccbgaaa ccgtgcgbba 2280 
aacbgacccc gcbgbgcgtb accctgcacb gcaccaacgc bagccgtaac bgctccttca 2340 
acaccaccac cgaaabccgb gacaaaaaac agcagggtba cgcbcbgbbc taccgtccgg 2400 
acabcgttct gctgaaagaa aaccgbaaca acbccaacaa ctccgaatac abcctgabca 2460 
actgcaacgc bbccaccatc acccaggcbb gcccgaaagb baacbtcgac ccgabcccga 2520 
tccacbacbg cgctccggcb ggtbacgcta bccbgaaabg caacaacaaa accbbctccg 2580 
gbaaaggtcc gbgcaacaac gbttccaccg bbcagbgcac ccabggbabc aaaccggtbg 2640 
tbbccaccca gcbgcbgctg aacggbtccc bggctgaaaa agaaabcabc atccgttccg 2700 
aaaaccbgac cgacaacgtt aaaaccatca bcgbtcaccb gaacaaabcc gtbgaaatcg 2760 
tbbgcacccg bccgaacaac aacacccgta aatccafcgcg tatcggbccg ggbcagacct 2820 
tctacgcbac cggbgacatc atcggbgaca bccgbcaggc ttacbgcaac abctccggbb 2880 
ccaaabggaa cgaaaccctg aaacgbgtta aagaaaaacb gcaggaaaac tacaacaaca 2940 
acaaaaccat caaattcgcb ccgbccbccg gbggtgacct ggaaatcacc acccactcct 3000 
tcaactgccg bggbgaattc btcbacbgca acaccacccg tcbgbbcaac aacaacgcba 3060 
ccgaagacga aaccabcacc cbgccgtgcc gbabcaaaca gabcatcaac abgtggcagg 3120 
gbgbbggtcg bgctabgbac gctccgccga bcgcbggbaa cabcaccbgc aaatccaaca 3180 
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bcaccggtcb gctgctggtt cgtgacggtg gtgaagacaa caaaaccgaa gaaatcttcc 3240 
gtccgggtgg tggtaacatg aaagacaact ggcgbbccga acbgbacaaa tacaaagtta 3300 
tcgaactgaa accgctgggh atcgctccga ccggtgctaa gcttaccght caggcbcgbc 3360 
agctgctgtc ctccatcgtt cagcagcagt ccaacctgct gcgtgctatc gaagctcagc 3420 
agcacabgcb gcagctgacc gbbbggggba tcaaacagct gcagacccgt gtbcbggcba 3480 
tcgaacgtta ccfcgaaagac cagcagcbcg agatctggaa caacatgacc tggatggaat 3540 
gggaccgtga aatctccaac tacaccgaca ccafcctaccg tctgctggaa gactcccaga 3600 
cccagcagga aaaaaacgaa aaagaccbgc tggctctgga cbccbggaaa aacctgtggt 3660 
cctggttcga catctccaac bggcbgbggt acabcaaaab cbbcabcabg atcgthggtg 3720 
gtctgatcgg tctgcgtatc atcttcgctg bbcbgbccab ctgaggabcc agabctgctg 3780 
tgccbtcbag ttgccagcca tctgttgbtb gccccbcccc cgbgccbtcc tbgaccctgg 3840 
aaggtgccac bcccactgtc ctttccbaat aaaatgagga aabtgcatcg cabtgtcbga 3900 
gbaggtgtca btcbabbcbg gggggbgggg tggggcagga cagcaagggg gaggattggg 3960 
aagacaabag caggcabgct ggggabgcgg tgggcbcbab gggtacccag gtgctgaaga 4020 
abtgacccgg tbccbccbgg gccagaaaga agcaggcaca tccccttcbc tgtgacacac 4080 
cctgtccacg ccccbggbtc btagttccag ccccacbcab aggacacbca tagcbcagga 4140 
gggctccgcc btcaatccca cccgcbaaag tactbggagc ggbctctccc tcccbcabca 4200 
gcccaccaaa ccaaacctag cctccaagag fcgggaagaaa btaaagcaag ataggctatt 4260 
aagtgcagag ggagagaaaa tgcctccaac atgtgaggaa gbaatgagag aaabcabaga 4320 
atttbaaggc catgatbtaa ggccabcatg gccbtaatct bccgcttcct cgctcacbga 4380 
cbcgcbgcgc tcggtcgttc ggcbgcggcg agcggbatca gctcactcaa aggcggbaat 444 0 
a cggbbatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 4500 
aaaggccagg aaccgbaaaa aggccgcgtb gcbggcgtbt ttccataggc bccgcccccc 4560 
tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggacbata 4620 
aagataccag gcgtbtcccc ctggaagctc cctcgbgcgc bcbccbgttc cgaccctgcc 4680 
gcbbaccgga bacctgbccg ccbbbcbccc tbcgggaagc gtggcgcttt cbcabagcbc 4740 
acgcbgbagg babctcagtb cggtgtaggt cgtbcgctcc aagcbgggcb gbgbgcacga 4 800 
accccccgbb cagcccgacc gctgcgcctb abccggbaac babcgbcbtg agbccaaccc 4860 
ggtaagacac gacttabcgc cactggcagc agccacbggb aacaggabta gcagagcgag 4 920 
gtabgbaggc ggbgcbacag agtfccbbgaa gbggtggccb aacbacggcb acactagaag 4980 
aacagbatbb ggbabctgcg cbcbgcbgaa gccagttacc btcggaaaaa gagtbggbag 5040 
ctcbbgabcc ggcaaacaaa ccaccgcbgg tagcggbggb ttbtbbgttt gcaagcagca 5100 
gabbacgcgc agaaaaaaag gabcbcaaga agabccbbbg abcbbbtcba cggggbcbga 5160 
cgcbcagtgg aacgaaaact cacgbbaagg gabbbbggbc abgagatbat caaaaaggat 5220 
cbbcacctag abccbbbbaa attaaaaabg aagbbbbaaa bcaatcbaaa gbabatabga 5280 
gbaaacbbgg bcbgacagtt accaabgctb aabcagbgag gcacctatct cagcgabcbg 5340 
bcbabbbcgt tcabccabag bbgccbgacb cggggggggg gggcgctgag gbcbgcctcg 5400 
tgaagaaggb gtbgctgact cabaccaggc cbgaabcgcc ccabcabcca gccagaaagb 5460 
gagggagcca cggbbgabga gagcbbbgtb gbaggbggac cagtbggtga tbbbgaacbb 5520 
btgctbbgcc acggaacggt ctgcgttgbc gggaagabgc gbgatcbgab ccbtcaacbc 5580 
agcaaaagbb cgabbtabbc aacaaagccg ccgbcccgtc aagtcagcgt aabgcbctgc 5640 
cagbgbbaca accaabbaac caabtctgab bagaaaaacb cabcgagcab caaabgaaac 5700 
bgcaabbtab bcatabcagg abbatcaaba ccatabtbtb gaaaaagccg ttbcbgbaab 5760 
gaaggagaaa acbcaccgag gcagbbccab aggabggcaa gabccbggba tcggbcbgcg 5820 
abbccgacbc gbccaacabc aabacaaccb abbaabtfccc ccbcgbcaaa aabaaggbba 5880 
bcaagbgaga aabcaccabg agbgacgacb gaatccggbg agaatggcaa aagcbbabgc 594 0 
abtbcbtbcc agacttgtbc aacaggccag ccatbacgcb cgbcabcaaa abcacbcgca 6000 
bcaaccaaac cgbbatbcat bcgtgabbgc gccbgagcga gacgaaafcac gcgatcgcbg 6060 
bbaaaaggac aabbacaaac aggaabcgaa bgcaaccggc gcaggaacac bgccagcgca 6120 
tcaacaabab bbbcacctga abcaggabab bcbbcbaaba cctggaabgc bgbbbtcccg 6180 
gggabcgcag bggtgagbaa ccabgcabca bcaggagbac ggabaaaabg cbbgatggbc 6240 
ggaagaggca baaatbccgt cagccagbbt agbcbgacca bcbcabcbgb aacabcabtg 6300 
gcaacgcbac cbtbgccabg bbtcagaaac aacbcbggcg cabcgggcbb cccabacaab 6360 
cgabagabbg bcgcaccbga bbgcccgaca btabcgcgag cccabtbata cccababaaa 6420 
tcagcabcca bgtbggaabb baatcgcggc ctcgagcaag acgbbtcccg bbgaababgg 64 80 
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ctcataacac cccttgtatt actgtttatg taagcagaca gttttattgt tcatgatgat 6540 

atatttttat cttgtgcaat gtaacatcag agattttgag acacaacgtg gctttccccc 6600 

cccccccatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 6660 

tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 6720 

gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg tatcacgagg 6780 

ccctttcgtc 6790 



<210> 143 
<211> 6658 
<212> DNA 

<213> Artificial" Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 143 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 1B0 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tatfcaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttatfcg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggtatcctgg 1980 
gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggttacc gtttactacg 2040 
gtgttccggt ttggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 2220 
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ttgaccagat gcacgaagac atcatctccc tgbgggacca gbcccbgaaa ccgbgcgbba 2280 
aactgacccc gctgbgcgbb accctgcact gcaccaacgc tagcatcaac tgcaacgctt 2340 
ccaccatcac ccaggcttgc ccgaaagtta acttcgaccc gatcccgatc cactactgcg 2400 
ctccggctgg ttacgctatc ctgaaatgca acaacaaaac cttctccggt aaaggtccgb 2460 
gcaacaacgt ttccaccgtt cagtgcaccc abggbatcaa accggbbgbb tccacccagc 2520 
tgctgctgaa cggttccctg gctgaaaaag aaabcabcat ccgttccgaa aacctgaccg 2580 
acaacgbbaa aaccatcatc gttcacctga acaaatccgt tgaaatcgtt tgcacccgtc 2640 
cgaacaacaa cacccgtaaa tccatgcgta tcggtccggg bcagaccbbc tacgctaccg 2700 
gbgacatcab cggbgacatc cgbcaggcbb acbgcaacat cbccggtbcc aaabggaacg 2760 
aaaccctgaa acgbgtbaaa gaaaaactgc aggaaaacba caacaacaac aaaaccabca 2820 
aabbcgcbcc gbcctccggb ggbgacctgg aaabcaccac ccactccbbc aacbgccgbg 2880 
gbgaatbcbb cbacbgcaac accacccgbc bgbbcaacaa caacgcbacc gaagacgaaa 2940 
ccabcacccb gccgbgccgb atcaaacaga bcabcaacab gbggcagggb gbbggbcgbg 3000 
cbabgtacgc bccgccgabc gcbggbaaca bcaccbgcaa abccaacabc accggtcbgc 3060 
bgcbggbbcg bgacggtggb gaagacaaca aaaccgaaga aatcbbccgb ccgggtggbg 3120 
gbaacabgaa agacaactgg cgbbccgaac bgbacaaaba caaagbbabc gaacbgaaac 3180 
cgcbgggbab cgcbccgacc ggbgcbaagc btaccgbbca ggcbcgbcag cbgcbgbccb 3240 
ccabcgbbca gcagcagbcc aaccbgctgc gtgcbatcga agcbcagcag cacabgcbgc 3300 
agcbgaccgb bbggggtabc aaacagcbgc agacccgbgb bctggcbabc gaacgtbacc 33 60 
bgaaagacca gcagcbcgag abcbggaaca acabgaccbg gatggaabgg gaccgtgaaa 3420 
bcbccaacba caccgacacc abcbaccgbc bgcbggaaga cbcccagacc cagcaggaaa 3480 
aaaacgaaaa agaccbgcbg gctcbggacb ccbggaaaaa cctgbggbcc bggbbcgaca 3540 
bcbccaactg gcbgtggbac abcaaaabcb bcabcabgab cgttggbggb cbgabcggbc 3600 
bgcgbabcab cbbcgcbgbb cbgbccabcb gaggabccag abcbgcbgbg ccbbctagbb 3660 
gccagccabc bgbbgbbbgc cccbcccccg bgccbbccbb gacccbggaa ggbgccacbc 3720 
ccacbgtcct bbcctaataa aatgaggaaa btgcabcgca tbgbcbgagb aggbgtcabb 3780 
cbatbctggg gggtggggbg gggcaggaca gcaaggggga ggatbgggaa gacaatagca 3840 
ggcabgcbgg ggatgcggbg ggcbcbabgg gtacccaggb gctgaagaab bgacccggbb 3900 
ccbccbgggc cagaaagaag caggcacabc cccbbcbcbg bgacacaccc bgbccacgcc 3960 
ccbggtbcbb agbbccagcc ccacbcabag gacacbcaba gctcaggagg gcbccgccbb 4020 
caabcccacc cgcbaaagta cbtggagcgg bcbcbcccbc cctcabcagc ccaccaaacc 4080 
aaacctagcc bccaagagbg ggaagaaabb aaagcaagab aggcbabbaa gbgcagaggg 4140 
agagaaaabg ccbccaacab gbgaggaagb aabgagagaa abcatagaat bbbaaggcca 4200 
bgabbbaagg ccabcatggc cbbaabcbbc cgcbbccbcg cbcacbgacb cgcbgcgcbc 4260 
ggtcgbbcgg ctgcggcgag cggbabcagc bcacbcaaag gcggbaabac ggbbabccac 4320 
agaabcaggg gabaacgcag gaaagaacab gtgagcaaaa ggccagcaaa aggccaggaa 43 80 
ccgbaaaaag gccgcgtbgc tggcgbbtbb ccabaggcbc cgccccccbg acgagcabca 4440 
caaaaabcga cgctcaagbc agaggbggcg aaacccgaca ggactabaaa gabaccaggc 4 500 
gbbbccccct ggaagctccc tcgbgcgcbc bccbgbtccg accctgccgc bbaccggaba 4560 
ccbgbccgcc bbbctcccbb cgggaagcgb ggcgcbbbcb cabagcbcac gcbgbaggba 4620 
bcbcagbtcg gbgbaggbcg bbcgcbccaa gcbgggcbgb gbgcacgaac cccccgbbca 4680 
gcccgaccgc bgcgcctbab ccggbaacba bcgbcbbgag bccaacccgg baagacacga 4740 
cbtabcgcca cbggcagcag ccacbggtaa caggabtagc agagcgaggb abgbaggcgg 4800 
bgcbacagag bbcbtgaagb ggtggcctaa cbacggcbac acbagaagaa cagbabbbgg 4 860 
batcbgcgcb cbgctgaagc cagbbaccbb cggaaaaaga gbtggbagcb cbbgabccgg 4920 
caaacaaacc accgcbggba gcggbggbbb bbbtgbtbgc aagcagcaga bbacgcgcag 4980 
aaaaaaagga bcbcaagaag abcctbbgab cbbbbcbacg gggbcbgacg cbcagtggaa 5040 
cgaaaacbca cgttaaggga bbbbggbcab gagabbabca aaaaggabcb bcacctagab 5100 
ccbbbtaaab baaaaatgaa gbbbbaaabc aabcbaaagb abababgagb aaacbbggbc 5160 
bgacagbbac caatgctbaa bcagbgaggc accbabcbca gcgatcbgbc babbbcgbbc 5220 
abccabagtb gcctgacbcg gggggggggg gcgcbgaggb cbgccbcgbg aagaaggbgb 5280 
bgcbgacbca baccaggccb gaabcgcccc abcabccagc cagaaagbga gggagccacg 5340 
gbbgabgaga gcbbbgbbgb aggbggacca gbbggbgabb bbgaacbbbb gcbbbgccac 54 00 
ggaacggbcb gcgbbgtcgg gaagabgcgb gabcbgabcc bbcaactcag caaaagbbcg 5460 
atbbabbcaa caaagccgcc gbcccgbcaa gbcagcgbaa bgcbcbgcca gtgbbacaac 5520 
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caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg caatttattc 5580 

atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga aggagaaaac 5640 

tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat tccgactcgt 5700 

ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc aagtgagaaa 5760 

tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat ttctttccag 5820 

achtgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc aaccaaaccg 5880 

ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt aaaaggacaa 5940 

ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc aacaatattt 6000 

tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg gatcgcagtg 6060 

gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg aagaggcata 6120 

aattccgtca gccagtttag tctgaccatc tcatchgtaa catcattggc aacgctacct 6180 

ttgccatgtt tcagaaacaa' ctctggcgca tcgggcttcc catacaatcg atagattgtc 6240 

gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc agcatccatg 6300 

ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct cataacaccc 6360 

cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat atttttatct 6420 

tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc cccccattat 6480 

tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 6540 

aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtctaagaa 6600 

accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtc 6658 



<210> 144 
<211> 6577 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plaemid pVR1012x/s containing HIV genes 
<400> 144 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 144 0 
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catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggcbcat ggbcgcbcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 162 0 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgbb gtattctgat aagagtcaga ggbaacbccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgbgb tcgtggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggtabccbgg 1980 
gtttctggat gatcatcatc tgccgbgbbg ttggtaacat ghgggttacc gtttactacg 2040 
gtgttccggt bbggaccgac gctaaaacca cccbgtbcbg cgcttccgac accaaagcct 2100 
acgaccgbga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaab cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 2220 
ttgaccagat gcacgaagac abcabcbccc tgtgggacca gbccctgaaa ccgtgcgtta 2280 
aactgacccc gcbgtgcgbt accctgcact gcaccaacgc bagcatcaac tgcaacgctt 2340 
ccaccatcac ccaggcbbgc ccgaaagbba acttcgaccc gatcccgatc cactactgcg 2400 
ctccggctgg fctacgctatc ctgaaatgca acaacaaaac ctbcbccggb aaaggbccgb 2460 
gcaacaacgt ttccaccgtt cagtgcaccc atggtatcaa accggttgbb tccacccagc 2520 
tgctgctgaa cggbbcccbg gctgaaaaag aaatcatcat ccgttccgaa aacctgaccg 2580 
acaacgttaa aaccatcatc gttcacctga acaaabccgb tgaaatcgtt tgcacccgtg 264 0 
ctagcgctta ctgcaacatc tccggttcca aabggaacga aacccbgaaa cgtgtbaaag 2700 
aaaaactgca ggaaaacbac aacaacaaca aaaccatcaa ahtcgctccg tcctccggtg 2760 
gtgacctgga aatcaccacc cactccttca actgccgtgg bgaabbcbtc tactgcaaca 2820 
ccacccgtct gttcaacaac aacgctaccg aagacgaaac catcaccctg ccgtgccgta 2880 
tcaaacagat catcaacatg bggcagggbg tbggtcgtgc tatgtacgct ccgccgatcg 2940 
ctggtaacat cacctgcaaa bccaacabca ccggtctgct gctggttcgt gacggtggtg 3000 
aagacaacaa aaccgaagaa atcttccgtc cgggtggtgg taacatgaaa gacaactggc 3060 
gttccgaact gtacaaatac aaagttatcg aacbgaaacc gcbgggbabc gctccgaccg 3120 
gtgctaagct taccgttcag gctcgtcagc tgctgtcctc catcgttcag cagcagtcca 3180 
acctgctgcg tgctatcgaa gcbcagcagc acatgctgca gctgaccgtt tggggtatca 3240 
aacagctgca gacccgtgtt ctggctatcg aacgttacct gaaagaccag cagctcgaga 33 00 
tctggaacaa catgacctgg atggaatggg accgtgaaat ctccaactac accgacacca 3360 
tctaccgbcb gctggaagac tcccagaccc agcaggaaaa aaacgaaaaa gacctgctgg 3420 
ctctggactc ctggaaaaac cbgbggbccb ggttcgacab cbccaacbgg cbgbggbaca 3480 
bcaaaabcbb cabcabgabc gbtggbggtc bgatcggtcb gcgbabcabc tbcgcbgbbc 3540 
tgtccabctg aggabccaga bctgcbgbgc ctbctagbbg ccagccabcb gbbgtbbgcc 3600 
cctcccccgb gccbbccbtg accctggaag gbgccacbcc cacbgbccbb bcctaabaaa 3660 
abgaggaaat bgcabcgcat bgtcbgagba ggbgbcabbc babbcbgggg ggbggggtgg 3720 
ggcaggacag caagggggag gatbgggaag acaabagcag gcabgcbggg gabgcggbgg 3780 
gcbcbatggg bacccaggtg cbgaagaatb gacccggbtc cbcctgggcc agaaagaagc 3840 
aggcacatcc ccbbcbcbgb gacacacccb gtccacgccc cbggtbcbba gbtccagccc 3900 
cacbcabagg acacbcabag ctcaggaggg ctccgccbtc aatcccaccc gcbaaagtac 3960 
ttggagcggb cbcbcccbcc cbcabcagcc caccaaacca aaccbagccb ccaagagtgg 402 0 
gaagaaatba aagcaagaba ggcbabbaag bgcagaggga gagaaaatgc ctccaacabg 4080 
tgaggaagta atgagagaaa bcabagaabb bbaaggccab gabbtaaggc cabcabggcc 4140 
btaabcbbcc gcbbccbcgc bcactgactc gcbgcgcbcg gbcgbbcggc bgcggcgagc 4200 
ggbabcagcb cacbcaaagg cggtaatacg gtbatccaca gaabcagggg abaacgcagg 4260 
aaagaacabg bgagcaaaag gccagcaaaa ggccaggaac cgbaaaaagg ccgcgbtgct 4320 
ggcgtbbbbc cabaggcbcc gccccccbga cgagcatcac aaaaatcgac gcbcaagtca 4380 
gaggbggcga aacccgacag gactataaag ataccaggcg bbbcccccbg gaagcbccct 4440 
cgtgcgctcb cctgttccga cccbgccgct taccggabac cbgbccgcct bbcbcccbbc 4500 
gggaagcgtg gcgcbbtcbc atagcbcacg ctgbaggbab cbcagbbcgg bgbaggtcgb 4560 
bcgcbccaag ctgggcbgtg bgcacgaacc ccccgbbcag cccgaccgcb gcgccbtatc 4620 
cggbaacbab cgbcbbgagb ccaacccggb aagacacgac bbatcgccac bggcagcagc 4680 
cacbggtaac aggattagca gagcgaggba bgtaggcggb gcbacagagb bctbgaagbg 4740 
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gtggcctaac tacggctaca ctagaagaac agbatbbggb atctgcgctc tgctgaagcc 4800 

agbbaccbbc ggaaaaagag bbggtagcbc ttgatccggc aaacaaacca ccgctggtag 4 860 

cggbggbbbb tbbgtbbgca agcagcagab tacgcgcaga aaaaaaggat ctcaagaaga 4920 

tcctttgabc tbbbctacgg ggbctgacgc bcagtggaac gaaaactcac gttaagggat 4980 

tttggtcatg agattatcaa aaaggatctt cacctagatc cbbbbaaabt aaaaatgaag 5040 

ttttaaatca abctaaagba tatatgagta aacttggbcb gacagttacc aatgcttaat 5100 

cagtgaggca ccbatcbcag cgatctgtct atttcgttca tccatagttg cctgactcgg 5160 

gggggggggg cgcbgaggbc bgccbcgtga agaaggtgtt gctgactcat accaggccbg 5220 
aatcgcccca tcatccagcc agaaagtgag ggagccacgg tbgabgagag cbbtgtbgta 5280 

ggtggaccag ttggtgattt tgaacttttg ctttgccacg gaacggtctg cgttgtcggg 5340 
aagabgcgbg atctgatcct bcaacbcagc aaaagbbcga' ttbabbcaac aaagccgccg 5400 

bcccgbcaag tcagcgtaat gctctgccag tgttacaacc aattaaccaa ttctgattag 5460 

aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt abcaabacca 5520 

tatttttgaa aaagccgtbt ctgtaatgaa ggagaaaact caccgaggca gttccatagg 5580 

abggcaagab ccbggbatcg gtctgcgatt ccgactcgtc caacabcaat acaacctatt 5640 

aatttcccct cgtcaaaaat aaggttatca agtgagaaat caccatgagt gacgactgaa 5700 

tccggtgaga abggcaaaag cttatgcatt tctttccaga cttgttcaac aggccagcca 5760 

ttacgctcgt catcaaaatc actcgcatca accaaaccgt tattcattcg tgattgcgcc 5820 

tgagcgagac gaaatacgcg atcgctgtta aaaggacaat tacaaacagg aatcgaatgc 5880 

aaccggcgca ggaacactgc cagcgcatca acaatatbtb cacctgaatc aggatattct 5940 

tctaatacct ggaatgctgt tttcccgggg atcgcagtgg tgagtaacca tgcatcatca 6000 

ggagtacgga taaaatgctt gatggtcgga agaggcataa attccgtcag ccagbtbagb 6060 

ctgaccatct catctgtaac atcattggca acgctacctt tgccatgttt cagaaacaac 6120 

tctggcgcat cgggcttccc atacaatcga tagattgtcg cacctgattg cccgacatta 6180 

tcgcgagccc atttataccc atataaatca gcatccatgt tggaatttaa tcgcggcctc 6240 

gagcaagacg bbtcccgbbg aatatggctc ataacacccc tbgbabbact gtttatgtaa 6300 

gcagacagbb ttattgttca tgatgatata bttbtabcbb gbgcaabgba acatcagaga 6360 

ttttgagaca caacgtggct ttcccccccc ccccattatt gaagcattta tcagggttat 6420 

bgbcbcabga gcggabacab atttgaatgt atttagaaaa ataaacaaat aggggttccg 6480 

cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta 6540 

acctataaaa ataggcgtab cacgaggccc btbcgtc 6577 



<210> 145 
<211> 6538 
<212*> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 145 

tcgcgcgbtb cggbgabgac ggtgaaaacc bcbgacacat gcagctcccg gagacggbca 60 

cagcbbgbcb gbaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 

tbggcgggbg bcggggcbgg cbbaacbabg cggcatcaga gcagattgba cbgagagbgc 180 

accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 

ctabbggcca bbgcabacgt tgbabccaba bcabaababg bacabbbaba bbggcbcabg 300 

bccaacatta ccgccabgbb gacabbgabb abbgacbagb babtaabagb aabcaabbac 360 

ggggbcabta gttcabagcc cabatabgga gbbccgcgbb acataactba cggtaaabgg 420 

cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaabaatga cgtabgbbcc 480 

cabagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagbabb bacggbaaac 540 

bgcccacbtg gcagtacabc aagbgbabca babgccaagb acgcccccba bbgacgbcaa 600 

bgacggbaaa bggcccgccb ggcatbabgc ccagbacabg acctbabggg acbbbccbac 6.60 

bbggcagbac abctacgbab tagbcabcgc babbaccabg gbgabgcggb bbbggcagba 720 
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catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaabggg agtbtgbbbb ggcaccaaaa tcaacgggac bbbccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agcbcgbbba gtgaaccgtc agabcgccbg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcabbg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
bcbbabgcat gctatactgt bbbbggcttg gggcctabac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
batbggbgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctghccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccabbbabba bbbacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagbtbbb attaaacata gcgbgggabc tccacgcgaa tctcgggbac gtgttccgga 144 0 
catgggctct tctccggtag cggcggagct tccacatccg agcccbggbc ccabgccbcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaact ccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacabaatag ctgacagact aacagactgt bccbttccab gggbcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gababcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgbaact ggccgcagtg gtggatgtgg ggbabccbgg 1980 
gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggbbacc gtttactacg 204 0 
gbgtbccggb ttggaccgac gctaaaacca cccbgbbcbg cgcttccgac accaaagccb 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacbbcaa catgtggaaa aacgacatgg 2220 
ttgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgtta 22 80 
aactgacccc gctgtgcgbb accctgcact gcaccaacgc tagcatcaac tgcaacgctt 2340 
ccaccabcac ccaggcttgc ccgaaagtta acttcgaccc gatcccgatc cactactgcg 2400 
ctccggctgg bbacgcbabc ctgaaatgca acaacaaaac cttctccggt aaaggbccgb 2460 
gcaacaacgt bbccaccgbb cagtgcaccc atggtatcaa accggttgtt bccacccagc 2520 
tgctgctgaa cggttccctg gctgaaaaag aaatcatcat ccgttccgaa aacctgaccg 2580 
acaacgttaa aaccatcatc gttcacctga acaaatccgt bgaaabcgtb tgcacccgtg 2640 
ctagcgctta cbgcaacabc tccggttcca aabggaacga aaccctgaaa cgtgttaaag 2700 
aaaaactgca ggaaaactac aacaacaaca aaaccatcaa abbcgcbccg bccbccggbg 2760 
gtgacctgga aafccaccacc cacbcctbca acbgccgbgg tgaattcttc tactgcaaca 2820 
ccgctagcct gccgtgccgt atcaaacaga tcatcaacat gtggcagggt gtfcggtcgtg 2880 
ctatgtacgc tccgccgatc gctggtaaca tcacctgcaa atccaacatc accggtctgc 2940 
tgctggttcg tgacggtggt gaagacaaca aaaccgaaga aatcttccgt ccgggtggtg 3000 
gtaacatgaa agacaactgg cgttccgaac tgtacaaata caaagttatc gaactgaaac 3060 
cgctgggtat cgctccgacc ggtgctaagc ttaccgttca ggctcgtcag ctgctgtcct 3120 
ccatcgttca gcagcagtcc aacctgctgc gtgctatcga agctcagcag cacatgctgc 3180 
agctgaccgt ttggggtatc aaacagctgc agacccgtgt tctggctatc gaacgttacc 3240 
tgaaagacca gcagctcgag atctggaaca acatgacctg gatggaatgg gaccgtgaaa 3300 
tctccaacta caccgacacc atctaccgtc tgctggaaga cbcccagacc cagcaggaaa 3360 
aaaacgaaaa agaccbgctg gcbcbggact ccbggaaaaa ccbgbggbcc bggbtcgaca 3420 
bcbccaacbg gcbgtggbac atcaaaabcb bcabcabgat cgbbggbggb cbgatcggtc 3480 
tgcgbabcat cbbcgctgtt ctgtccatct gaggatccag atcbgcbgtg ccbtcbagtb 3540 
gccagccabc bgbtgbbbgc ccctcccccg bgccttcctt gacccbggaa ggbgccacbc 3600 
ccacbgbccb bbccbaabaa aabgaggaaa btgcatcgca tbgbcbgagb aggbgbcabb 3660 
cbabbctggg gggbggggbg gggcaggaca gcaaggggga ggabbgggaa gacaabagca 3720 
ggcabgcbgg ggabgcggbg ggctcbatgg gbacccaggt gctgaagaab tgacccggtt 3780 
ccbccbgggc cagaaagaag caggcacatc cccbtcbctg bgacacaccc bgbccacgcc 3840 
cctggttctt agbbccagcc ccacbcabag gacacbcaba gcbcaggagg gcbccgccbb 3900 
caatcccacc cgcbaaagba cbbggagcgg bcbcbcccbc ccbcatcagc ccaccaaacc 3960 
aaacctagcc bccaagagbg ggaagaaatb aaagcaagab aggcbabbaa gtgcagaggg 4 020 
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agagaaaatg cctccaacat gtgaggaagt aatgagagaa atcatagaat tttaaggcca 4080 
tgatttaagg ccatcatggc cbtaabctbc cgcttcctcg ctcactgact cgctgcgctc 4140 
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaabac ggttatccac 4200 
agaatcaggg gabaacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 4260 
ccgbaaaaag gccgcgbtgc bggcgbbtbb ccabaggcbc cgcccccctg acgagcatca 4320 
caaaaabcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 4380 
gbbtcccccb ggaagcbccc tcgtgcgcfcc tcctgttccg accctgccgc ttaccggata 4440 
ccbgbccgcc bfctcbccctb cgggaagcgt ggcgcbbbct catagcbcac gctgtaggta 4500 
tctcagttcg gbgbaggbcg bbcgcbccaa gcbgggcbgb gtgcacgaac cccccgttca 4560 
gcccgaccgc bgcgccbbab ccggtaacta bcgbcbbgag tccaacccgg taagacacga 4620 
cttatcgcca cbggcagcag ccactggtaa caggathagc agagcgaggt atgtaggcgg 4 680 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg 4740 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 4 800 
caaacaaacc accgctggta gcggtggttt bbbtgbbtgc aagcagcaga ttacgcgcag 4860 
aaaaaaagga tctcaagaag atcctttgat cbtbtctacg gggtctgacg ctcagtggaa 4920 
cgaaaactca cgbbaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 4980 
ccttttaaat taaaaatgaa gbtbbaaabc aatctaaagt abababgagb aaacbbggbc 5040 
bgacagbbac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 5100 
atccatagtt gcctgactcg gggggggggg gcgctgaggh cbgccbcgbg aagaaggtgt 5160 
tgctgachca taccaggcct gaatcgcccc atcatccagc cagaaagtga gggagccacg 5220 
gttgatgaga gcbbbgtbgt aggtggacca gttggtgatt bbgaacbbbt gctttgccac 5280 
ggaacggbcb gcgbbgtcgg gaagatgcgt gatctgatcc ttcaactcag caaaagttcg 534 0 
atbbatbcaa caaagccgcc gbcccgbcaa gtcagcgtaa tgctctgcca gtgttacaac 5400 
caattaacca attctgatfca gaaaaactca tcgagcatca aatgaaactg caatttattc 5460 
atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga aggagaaaac 5520 
tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat tccgactcgt 5580 
ccaacatcaa tacaacctafc taabttcccc tcgbcaaaaa taaggtbatc aagbgagaaa 5640 
bcaccabgag bgacgacbga abccggbgag aabggcaaaa gctbatgcab ttcttbccag 5700 
acbtgbbcaa caggccagcc abbacgcbcg bcabcaaaab cacbcgcabc aaccaaaccg 5760 
btabbcabbc gtgatbgcgc cbgagcgaga cgaaabacgc gatcgcbgbb aaaaggacaa 5820 
bbacaaacag gaabcgaatg caaccggcgc aggaacacbg ccagcgcabc aacaababbb 5880 
bcaccbgaat caggababbc tbctaabacc bggaabgcbg ttttcccggg gabcgcagbg 5940 
gfcgagbaacc abgcabcabc aggagbacgg abaaaabgct bgabggbcgg aagaggcaba 6000 
aabtccgbca gccagbtbag tcbgaccabc bcabctgtaa cabcatbggc aacgctaccb 6060 
bbgccabgbb bcagaaacaa cbcbggcgca bcgggcbbcc cabacaabcg abagabtgtc 6120 
gcaccbgatb gcccgacatb abcgcgagcc catfctatacc cabatiaaabc agcatccatg 6180 
btggaatbba abcgcggcct cgagcaagac gtttcccgbb gaatabggcb cabaacaccc 6240 
cbbgtabbac tgttbabgba agcagacagb tbbatbgbbc abgabgabat atbbtbabct 6300 
bgbgcaabgt aacabcagag abbtbgagac acaacgbggc tbbccccccc cccccabbat 6360 
bgaagcabbb atcagggbba tbgtcbcabg agcggabaca tabbbgaabg babbtagaaa 6420 
aabaaacaaa baggggtbcc gcgcacabtb ccccgaaaag tgccaccbga cgtcbaagaa 6480 
accabtabba bcabgacabt aaccbabaaa aabaggcgta bcacgaggcc cbtbcgbc 6538 



<210> 146 
<211> 671B 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 146 

bcgcgcgbbt cggbgabgac ggbgaaaacc tcbgacacab gcagcbcccg gagacggbca 60 
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cagcttgtct gtaagcggat gccgggagca 
ttggcgggtg tcggggctgg cbbaactatg 
accatatgcg gbgbgaaata ccgcacagat 
ctattggcca ttgcatacgt bgbabccata 
tccaacatta ccgccatgtt gacattgatt 
ggggtcatfca gtbcabagcc catatatgga 
cccgcctggc tgaccgccca acgacccccg 
catagtaacg ccaataggga cbttccabtg 
tgcccacttg gcagtacatc aagtgtatca 
tgacggtaaa tggcccgcct ggcattatgc 
ttggcagbac atctacgtat tagtcatcgc 
catcaatggg cgbggatagc ggtttgactc 
cgtcaabggg agbbtgtbbb ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agctcgttta gtgaaccgtc agatcgcctg 
tagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagtgacg baagtaccgc 
tcttatgcab gctatactgt btbtggcbbg 
baggtgabgg tatagcttag cctataggtg 
batbggtgac gatactttcc attactaatc 
tattggctat atgccaabac tctgtccttc 
ggatggggtc ccatbbabba tttacaaatt 
cgcagttttt attaaacata gcgbgggabc 
catgggctct tctccggtag cggcggagct 
agcggctcat ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgbgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gttgcggtgc bgbbaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
bgcagbcacc gtcgtcgaca cgtgtgatca 
ccatgcgtgt tcgtggtatc ccgcgtaact 
gtttctggat gatcatcatc tgccgtgtbg 
gbgbtccggb bbggaccgac gctaaaacca 
acgaccgbga agbbcacaac gtbtgggcta 
cgcaggaaat cgtbcbggaa aacgtbaccg 
tbgaccagab gcacgaagac abcatctccc 
aactgacccc gctgbgcgtt acccbgcacb 
ccaacgacat gaacaaagaa atccgbaact 
ccaccatcac ccaggcbbgc ccgaaagtta 
cbccggcbgg ttacgctatc cbgaaatgca 
gcaacaacgt btccaccgtt cagbgcaccc 
bgcbgcbgaa cggbbcccbg gctgaaaaag 
acaacgtbaa aaccatcabc gbtcaccbga 
cgaacaacaa cacccgbaaa bccabgcgba 
gbgacabcab cggtgacabc cgtcaggctb 
aaaccctgaa acgbgttaaa gaaaaacbgc 
aatbcgctcc gtcctccggb ggbgaccbgg 
gbgaabbcbb ctactgcaac accacccgbc 
ccabcacccb gccgbgccgt abcaaacaga 
ctabgbacgc bccgccgatc gcbggtaaca 
bgctggttcg bgacggbggt gaagacaaca 
gbaacatgaa agacaactgg cgbbccgaac 
cgcbgggbab cgcbccgacc ggbgctaagc 
ccabcgbbca gcagcagtcc aaccbgctgc 



gacaagcccg bcagggcgcg bcagcgggbg 12 0 
cggcabcaga gcagabtgba ctgagagbgc 180 
gcgbaaggag aaaataccgc abcagatbgg 240 
tcabaatabg tacahttata ttggctcatg 300 
abtgactagb batbaatagt aatcaabbac 360 
gbtccgcgtb acataacbba cggbaaabgg 420 
cccabtgacg bcaataabga cgtabgbtcc 480 
acgbcaabgg gtggagtabb tacggbaaac 540 
tatgccaagt acgcccccba ttgacgbcaa 600 
ccagbacabg accttatggg acbbbcctac 660 
tatbaccatg gbgabgcggt tttggcagta 720 
acggggatbb ccaagbctcc accccatbga 780 
bcaacgggac tbtccaaaat gtcgbaacaa 840 
gcgtgtacgg tgggaggtct abataagcag 900 
gagacgccab ccacgcbgbt ttgaccbcca 960 
cggccgggaa cggbgcattg gaacgcggat 1020 
cbatagacbc tabaggcaca ccccbbbggc 1080 
gggcctabac acccccgcbt ccbbatgcta 1140 
tgggbbatbg accattabbg accactcccc 1200 
cabaacabgg cbcbtbgcca caactabctc 1260 
agagactgac acggactcbg battttbaca 1320 
cacababaca acaacgccgt cccccgbgcc 1380 
tccacgcgaa bctcgggbac gtgttccgga 1440 
bccacabccg agcccbggtc ccabgccbcc 1500 
ctccbaacag bggaggccag acbbaggcac 1560 
cacaaggccg bggcggbagg gbabgbgbcb 1620 
gcbgacgcag atggaagacb baaggcagcg 1680 
gtabtctgat aagagbcaga ggbaactccc 174 0 
gtagtcbgag cagbactcgt tgcbgccgcg 1800 
aacagactgt bccbbbccat gggbcbtbbc 1860 
gatatcgcgg ccgcbctaga gabatcgcca 1920 
ggccgcagbg gtggabgbgg ggtabcctgg 1980 
bbggbaacab gtgggbbacc gtbtactacg 2040 
cccbgbtcbg cgcbtccgac accaaagcct 2100 
cccacgctbg cgbbccgacc. gacccgaacc 2160 
aaaacbtcaa cabgtggaaa aacgacabgg 2220 
bgtgggacca gbcccbgaaa ccgbgcgbba 2280 
gcaccaacgc baccbtcaaa aacaacgtta 2340 
gcbccttcgc abgcabcaac tgcaacgcbb 2400 
actbcgaccc gabcccgabc cacbacbgcg 2460 
acaacaaaac cbbcbccggb aaaggbccgt 2520 
abggbabcaa accggbbgbb tccacccagc 2580 
aaatcatcab ccgbbccgaa aaccbgaccg 2640 
acaaabccgt tgaaatcgtb tgcacccgbc 2700 
bcggbccggg tcagaccbbc tacgcbaccg 2760 
acbgcaacab cbccggbbcc aaabggaacg 2820 
aggaaaacba caacaacaac aaaaccabca 2880 
aaabcaccac ccacbccttc aacbgccgbg 2940 
tgbbcaacaa caacgcbacc gaagacgaaa 3000 
tcabcaacab gtggcagggt gbfcggtcgbg 3 060 
bcaccbgcaa atccaacabc accggtcbgc 3120 
aaaccgaaga aatcbtccgb ccgggtggbg 3180 
bgbacaaaba caaagttatc gaactgaaac 3240 
bbaccgtbca ggcbcgbcag ctgcbgbccb 3300 
gtgcbabcga agcbcagcag cacatgcbgc 3360 
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agctgaccgt ttggggtatc aaacagctgc agacccgtgt bctggcbabc gaacgttacc 3420 
tgaaagacca gcagctcgag atctggaaca acatgacctg gatggaatgg gaccgtgaaa 3480 
tctccaacta caccgacacc atctaccgtc tgctggaaga ctcccagacc cagcaggaaa 3540 
aaaacgaaaa agacctgcfcg gcbcbggacb cctggaaaaa cctgtggtcc tggttcgaca 3600 
bcbccaacbg gctgtggtac atcaaaatct tcatcatgat cgttggbggb ctgatcggtc 3660 
tgcgtatcat cbbcgcbgtb cbgtccabcb gaggatccag atctgctgbg ccttctagtt 3720 
gccagccatc tgttgtttgc cccbcccccg bgccbbccbb gaccctggaa ggtgccactc 3780 
ccacbgbccb bbccbaabaa aatgaggaaa bbgcabcgca bbgbctgagb aggbgbcabb 3840 
cbabbcbggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca 3900 
ggcatgctgg ggatgcggtg ggctctatgg gtacccaggb gcbgaagaat bgacccggtt 3360 
ccbccbgggc cagaaagaag caggcacatc cccttctctg tgacacaccc bgbccacgcc 4020 
ccbggbbcbb agbbccagcc ccacbcabag gacacbcaba gcbcaggagg gcbccgccbt 4080 
caabcccacc cgcbaaagba cbbggagcgg bcbctcccbc ccbcabcagc ccaccaaacc 4140 
aaaccbagcc bccaagagbg ggaagaaabb aaagcaagat aggctabbaa gtgcagaggg 4200 
agagaaaatg ccbccaacat gtgaggaagb aabgagagaa abcabagaab bbtaaggcca 4260 
bgabtbaagg ccatcabggc ctbaatcbbc cgcbtccbcg cbcacbgact cgcbgcgcbc 4320 
ggbcgbbcgg cbgcggcgag cggbabcagc bcactcaaag gcggtaabac ggttatccac 4380 
agaatcaggg gabaacgcag gaaagaacab gbgagcaaaa ggccagcaaa aggccaggaa 4440 
ccgbaaaaag gccgcgbtgc bggcgbbtbt ccabaggcbc cgccccccbg acgagcabca 4500 
caaaaabcga cgcbcaagbc agaggbggcg aaacccgaca ggacbabaaa gataccaggc 4560 
gbbbcccccb ggaagcbccc bcgbgcgcbc tccbgbbccg acccbgccgc tbaccggata 4620 
ccbgbccgcc tbbcbcccbt cgggaagcgt ggcgctbtcb cabagcbcac gcbgbaggba 4680 
bctcagbbcg gbgtaggtcg bbcgcbccaa gcbgggctgb gtgcacgaac cccccgbtca 4740 
gcccgaccgc bgcgccbbab ccggbaacba bcgbcbbgag bccaacccgg baagacacga 4800 
cbtabcgcca ctggcagcag ccacbggtaa caggabbagc agagcgaggt abgtaggcgg 4860 
tgcbacagag bbctbgaagb ggbggccbaa cbacggctac acbagaagaa cagtatbbgg 4920 
tabcbgcgct cbgctgaagc cagbbaccbt cggaaaaaga gbbggbagcb cbbgabccgg 4980 
caaacaaacc accgctggba gcggbggtbfc bbbbgtbbgc aagcagcaga tbacgcgcag 5040 
aaaaaaagga bcbcaagaag abccbbbgab cbtttcbacg gggbcbgacg ctcagtggaa 5100 
cgaaaacbca cgbtaaggga bbbbggbcab gagattabca aaaaggatcb bcacctagab 5160 
ccbttbaaab baaaaabgaa gbbbtaaabc aabcbaaagb abababgagb aaactbggbc 5220 
bgacagbtac caatgcbbaa bcagbgaggc accbabcbca gcgabcbgbc babbbcgbbc 5280 
abccabagbb gccbgacbcg gggggggggg gcgctgaggb cbgccbcgtg aagaaggbgb 5340 
bgcbgacbca baccaggccb gaabcgcccc atcabccagc cagaaagtga gggagccacg 5400 
gbtgatgaga gctbbgbbgb aggtggacca gttggbgabb bbgaacbbbb gctbbgccac 5460 
ggaacggbcb gcgbbgtcgg gaagabgcgb gabcbgabcc bbcaacbcag caaaagtbcg 5520 
abbtabbcaa caaagccgcc gbcccgbcaa gbcagcgbaa tgcbctgcca gbgbtacaac 5580 
caabbaacca abbcbgabba gaaaaacbca tcgagcabca aabgaaacbg caabbbabbc 5640 
atabcaggab babcaabacc ababbbtbga aaaagccgbb tcbgbaabga aggagaaaac 5700 
bcaccgaggc agbbccabag gabggcaaga bccbggtabc ggtcbgcgab bccgacbcgt 5760 
ccaacabcaa bacaaccbab baabbbcccc bcgbcaaaaa baaggbbabc aagbgagaaa 5820 
bcaccabgag bgacgacbga abccggbgag aabggcaaaa gcbbabgcab bbcbtbccag 5880 
acbbgbbcaa caggccagcc abbacgctcg bcabcaaaab cacbcgcabc aaccaaaccg 5940 
bbabbcabtc gbgatbgcgc cbgagcgaga cgaaabacgc gabcgcbgbb aaaaggacaa 6000 
bbacaaacag gaabcgaabg caaccggcgc aggaacacbg ccagcgcabc aacaababbb 6060 
bcaccbgaab caggababbc bbcbaabacc bggaabgcbg btbbcccggg gabcgcagbg 6120 
gbgagbaacc abgcabcabc aggagbacgg abaaaabgct bgabggbcgg aagaggcaba 6180 
aabbccgbca gccagbbbag bcbgaccabc bcabcbgbaa cabcabbggc aacgcbaccb 6240 
tbgccabgbb bcagaaacaa cbcbggcgca bcgggcbbcc cabacaabcg abagabbgbc 6300 
gcaccbgabb gcccgacabb abcgcgagcc cabbbabacc cababaaabc agcabccabg 6360 
bbggaabbba abcgcggccb cgagcaagac gbbbcccgbt gaababggcb cabaacaccc 6420 
cbbgtabbac bgbbbabgba agcagacagb bbtabbgtbc atgatgabab abbbtbabcb 6480 
bgbgcaatgb aacabcagag abtbbgagac acaacgbggc btbccccccc cccccabbab 6540 
bgaagcabbb abcagggtba bbgtcbcabg agcggabaca babbbgaabg babbbagaaa 6600 
aabaaacaaa taggggbbcc gcgcacabbb ccccgaaaag bgccaccbga cgbcbaagaa 6660 
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accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtc 6718 



<210> 147 
<211> 6637 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 147 

tcgcgcgttt cggtgatgac ggtgaaaacc tcfcgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 3 00 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 42 0 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 72 0 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 13 80 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 174 0 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggtatcctgg 1980 
gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggttacc gtttactacg 2040 
gtgttccggt ttggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 222 0 
tbgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgtta 2280 
aactgacccc gctgtgcgtt accctgcact gcaccaacgc taccttcaaa aacaacgtta 2340 
ccaacgacat gaacaaagaa atccgtaact gctccttcgc atgcatcaac tgcaacgctt 2400 
ccaccatcac ccaggcttgc ccgaaagtta acttcgaccc gatcccgatc cactactgcg 2460 
ctccggctgg ttacgctatc ctgaaatgca acaacaaaac cttctccggt aaaggtccgt 2520 
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gcaacaacgt ttccaccgtt cagtgcaccc atggtatcaa accggttgtt tccacccagc 2580 
tgctgctgaa cggttccctg gctgaaaaag aaabcabcab ccgttccgaa aacctgaccg 2640 
acaacgttaa aaccatcatc gbbcaccbga acaaatccgb tgaaatcgtt tgcacccgtg 2700 
ctagcgctta cbgcaacabc tccggttcca aatggaacga aaccctgaaa cgtgttaaag 2.760 
aaaaactgca ggaaaactac aacaacaaca aaaccatcaa attcgctccg tcctccggtg 2820 
gtgacctgga aatcaccacc cacbccbbca actgccgtgg tgaattcttc bacbgcaaca 2880 
ccacccgbcb gttcaacaac aacgctaccg aagacgaaac catcaccctg ccgtgccgta 2940 
bcaaacagab cabcaacatg bggcagggbg bbggbcgbgc batgtacgct ecgccgatcg 3 000 
ctggtaacat caccbgcaaa tccaacatca ccggtctgct gctggttcgt gacggbggbg 3060 
aagacaacaa aaccgaagaa atcttccgtc cgggtggtgg taacatgaaa gacaactggc 3120 
gbbccgaacb gtacaaatac aaagtbabcg aacbgaaacc gcbgggbabc gctccgaccg 3180 
gtgctaagct taccgttcag gcfccgtcagc tgcbgtcctc catcgttcag cagcagtcca 3240 
acctgctgcg tgcbatcgaa gcbcagcagc acatgcbgca gcbgaccgbt tggggbatca 3300 
aacagcfcgca gacccgbgtb cbggcbatcg aacgttacct gaaagaccag cagcbcgaga 3360 
tctggaacaa catgacctgg abggaabggg accgbgaaat ctccaacbac accgacacca 3420 
bctaccgbct gcbggaagac tcccagaccc agcaggaaaa aaacgaaaaa gacctgctgg 3480 
ctctggactc cbggaaaaac cbgtggbccb ggbbcgacab ctccaacbgg cbgbggbaca 3540 
tcaaaabctt catcabgatc gbtggbggbc tgabcggtcb gcgtatcatc bbcgcbgtbc 3600 
tgtccabctg aggabccaga tcbgcbgbgc cttcbagtbg ccagccatct gbbgbbbgcc 3660 
cctcccccgt gccbbccbtg acccbggaag gtgccacbcc cactgbccbt bccbaabaaa 3720 
atgaggaaat tgcabcgcat tgtcbgagba ggtgbcabbc batbcbgggg ggbggggtgg 3780 
ggcaggacag caagggggag gabbgggaag acaabagcag gcabgcbggg gabgcggtgg 3840 
gcbctabggg bacccaggtg ctgaagaatt gacccggtbc cbcctgggcc agaaagaagc 3900 
aggcacabcc cctbcfccbgb gacacaccct gbccacgccc cbggttctba gtbccagccc 3960 
cactcabagg acacbcabag ctcaggaggg cbccgcctbc aatcccaccc gctaaagbac 4020 
ttggagcggt ctctcccbcc ctcabcagcc caccaaacca aaccbagccb ccaagagbgg 4080 
gaagaaabba aagcaagata ggcbabbaag tgcagaggga gagaaaatgc cbccaacabg 414 0 
tgaggaagba abgagagaaa bcabagaabt tbaaggccab gatbbaaggc cabcabggcc 4200 
ttaatcbbcc gcbtccbcgc bcacbgacbc gcbgcgcbcg gtcgbtcggc bgcggcgagc 42 60 
ggtabcagct cactcaaagg cggtaabacg gtbabccaca gaatcagggg abaacgcagg 4320 
aaagaacabg bgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgtbgcb 43 80 
ggcgtbbbtc cabaggcbcc gcccccctga cgagcabcac aaaaabcgac gcbcaagbca 4440 
gaggbggcga aacccgacag gacbabaaag abaccaggcg bbtcccccbg gaagcbcccb 4500 
cgbgcgcbct ccbgbbccga cccbgccgct baccggatac cbgbccgccb bbctccctbc 4560 
gggaagcgbg gcgcbbbctc abagcbcacg cbgtaggtab cbcagbbcgg bgbaggtcgb 4620 
tcgcbccaag cbgggcbgbg bgcacgaacc ccccgbbcag cccgaccgcb gcgccttabc 4680 
cggtaactab cgbcttgagb ccaacccggb aagacacgac bbabcgccac tggcagcagc 4740 
cacbggbaac aggabtagca gagcgaggba tgbaggcggb gcbacagagb bcbbgaagbg 4800 
gbggccbaac bacggcbaca cbagaagaac agbabtbggb abcbgcgctc bgctgaagcc 4860 
agbbaccbbc ggaaaaagag tbggbagctc bbgabccggc aaacaaacca ccgcbggbag 4920 
c 99tggbbbb bbbgbbbgca agcagcagab bacgcgcaga aaaaaaggat ctcaagaaga 4980 
tccbbbgabc bbbbcbacgg ggtctgacgc bcagbggaac gaaaacbcac gttaagggab 5040 
bbbggbcabg agabbabcaa aaaggabcbb caccbagabc cbtbbaaabb aaaaatgaag 5100 
btbbaaatca abctaaagba batabgagba aacbbggtcb gacagbbacc aabgcbbaab 5160 
cagbgaggca ccbabcbcag cgabcbgtcb abtbcgbtca bccabagbtg ccbgacbcgg 5220 
9999999999 cgctgaggbc bgccbcgtga agaaggbgbb gctgacbcab accaggccbg 5280 
aabcgcccca bcatccagcc agaaagbgag ggagccacgg bbgabgagag ctbbgttgta 5340 
ggbggaccag bbggbgabbb bgaacbbtbg cbtbgccacg gaacggbcbg cgbbgtcggg 5400 
aagabgcgbg abcbgabccb bcaacbcagc aaaagttcga bbbabbcaac aaagccgccg 5460 
bcccgbcaag bcagcgbaab gcbcbgccag bgbbacaacc aabbaaccaa btcbgabtag 5520 
aaaaacbcab cgagcabcaa abgaaactgc aabbbabbca babcaggabb atcaatacca 5580 
babfcbbtgaa aaagccgbbt cbgbaabgaa ggagaaaacb caccgaggca gtbccabagg 5640 
abggcaagab ccbggbabcg gbcbgcgabt ccgacbcgbc caacatcaab acaaccbabb 5700 
aabbbccccb cgtcaaaaat aaggbbabca agbgagaaab caccabgagb gacgacbgaa 5760 
bccggbgaga abggcaaaag cbbabgcabb bcbbbccaga cbbgbbcaac aggccagcca 5820 
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ttacgctcgt catcaaaatc actcgcatca accaaaccgt tattcattcg tgattgcgcc 5880 

bgagcgagac gaaatacgcg abcgcbgbba aaaggacaat tacaaacagg aatcgaatgc 5940 

aaccggcgca ggaacacbgc cagcgcafcca acaababtbb cacctgaatc aggatahtcb 6000 

tctaatacct ggaatgctgt tttcccgggg atcgcagtgg tgagtaacca tgcatcatca 6060 

ggagtacgga taaaatgctt gatggtcgga agaggcabaa attccgtcag ccagtttagt 6120 

ctgaccabcb cabctgtaac abcabbggca acgctacctb bgccatgtbt cagaaacaac 6180 

bcbggcgcab cgggcbtccc abacaabcga bagatbgbcg caccbgattg cccgacabba 6240 

tcgcgagccc atttataccc atabaaatca gcabccabgb bggaatttaa bcgcggcctc 63 00 

gagcaagacg bbtcccgbtg aababggctc ataacacccc ttgbattacb gttbafcgtaa 6360 

gcagacagtb bbabtgtbca tgabgatata btbttatcbb gbgcaabgba acabcagaga 6420 

bbtbgagaca caacgbggcb bbcccccccc ccccabtabt gaagcabbba bcagggtbab 6480 

tgtcbcabga gcggabacab abbbgaabgb abbbagaaaa abaaacaaab aggggbbccg 6540 

cgcacabtbc cccgaaaagb gccacctgac gbcbaagaaa ccabbabbab cabgacabba 6600 

accbabaaaa abaggcgbab cacgaggccc tttcgbc 6637 



<210> 148 
<211> 6598 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 148 

bcgcgcgbbb cggbgabgac ggbgaaaacc tcbgacacat gcagcbcccg gagacggtca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg tcagggcgcg bcagcgggbg 12 0 
bbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgta cbgagagbgc 180 
accatabgcg gtgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc atcagabbgg 24 0 
ctabtggcca tbgcabacgt tgbabccaba bcataababg bacabbbaba bbggcbcabg 300 
bccaacabba ccgccabgbt gacabbgabb abbgacbagt babbaatagb aabcaatbac 360 
ggggbcabba gbbcabagcc cababatgga gtbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabtgacg bcaataabga cgtabgbbcc 480 
cabagtaacg ccaabaggga ctbbccabbg acgbcaatgg gtggagtatb bacggbaaac 540 
bgcccacbbg gcagbacabc aagbgbatca batgccaagb acgcccccba tbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagtacatg accbbabggg acbbbccbac 660 
bbggcagtac abcbacgbab bagbcabcgc babbaccabg gbgabgcggb bbbggcagba 720 
cabcaabggg cgbggatagc ggbbbgacbc acggggabbb ccaagbcbcc accccabbga 780 
cgtcaabggg agbbbgbbbb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggtcb ababaagcag 900 
agcbcgbtba gtgaaccgbc agabcgcctg gagacgccat ccacgcbgbb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcatbg gaacgcggab 1020 
bccccgbgcc aagagbgacg baagbaccgc cbatagactc bataggqaca ccccttbggc 1080 
bcbbabgcab gcbabacbgb bbbbggcbbg gggccbabac acccccgcbb ccbbabgcba 1140 
baggbgatgg babagcbbag ccbabaggbg bgggbbabbg accabbabbg accacbcccc 1200 
babbggbgac gabacbbbcc abbacbaabc cabaacabgg cbcbbbgcca caacbabctc 1260 
babbggcbab abgccaabac bcbgbccbbc agagacbgac acggacbcbg babbbbbaca 1320 
ggabggggtc ccabbbabba bbtacaaabb cacabataca acaacgccgt cccccgbgcc 1380 
cgcagtbbbb abbaaacaba gcgbgggabc bccacgcgaa bctcgggbac gbgbbccgga 1440 
cabgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggbc ccatgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbcctbg cbccbaacag bggaggccag acbtaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgbcb 1620 
gaaaabgagc gbggagatbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gbabtctgab aagagbcaga ggbaacbccc 1740 
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gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagacfcgt tcctttccat gggtctbtbc 1860 
tgcagtcacc gbcgtcgaca cgtgtgatca gatatcgcgg ccgcbcbaga gatatcgcca 1920 
ccatgcgtgb tcgtggtatc ccgcgbaacb ggccgcagtg gbggatgbgg ggbabccbgg 1980 
gbbtctggab gatcatcatc tgccgtgttg bbggtaacat gtgggttacc gtttactacg 2040 
gbgtbccggb ttggaccgac gctaaaacca cccbgbbcbg cgcttccgac accaaagcct 2100 
acgaccgtga agtbcacaac gtbbgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacbtcaa catgtggaaa aacgacatgg 2220 
tbgaccagat gcacgaagac atcatctccc tgtgggacca ghccctgaaa ccgtgcgtfca 22 80 
aactgacccc gctgtgcgtt acccbgcact gcaccaacgc baccttcaaa aacaacgtta 2340 
ccaacgacab gaacaaagaa atccgtaacb gctccbbcgc atgcabcaac tgcaacgctb 2400 
ccaccatcac ccaggctfcgc ccgaaagtta acbtcgaccc gabcccgatc cactacbgcg 2460 
ctccggctgg ttacgcfcabc cbgaaatgca acaacaaaac ctbctccggb aaaggbccgb 2520 
gcaacaacgb ttccaccgbb cagtgcaccc abggtatcaa accggbtgtt tccacccagc 2580 
tgctgcbgaa cggbtcccbg gcbgaaaaag aaatcatcab ccgtbccgaa aaccbgaccg 2640 
acaacgbbaa aaccafccabc gtbcaccbga acaaabccgb bgaaabcgbt bgcacccgtg 2700 
cbagcgctta ctgcaacabc bccggbbcca aatggaacga aacccbgaaa cgtgbbaaag 2760 
aaaaacbgca ggaaaactac aacaacaaca aaaccatcaa atbcgctccg bccbccggbg 2820 
gbgaccbgga aatcaccacc cacbcctbca actgccgtgg tgaabbcttc bacbgcaaca 2880 
ccgcbagccb gccgtgccgb atcaaacaga bcatcaacat gtggcagggb gtbggbcgtg 2940 
cbabgbacgc tccgccgabc gcbggbaaca bcacctgcaa abccaacatc accggbcbgc 3000 
bgctggtbcg bgacggbggb gaagacaaca aaaccgaaga aabcbbccgt ccgggtggbg 3060 
gbaacatgaa agacaacbgg cgbbccgaac bgtacaaaba caaagtbabc gaactgaaac 3120 
cgcbgggbab cgctccgacc ggbgcbaagc tbaccgtbca ggcbcgtcag cbgctgtcct 3180 
ccabcgbbca gcagcagtcc aaccbgcbgc gbgctatcga agcbcagcag cacabgcbgc 3240 
agcbgaccgt bbggggbabc aaacagcbgc agacccgtgt bctggcbabc gaacgbbacc 3300 
bgaaagacca gcagcbcgag atcbggaaca acabgaccbg gatggaatgg gaccgtgaaa 3360 
bctccaacba caccgacacc abcbaccgbc bgctggaaga ctcccagacc cagcaggaaa 3420 
aaaacgaaaa agaccbgctg gcbctggacb ccbggaaaaa ccbgbggbcc bggttcgaca 3480 
bctccaacbg gctgbggbac abcaaaabcb bcatcabgab cgtbggbggb cbgabcggbc 3540 
bgcgbatcab cttcgcbgtb ctgbccabct gaggatccag abcbgcbgbg ccbbcbagbb 3600 
gccagccabc bgbtgbtbgc ccctcccccg bgccbbccbb gacccbggaa ggbgccacbc 3660 
ccacbgbccb btccbaabaa aabgaggaaa bbgcatcgca tbgbcbgagb aggtgbcatb 3720 
ctabbcbggg gggbggggbg gggcaggaca gcaaggggga ggabbgggaa gacaabagca 3780 
ggcabgctgg ggabgcggbg ggctcbabgg gbacccaggb gcbgaagaat bgacccggtb 3840 
ccbccbgggc cagaaagaag caggcacabc ccctbctcbg bgacacaccc bgbccacgcc 3900 
ccbggbbcbb agbbccagcc ccacbcatag gacactcaba gcbcaggagg gcbccgcctb 3960 
caabcccacc cgctaaagba cbbggagcgg bcbcbcccbc ccbcatcagc ccaccaaacc 4020 
aaacctagcc bccaagagbg ggaagaaabb aaagcaagat aggcbabbaa gtgcagaggg 4080 
agagaaaatg ccbccaacab gbgaggaagb aabgagagaa abcabagaab ttbaaggcca 4140 
tgatbtaagg ccabcabggc ctbaabctbc cgcttccbcg cbcactgacb cgctgcgctc 4200 
ggbcgtbcgg cbgcggcgag cggtabcagc tcactcaaag gcggtaabac ggtbabccac 4260 
agaabcaggg gataacgcag gaaagaacab gtgagcaaaa ggccagcaaa aggccaggaa 4320 
ccgbaaaaag gccgcgbtgc bggcgbbbbb ccabaggcbc cgccccccbg acgagcabca 43 80" 
caaaaabcga cgcbcaagbc agaggbggcg aaacccgaca ggactabaaa gataccaggc 4440 
gbbtcccccb ggaagcbccc bcgbgcgcbc bccbgbbccg acccbgccgc btaccggata 4500 
ccbgbccgcc btbcbcccbt cgggaagcgb ggcgcbbbcb cabagcbcac gcbgbaggba 4560 
bcbcagbbcg gbgbaggtcg ttcgcbccaa gcbgggcbgb gtgcacgaac cccccgbbca 4620 
gcccgaccgc bgcgccttat ccggtaacta bcgbcbtgag bccaacccgg baagacacga 4680 
cttabcgcca cbggcagcag ccacbggbaa caggabbagc agagcgaggb atgbaggcgg 4740 
bgcbacagag btcbbgaagb ggbggccbaa cbacggctac acbagaagaa cagtabtbgg 4800 
babcbgcgcb cbgctgaagc cagbbaccbb cggaaaaaga gtbggbagct cttgabccgg 4 860 
caaacaaacc accgctggba gcggtggbtt tbtbgbbtgc aagcagcaga bbacgcgcag 4920 
aaaaaaagga tctcaagaag abccbbtgab cttbbcbacg gggtctgacg cbcagbggaa 4980 
cgaaaacbca cgbbaaggga bbbbggtcab gagabtatca aaaaggabcb bcaccbagab 5040 
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ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 5100 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgabcbgbc tatttcgttc 5160 
atccatagtt gccbgacbcg gggggggggg gcgctgaggt ctgccbcgt.g aagaaggtgt 5220 
tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga gggagccacg 5280 
gttgatgaga gctttgttgt aggtggacca gttggbgatt ttgaacbbtt gctttgccac 5340 
ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag caaaagtbcg 5400 
atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca gtgttacaac 5460 
caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg caatttattc 5520 
atatcaggat babcaabacc atatttttga aaaagccgtt tctgtaatga aggagaaaac 5580 
tcaccgaggc agbbccabag gabggcaaga bcctggbabc ggtctgcgat tccgactcgt 5640 
ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc aagbgagaaa 5700 
tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat ttctttccag 5760 
acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc aaccaaaccg 5820 
ttabbcatbc gtgattgcgc ctgagcgaga cgaaatacgc gabcgcbgtt aaaaggacaa 5880 
ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc aacaatattt 594 0 
tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg gatcgcagbg 6000 
gtgagtaacc atgcatcatc aggagtacgg abaaaabgcb tgatggtcgg aagaggcaba 6060 
aattccgbca gccagbbtag bcbgaccatc tcabctgtaa cabcabbggc aacgcbaccb 6120 
bbgccatgbt bcagaaacaa cbcbggcgca tcgggcbbcc catacaabcg abagabbgtc 6180 
gcaccbgabt gcccgacatb abcgcgagcc cabbbatacc cababaaatc agcatccabg 6240 
bbggaabbba abcgcggcct cgagcaagac gtttcccgtt gaababggcb cataacaccc 6300 
cbbgbabbac bgbbbabgba agcagacagb bbbatbgbbc atgatgatat abbtttabcb 6360 
bgbgcaabgb aacabcagag abbbbgagac acaacgbggc bbtccccccc cccccabbat 6420 
bgaagcabbb atcagggbba bbgbctcatg agcggabaca batbbgaabg babbbagaaa 6480 
aataaacaaa baggggbbcc gcgcacabbb ccccgaaaag bgccacctga cgbcbaagaa 6540 
accabbabba bcabgacabb aacctabaaa aabaggcgba bcacgaggcc ctbbcgbc 6598 



<210> 149 
<211> 6679 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 149 

tcgcgcgttt cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagctbgtct gtaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
tbggcgggtg bcggggcbgg cbbaactatg cggcatcaga gcagatbgta ctgagagtgc 180 
accatatgcg gbgtgaaata ccgcacagat gcgtaaggag aaaataccgc abcagatbgg 24 0 
cbabbggcca bbgcabacgb bgbabccaba bcataabatg bacatbbata ttggctcatg 300 
tccaacatta ccgccabgbb gacabbgabb abbgactagb battaabagb aatcaabbac 360 
ggggbcabta gbbcatagcc catababgga gtbccgcgbb acabaactba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaabaabga cgbabgtbcc 480 
catagbaacg ccaataggga ctttccatbg acgtcaatgg gbggagbatt bacggtaaac 540 
bgcccacbbg gcagbacabc aagtgbabca tatgccaagt acgcccccta btgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagtacabg accbbatggg acbbbccbac 660 
bbggcagbac atcbacgbab bagbcatcgc tatbaccabg gbgabgcggb tbbggcagba 720 
cabcaabggg cgtggatagc ggbttgactc acggggabbt ccaagbcbcc accccabtga 780 
cgbcaabggg agtttgtbtb ggcaccaaaa tcaacgggac bbbccaaaab gtcgtaacaa 840 
ctccgcccca btgacgcaaa tgggcggbag gcgtgbacgg bgggaggtcb ababaagcag 900 
agctcgbbba gbgaaccgtc agabcgcctg gagacgccat ccacgcbgtt tbgaccbcca 960 
tagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcatbg gaacgcggab 102 0 
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tccccgtgcc aagagtgacg taagtaccgc 
tcttatgcat gctatactgt ttttggcttg 
taggtgatgg tatagcttag cctataggtg 
tattggtgac gatactttcc attactaatc 
tattggctat atgccaabac tctgtccttc 
ggatggggtc ccatttatta tttacaaatt 
cgcagttttt attaaacata gcgtgggatc 
catgggctct tctccggtag cggcggagct 
agcggctcat ggtcgctcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaatgagc gtggagattg ggctcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gtcgtcgaca cgtgtgatca 
ccatgcgtgt tcgtggtatc ccgcgtaact 
gtttctggat gatcatcatc tgccgtgttg 
gtgttccggt ttggaccgac gctaaaacca 
acgaccgtga agttcacaac gtttgggcta 
cgcaggaaat cgttctggaa aacgttaccg 
ttgaccagat gcacgaagac atcatctccc 
aactgacccc gctgtgcgtt accctgcact 
ccaacgacat gaacaaagaa atccgbaact 
ccaccatcac ccaggcttgc ccgaaagtta 
ctccggctgg tfcacgctatc ctgaaatgca 
gcaacaacgt ttccaccgtt caghgcaccc 
tgctgctgaa cggttccctg gctgaaaaag 
acaacgttaa aaccatcatc gttcacctga 
cgaacaacaa cacccgtaaa tccatgcgta 
gtgacatcat cggtgacatc cgtcaggctt 
aaaccctgaa acgtgttaaa gaaaaactgc 
aattcgctcc gtcctccggt ggtgacctgg 
gtgaattctt ctactgcaac accgctagcc 
tgtggcaggg tgttggtcgt gctatgtacg 
aatccaacat caccggtctg ctgctggttc 
aaatcttccg tccgggtggt ggtaacatga 
acaaagttat cgaactgaaa ccgctgggta 
aggctcgtca gctgctgtcc tccatcgttc 
aagctcagca gcacatgctg cagctgaccg 
ttctggctat cgaacgttac ctgaaagacc 
ggatggaatg ggaccgtgaa atctccaact 
actcccagac ccagcaggaa aaaaacgaaa 
acctgtggtc ctggttcgac atctccaact 
tcgttggtgg tctgatcggt ctgcgtatca 
gatctgctgt gccttctagt tgccagccat 
tgaccctgga aggtgccact cccactgtcc 
attgtctgag taggtgtcat tctattctgg 
aggattggga agacaatagc aggcatgctg 
tgctgaagaa ttgacccggt tcctcctggg 
gtgacacacc ctgtccacgc ccctggttct 
agctcaggag ggctccgcct tcaatcccac 
ccctcatcag cccaccaaac caaacctagc 
taggctatta agtgcagagg gagagaaaat 
aatcatagaa ttttaaggcc atgatttaag 
gctcactgac tcgctgcgct cggtcgttcg 



ctatagactc tataggcaca cccctttggc 1080 
gggcctatac acccccgctt ccttatgcta 1140 
tgggttattg accattattg accactcccc 1200 
cataacatgg ctctttgcca caactatctc 1260 
agagactgac acggactctg tatttttaca 1320 
cacatataca acaacgccgt cccccgtgcc 1380 
tccacgcgaa tctcgggtac gtgttccgga 1440 
tccacatccg agccctggtc ccatgcctcc 1500 
ctcctaacag tggaggccag acttaggcac 1560 
cacaaggccg tggcggtagg gtatgtgtct 1620 
gctgacgcag atggaagact taaggcagcg 1680 
gtattctgat aagagtcaga ggtaactccc 1740 
gtagtctgag cagtactcgt tgctgccgcg 1800 
aacagactgt tcctttccat gggtcttttc 1860 
gatatcgcgg ccgctctaga gatatcgcca 1920 
ggccgcagtg gtggatgtgg ggtatcctgg 1980 
ttggtaacat gtgggttacc gtttactacg 2040 
ccctgttctg cgcttccgac accaaagcct 2100 
cccacgcttg cgttccgacc gacccgaacc 2160 
aaaacttcaa catgtggaaa aacgacatgg 2220 
tgtgggacca gtccctgaaa ccgtgcgtta 2280 
gcaccaacgc taccttcaaa aacaacgtta 2340 
gctccttcgc atgcatcaac tgcaacgctt 2400 
acttcgaccc gatcccgatc cactactgcg 2460 
acaacaaaac cttctccggt aaaggtccgt 2520 
atggtatcaa accggttgtt tccacccagc 2580 
aaatcatcat ccgttccgaa aacctgaccg 2640 
acaaatccgt tgaaatcgtt tgcacccgtc 2700 
tcggtccggg tcagaccttc tacgctaccg 2760 
actgcaacat ctccggttcc aaatggaacg 2820 
aggaaaacta caacaacaac aaaaccatca 2880 
aaatcaccac ccactccttc aactgccgtg 2940 
tgccgtgccg tatcaaacag atcatcaaca 3000 
ctccgccgat cgctggtaac atcacctgca 3 060 
gtgacggtgg tgaagacaac aaaaccgaag 3120 
aagacaactg gcgttccgaa ctgtacaaat 3180 
tcgctccgac cggtgctaag cttaccgttc 3240 
agcagcagtc caacctgctg cgtgctatcg 3300 
tttggggtat caaacagctg cagacccgtg 3360 
agcagctcga gatctggaac aacatgacct 34.20 
acaccgacac catctaccgt ctgctggaag 3480 
aagacctgct ggctctggac tcctggaaaa 3540 
ggctgtggta catcaaaatc ttcatcatga 3600 
tcttcgctgt tctgtccatc tgaggatcca 3660 
ctgttgtttg cccctccccc gtgccttcct 3720 
tttcctaata aaatgaggaa attgcatcgc 3780 
ggggtggggt ggggcaggac agcaaggggg 3840 
gggatgcggt gggctctatg ggtacccagg 3900 
ccagaaagaa gcaggcacat ccccttctct 3 960 
tagttccagc cccactcata ggacactcat 4020 
ccgctaaagfc acttggagcg gtctctccct 4080 
ctccaagagt gggaagaaat taaagcaaga 4140 
gcctccaaca tgtgaggaag taatgagaga 4200 
gccatcatgg ccttaatctt ccgcttcctc 4260 
gctgcggcga gcggtatcag ctcactcaaa 4320 



« 



WO 02/32943 PCT/US01/25721 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 4380 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 444 0 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggbggc gaaacccgac 4500 
aggactataa agataccagg cgtttccccc tggaagcbcc ctcgtgcgct ctcctgttcc 4560 
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg bggcgctbbc 4620 
tcatagchca cgchgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 4680 
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaacb atcgtcttga 4740 
gtccaacccg gbaagacacg acttatcgcc actggcagca gccactggta acaggattag 4800 
cagagcgagg tatgtaggcg gbgcbacaga ghtcttgaag tggtggccta actacggcta 4860 
cacbagaaga acagtatttg gbabctgcgc tctgctgaag ccagttacct tcggaaaaag 4920 
agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggbggbb bbttbgbbbg 4980 
caagcagcag abbacgcgca gaaaaaaagg abcbcaagaa gatccbbtga bcbbbbcbac 5040 
ggggbcbgac gctcagtgga acgaaaacfcc acgtbaaggg abtbbggbca bgagabbabc 5100 
aaaaaggabc btcaccbaga bccbbbtaaa bbaaaaabga agbttbaaab caabcbaaag 5160 
tabababgag baaacbbggt cbgacagbba ccaabgctba atcagbgagg caccbabcbc 5220 
agcgabcbgb ctabbbcgbt cabccabagt tgccbgacbc gggggggggg ggcgcbgagg 5280 
bctgccbcgb gaagaaggbg bbgcbgacbc abaccaggcc tgaatcgccc catcatccag 5340 
ccagaaagbg agggagccac ggbtgabgag agcbbbgbbg taggbggacc agbtggbgab 54 00 
bttgaacbbt bgcbtbgcca cggaacggbc bgcgbbgbcg ggaagabgcg bgabcbgabc 5460 
cttcaacbca gcaaaagtbc gabbbab.bca acaaagccgc cgbcccgbca agtcagcgba 5520 
abgcbcbgcc agbgtbacaa ccaabtaacc aattctgabb agaaaaactc abcgagcabc 5580 
aaabgaaact gcaabbbabb catabcagga ttabcaabac cababbtttg aaaaagccgb 5640 
btcbgbaabg aaggagaaaa ctcaccgagg cagbtccaba ggatggcaag abcctggbab 5700 
cggtcbgcga bbccgacbcg bccaacabca abacaaccba bbaabbbccc cbcgtcaaaa 5760 
abaaggbbab caagbgagaa abcaccabga gbgacgacbg aabccggbga gaabggcaaa 5820' 
agcbbabgca bbbcbbbcca gacbbgttca acaggccagc cabbacgctc gbcabcaaaa 5880 
bcacbcgcab caaccaaacc gbbabbcabb cgbgabbgcg cctgagcgag acgaaabacg 5940 
cgatcgcbgb baaaaggaca abbacaaaca ggaabcgaab gcaaccggcg caggaacact 6000 
gccagcgcat caacaababb bbcaccbgaa bcaggababb ctbcbaabac cbggaabgcb 6060 
gbbtbcccgg ggabcgcagt ggbgagbaac cabgcabcab caggagtacg gataaaatgc 6120 
bbgabggbcg gaagaggcab aaabtccgbc agccagbtta gbcbgaccab cbcabcbgba 6180 
acabcattgg caacgcbacc bbbgccabgb bbcagaaaca acbcbggcgc abcgggcbbc 6240 
ccatacaabc gabagabbgb cgcaccbgab bgcccgacab babcgcgagc ccabbbabac 6300 
ccababaaab cagcabccab gbbggaabbb aabcgcggcc bcgagcaaga cgbbbcccgb 6360 
bgaababggc tcabaacacc cctbgbabta cbgbbbabgb aagcagacag bbbbabbgbb 6420 
cabgabgaba babbbtbatc bbgbgcaabg taacabcaga gabbbbgaga cacaacgbgg 6480 
cbbbcccccc ccccccabta bbgaagcatt babcagggtb abbgtcbcab gagcggabac 6540 
ababbbgaat gbabbbagaa aaabaaacaa abaggggtbc cgcgcacabb bccccgaaaa 6600 
gbgccaccbg acgbcbaaga aaccabbabb abcabgacab baaccbabaa aaataggcgb 6660 
abcacgaggc ccbtbcgbc 6679 



<210> 150 
<211> 6748 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVRl012x/s conbaining HIV genes 
<400> 150 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgta cbgagagtgc 180 
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accatatgcg gtgbgaaaba ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctatfcggcca tbgcabacgt tgtatccata tcataababg tacatttata bbggcbcabg 300 
tccaacabta ccgccatgtt gacattgatt abbgactagb tabtaabagb aabcaabbac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgccbggc bgaccgccca acgacccccg cccattgacg tcaabaatga cgtabgttcc 480 
catagbaacg ccaataggga cbbtccabbg acgtcaabgg gtggagtatb bacggbaaac 540 
tgcccactbg gcagbacatc aagtgbabca tatgccaagb acgcccccba ttgacgbcaa 600 
bgacggtaaa bggcccgccb ggcattabgc ccagbacatg accbtatggg actbbcctac 660 
bbggcagtac abcbacgtab bagtcatcgc tabbaccatg gtgabgcggt tttggcagba 720 
cabcaabggg cgtggatagc ggbtbgacbc acggggattb ccaagbctcc accccabbga 780 
cgtcaabggg agbtbgbbbb ggcaccaaaa bcaacgggac tbbccaaaab gtcgbaacaa 840 
cbccgcccca btgacgcaaa bgggcggbag gcgbgbacgg tgggaggtct ababaagcag 900 
agctcgttta gtgaaccgbc agatcgccbg gagacgccafc ccacgcbgbb ttgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcabbg gaacgcggab 1020 
bccccgtgcc aagagtgacg baagtaccgc ctabagactc babaggcaca ccccbttggc 1080 
bcbbatgcab gctatacbgb btbbggcttg gggccbatac acccccgcbt ccbbabgcta 1140 
taggtgabgg tatagcbtag cctataggbg bgggbbabbg accatbabbg accacbcccc 1200 
tabbggbgac gatacbtbcc abbactaabc cataacabgg cbcbttgcca caacbabctc 1260 
tabtggcbab atgccaabac tcbgbcctbc agagactgac acggactcbg tabbbtbaca 1320 
ggatggggbc ccatttabba tbbacaaabb cacabataca acaacgccgt cccccgbgcc 1380 
cgcagbbbbb abtaaacaba gcgbgggabc tccacgcgaa bctcgggtac gtgbtccgga 1440 
cabgggcbcb bctccggtag cggcggagcb bccacabccg agccctggtc ccatgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbcctbg cbcctaacag bggaggccag acbtaggcac 1560 
agcacaatgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbatgbgbcb 1620 
gaaaabgagc gtggagattg ggctcgcacg gcbgacgcag abggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag cbgagbbgbb gbattcbgab aagagbcaga ggbaacbccc 1740 
gbbgcggbgc tgbbaacggb ggagggcagb gbagtctgag cagbacbcgt tgcfcgccgcg 1800 
cgcgccacca gacabaabag cbgacagacb aacagactgb tccbbbccat gggtcbbbbc 1860 
bgcagbcacc gbcgbcgaca cgtgbgabca gatatcgcgg ccgcbcbaga gababcgcca 1920 
ccabgcgtgb bcgtggbatc ccgcgbaacb ggccgcagbg gbggabgbgg ggbabccbgg 1980 
gbbbcbggat gatcabcabc bgccgbgbtg btggtaacat gbgggtbacc gbbbactacg 2040 
gbgbbccggb bbggaccgac gcbaaaacca ccctgtbcbg cgcbbccgac accaaagccb 2100 
acgaccgbga agttcacaac gbtbgggcta cccacgctbg cgbbccgacc gacccgaacc 2160 
cgcaggaaab cgtbctggaa aacgbtaccg aaaacbbcaa cabgbggaaa aacgacabgg 2220 
btgaccagat gcacgaagac abcatcbccc bgbgggacca gbcccbgaaa ccgbgcgbba 2280 
aactgacccc gcbgbgcgbb acccbgcacb gcaccaacgc taccbbcaaa aacaacgbba 2340 
ccaacgacab gaacaaagaa abccgbaacb gcbccbbcaa caccaccacc gaaabccgbg 2400 
acaaaaaaca gcagggbbac gcbcbgbbcb accgbccgga cabcgbbcbg cbgaaagaaa 2460 
accgbaacaa cbccaacaac bccgaabaca tccbgabcaa ctgcaacgcb bccaccabca 2520 
cccaggctbg cccgaaagtt aacbtcgacc cgabcccgat ccactactgc gcbccggcbg 2580 
gbbacgcbab cctgaaabgc aacaacaaaa ccbbctccgg baaaggbccg tgcaacaacg 2640 
bbbccaccgt bcagbgcacc cabggbatca aaccggbbgt ttccacccag cbgcbgctga 2700 
acggbbccct ggcbgaaaaa gaaabcabca bccgtbccga aaaccbgacc gacaacgtta 2760 
aaaccatcat cgbbcaccbg aacaaatccg bbgaaabcgt bbgcacccgb gctagcgctb 2820 
actgcaacat cbccggbtcc aaabggaacg aaacccbgaa acgbgbtaaa gaaaaactgc 2880 
aggaaaacta caacaacaac aaaaccabca aabbcgcbcc gtcctccggb ggbgaccbgg 2940 
aaabcaccac ccacbccfctc aacbgccgbg gbgaatbcbb cbacbgcaac accacccgbc 3000 
bgtbcaacaa caacgcbacc gaagacgaaa ccatcacccb gccgbgccgb abcaaacaga 3060 
bcabcaacab gbggcagggb gtbggbcgbg cbatgtacgc tccgccgabc gcbggbaaca 3120 
bcaccbgcaa abccaacabc accggbcbgc bgctggbbcg bgacggbggb gaagacaaca 3180 
aaaccgaaga aabcttccgb ccgggbggbg gbaacabgaa agacaacbgg cgbbccgaac 3240 
bgbacaaaba caaagtbabc gaacbgaaac cgcbgggtat cgcbccgacc ggtgcbaagc 3300 
bbaccgbbca ggcfccgtcag cbgcbgbccb ccabcgbbca gcagcagbcc aaccbgcbgc 3360 
gbgcbabcga agctcagcag cacabgcbgc agctgaccgb btggggbatc aaacagcbgc 3420 
agacccgbgb bcbggcbabc gaacgbtacc bgaaagacca gcagcbcgag atcfcggaaca 3480 
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acatgaccbg gatggaatgg gaccgtgaaa 

bgcbggaaga ctcccagacc cagcaggaaa 

cctggaaaaa cctgtggtcc tggttcgaca 

bcabcabgab cgbtggbggb ctgatcggtc 

gaggatccag abcbgcbgbg ccttctaght 

tgccttcctt gaccctggaa ggtgccacbc 

bbgcabcgca bbgtctgagb aggbgbcabb 

gcaaggggga ggattgggaa gacaatagca 

gtacccaggt gctgaagaat tgacccggtt 

cccbbcbcbg bgacacaccc tgtccacgcc 

gacactcata gctcaggagg gctccgcctt 

tctctccctc cctcatcagc ccaccaaacc 

aaagcaagat aggcbabbaa gbgcagaggg 

aatgagagaa abcabagaab bbbaaggcca 

cgcbbcctcg cbcacbgacb cgctgcgctc 

tcactcaaag gcggbaabac ggttatccac 

gtgagcaaaa ggccagcaaa aggccaggaa 

ccataggctc cgcccccctg acgagcatca 

aaacccgaca ggactataaa gabaccaggc 

tccbgbbccg accctgccgc ttaccggata 

ggcgctttct catagctcac gctgtaggta 

gctgggctgt gtgcacgaac cccccgbbca 

tcgtcttgag tccaacccgg taagacacga 

caggattagc agagcgaggt atgtaggcgg 

ctacggctac actagaagaa cagtatttgg 

cggaaaaaga gttggtagct cttgatccgg 

ttttgtttgc aagcagcaga ttacgcgcag 

cbbbbcbacg gggtctgacg ctcagtggaa 

gagattatca aaaaggatct tcacctagat 

aatctaaagt atatatgagt aaacttggtc 

accbabcbca gcgabctgtc tabtbcgbbc 

gcgctgaggb ctgcctcgtg aagaaggbgb 

abcatccagc cagaaagbga gggagccacg 

gbbggtgabb bbgaacbbbb gcbbbgccac 

gabctgabcc bbcaactcag caaaagtbcg 

gbcagcgbaa bgctcbgcca gbgbbacaac 

tcgagcabca aabgaaactg caabbbabbc 

aaaagccgbt bcbgbaabga aggagaaaac 

bcctggbabc ggbctgcgab bccgactcgb 

bcgbcaaaaa baaggbtatc aagbgagaaa 

aabggcaaaa gctbatgcab btctbbccag 

tcatcaaaat cacbcgcabc aaccaaaccg 

cgaaabacgc gabcgcbgbt aaaaggacaa 

aggaacacbg ccagcgcabc aacaababbt 

bggaabgctg bbbbcccggg gabcgcagbg 

abaaaabgcb bgabggbcgg aagaggcaba 

tcatcbgbaa cabcattggc aacgcbacct 

bcgggctbcc cabacaabcg atagattgbc 

catbbabacc cabataaabc agcabccabg 

gbtbcccgtb gaababggcb cataacaccc 

tttabbgbbc atgatgabat abtbbbabcb 

acaacgbggc btbccccccc cccccatbab 

agcggataca babbtgaabg tatttagaaa 

ccccgaaaag bgccaccbga cgtcbaagaa 

aabaggcgba tcacgaggcc cbttcgbc 



bcbccaacba caccgacacc abcbaccgbc 3540 
aaaacgaaaa agaccbgcbg gcbcbggact 3600 
bcbccaacfcg gcbgbggbac abcaaaabcb 3660 
bgcgbabcab cbtcgctgtt cbgbccatcb 3720 
gccagccatc bgttgbbbgc cccbcccccg 3780 
ccacbgbcct tbccbaabaa aabgaggaaa 3840 
cbabbcbggg gggbggggbg gggcaggaca 3900 
ggcatgcbgg ggatgcggbg ggctcbatgg 3960 
ccbccbgggc cagaaagaag caggcacabc 4020 
ccbggbtcbt agbtccagcc ccacbcatag 4080 
caabcccacc cgcbaaagba cbbggagcgg 4140 
aaaccbagcc bccaagagbg ggaagaaabb 4200 
agagaaaabg ccbccaacat gbgaggaagb 4260 
bgabbbaagg ccabcatggc ctbaabcbbc 4320 
ggbcgbbcgg ctgcggcgag cggbabcagc 4380 
agaabcaggg gabaacgcag gaaagaacab 4440 
ccgbaaaaag gccgcgtbgc bggcgbttbb 4500 
caaaaabcga cgctcaagtc agaggbggcg 4560 
gbbtcccccb ggaagcbccc bcgbgcgcbc 4620 
ccbgbccgcc btbcbcccbt cgggaagcgb 4680 
bcbcagbbcg gtgbaggbcg ttcgcbccaa 4740 
gcccgaccgc bgcgccbbab ccggbaacba 4800 
cbbatcgcca cbggcagcag ccacbggbaa 4860 
tgcbacagag bbcbbgaagb ggtggcctaa 4920 
tatcbgcgcb ctgcbgaagc cagbbaccbt 4980 
caaacaaacc accgcbggba gcggbggbtt 5040 
aaaaaaagga tcbcaagaag atccbtbgat 5100 
cgaaaactca cgbbaaggga ttbbggbcat 5160 
cctbbbaaat baaaaatgaa gbbbtaaabc 5220 
bgacagbtac caatgctbaa tcagtgaggc 5280 
abccabagtt gccbgacbcg gggggggggg 5340 
bgcbgactca baccaggccb gaabcgcccc 5400 
gttgabgaga gcbbbgtbgb aggbggacca 5460 
ggaacggtcb gcgbbgtcgg gaagabgcgt 5520 
abtbabbcaa caaagccgcc gbcccgbcaa 5580 
caatbaacca abbctgatba gaaaaacbca 5640 
abatcaggab babcaatacc atabtbbbga 5700 
tcaccgaggc agbbccabag gatggcaaga 5760 
ccaacabcaa bacaaccbab taabbbcccc 5820 
tcaccabgag bgacgacbga abccggbgag 5 880 
acbbgtbcaa caggccagcc abbacgcbcg 5940 
tbabbcabbc gbgatbgcgc cbgagcgaga 6000 
tbacaaacag gaatcgaabg caaccggcgc 6060 
bcaccbgaab caggababbc. bbcbaabacc 6120 
gbgagbaacc abgcabcabc aggagbacgg 6180 
aabbccgbca gccagbtbag bcbgaccabc 6240 
bbgccabgbb bcagaaacaa cbcbggcgca 6300 
gcacctgabb gcccgacabt abcgcgagcc 6360 
bbggaabbba abcgcggcct cgagcaagac 6420 
cbbgbabbac bgtbbabgba agcagacagt 6480 
tgbgcaabgb aacabcagag abbbbgagac 6540 
bgaagcabbb abcagggbba bbgbcbcabg 6600 
aabaaacaaa baggggbbcc gcgcacatbb 6660 
accabbabba bcabgacabb aaccbabaaa 6720 

6748 
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<210> 151 
<211>. 6709 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 151 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 

tkggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 

9999tcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 102 0 

tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 

tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 

taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 

tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 

ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 

catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 162 0 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 

gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 

tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 

ccatgcgtgt tcgtggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggtatcctgg 1980 

gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggttacc gtttactacg 2040 

gfcgttccggt ttggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 

acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 

cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa cabgtggaaa aacgacatgg 2220 

ttgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgtta 2280 

aactgacccc gctgtgcgtt accctgcact gcaccaacgc taccttcaaa aacaacgtta 2340 

ccaacgacat gaacaaagaa atccgtaact gctccttcaa caccaccacc gaaatccgtg 2400 

acaaaaaaca gcagggttac gctctgttct accgtccgga catcgttctg ctgaaagaaa 2460 

accgtaacaa ctccaacaac tccgaataca tcctgatcaa ctgcaacgct tccaccatca 2520 
cccaggcttg cccgaaagtt aacttcgacc cgatcccgat ccactactgc gctccggctg 2580 
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gttacgctat cctgaaatgc aacaacaaaa 
tttccaccgt tcagtgcacc catggtatca 
acggttccct ggctgaaaaa gaaatcabca 
aaaccatcat cgbtcacctg aacaaatccg 
actgcaacat cbccggbbcc aaatggaacg 
aggaaaacta caacaacaac aaaaccatca 
aaatcaccac ccactccttc aachgccgbg 
tgccgtgccg tatcaaacag atcatcaaca 
cbccgccgab cgctggfcaac abcaccbgca 
gtgacggtgg tgaagacaac aaaaccgaag 
aagacaacbg gcgtbccgaa ctgtacaaat 
tcgctccgac cggtgctaag cttaccgttc 
agcagcagtc caacctgctg cgtgctatcg 
tttggggtat caaacagcbg cagacccgtg 
agcagctcga gatctggaac aacatgaccb 
acaccgacac catctaccgt ctgctggaag 
aagacctgct ggctctggac tcctggaaaa 
ggctgtggta catcaaaatc tbcatcatga 
tcttcgctgt tcbgtccabc tgaggatcca 
cbgtbgbbbg cccctccccc gbgccbtccb 
bbtccbaata aaatgaggaa attgcatcgc 
ggg9tggggt ggggcaggac agcaaggggg 
gggatgcggt gggctcbatg ggtacccagg 
ccagaaagaa gcaggcacat ccccttcbcb 
tagttccagc cccactcata ggacactcat 
ccgctaaagb acttggagcg gtctctccct 
ctccaagagt gggaagaaat taaagcaaga 
gcctccaaca tgtgaggaag baabgagaga 
gccatcatgg ccttaatctt ccgcttcctc 
gctgcggcga gcggtatcag ctcactcaaa 
ggataacgca ggaaagaaca tgtgagcaaa 
ggccgcgbbg cbggcgtbtt tccataggcb 
acgcbcaagb cagaggtggc gaaacccgac 
tggaagctcc ctcgtgcgct ctcctgttcc 
ctttctccct tcgggaagcg bggcgcbbbc 
ggbgbaggbc gttcgctcca agcbgggcbg 
cbgcgccbba tccggtaact atcgtcttga 
actggcagca gccactggta acaggabbag 
gttcttgaag tggtggccta actacggcba 
tctgctgaag ccagttacct tcggaaaaag 
caccgctggt agcggbggbt bbtbbgtbbg 
atctcaagaa gabcctbbga bcbbbbcbac 
acgttaaggg attttggtca bgagabbabc 
ttaaaaatga agttttaaat caabcbaaag 
ccaatgctta atcagbgagg cacctatctc 
tgcctgactc gggggggggg ggcgcbgagg 
abaccaggcc tgaatcgccc cabcatccag 
agcttbgtbg taggtggacc agbbggbgab 
tgcgbbgbcg ggaagatgcg tgabctgatc 
acaaagccgc cgbcccgbca agbcagcgta 
aattcbgabt agaaaaactc atcgagcatc 
bbabcaabac cababtbtbg aaaaagccgb 
cagbbccata ggabggcaag abccbggbab 
atacaaccba bbaattbccc cbcgbcaaaa 
gtgacgactg aabccggbga gaatggcaaa 



ccbbcbccgg baaaggbccg bgcaacaacg 2640 
aaccggbtgt btccacccag cbgcbgcbga 2700 
bccgbbccga aaaccbgacc gacaacgtba 2760 
bbgaaabcgt bbgcacccgb gcbagcgcbt 2820 
aaacccbgaa acgtgbtaaa gaaaaacbgc 2880 
aatbcgcbcc gtcctccggt ggtgaccbgg 2940 
gbgaabbctb ctactgcaac accgctagcc 3000 
tgtggcaggg bgbbggtcgb gcbabgbacg 3060 
aatccaacat caccggbcbg ctgcbggbbc 3120 
aaatcbtccg bccgggtggb ggtaacabga 3180 
acaaagbbab cgaacbgaaa ccgcbgggba 3240 
aggcbcgbca gcbgctgtcc tccatcgbbc 3300 
aagcbcagca gcacabgcbg cagcbgaccg 3360 
bbctggcbab cgaacgtbac cbgaaagacc 3420 
ggabggaabg ggaccgbgaa atcbccaacb 3480 
acbcccagac ccagcaggaa aaaaacgaaa 354 0 
accbgbggtc cbggtbcgac atcbccaact 3600 
tcgbtggtgg bcbgatcggt cbgcgbabca 3660 
gabctgcbgb gcctbctagb bgccagccat 3720 
tgacccbgga aggbgccacb cccacbgbcc 3780 
atbgbcbgag taggtgtcab tctattcbgg 3840 
aggatbggga agacaatagc aggcabgcbg 3900 
tgcbgaagaa bbgacccggt bccbcctggg 3960 
gtgacacacc ctgtccacgc cccbggbtct 4020 
agcbcaggag ggcbccgcct tcaatcccac 4080 
cccbcabcag cccaccaaac caaaccbagc 4140 
baggcbabta agtgcagagg gagagaaaab 4200 
aabcabagaa btbtaaggcc abgatbbaag 4260 
gctcacbgac bcgcbgcgct cggbcgbbcg 4320 
ggcggtaata cggtbatcca cagaabcagg 4380 
aggccagcaa aaggccagga accgbaaaaa 4440 
ccgccccccb gacgagcabc acaaaaabcg 4500 
aggacbabaa agabaccagg cgbbbccccc 4560 
gacccbgccg cbbaccggab accbgbccgc 4620 
bcatagcbca cgcbgbaggb abcbcagbbc 4680 
bgbgcacgaa ccccccgbbc agcccgaccg 4740 
gbccaacccg gbaagacacg acbbabcgcc 4800 
cagagcgagg babgbaggcg gtgcbacaga 4860 
cacbagaaga acagbabbbg gbabcbgcgc 4920 
a 9ttggbagc bctbgatccg gcaaacaaac 4980 
caagcagcag atbacgcgca gaaaaaaagg 504 0 
ggggtcbgac gcbcagtgga acgaaaacbc 5100 
aaaaaggabc btcaccbaga bccbbtbaaa 5160 
tatababgag baaacttggt cbgacagbta 5220 
agcgatcbgb cbabbtcgbt cabccabagb 5280 
tctgccbcgb gaagaaggbg bbgcbgactc 5340 
ccagaaagbg agggagccac ggbbgabgag 5400 
tbbgaactbb tgcbbtgcca cggaacggtc 5460 
cbbcaacbca gcaaaagtbc gabbbabbca 5520 
abgctcbgcc agbgbbacaa ccaabbaacc 5580 
aaabgaaacb gcaabbtabb cababcagga 5640 
tbcbgbaatg aaggagaaaa ctcaccgagg 5700 
cggbcbgcga bbccgacbcg bccaacabca 5760 
abaaggbbat caagbgagaa atcaccabga 5820 
agcbtatgca bttctttcca gactbgbbca 5880 
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acaggccagc cattacgctc gtcatcaaaa 
cgtgattgcg cctgagcgag acgaaatacg 
ggaatcgaat gcaaccggcg caggaacact 
tcaggatatt cttctaatac ctggaatgcfc 
catgcatcat caggagtacg gataaaatgc 
agccagttta gtctgaccat ctcatctgta 
tfccagaaaca actchggcgc atcgggcttc 
tgcccgacat tatcgcgagc ccatttatac 
aatcgcggcc tcgagcaaga cgtttcccgt 
ctgtttatgt aagcagacag ttttattgtt 
taacatcaga gattttgaga cacaacgtgg 
tatcagggtt attgtctcat gagcggatac 
ataggggttc cgcgcacatt tccccgaaaa 
atcatgacat taacctataa aaataggcgt 



tcactcgcat caaccaaacc gttattcatt 5940 
cgatcgchgt taaaaggaca attacaaaca 6000 
gccagcgcat caacaatatt ttcacctgaa 6060 
gttttcccgg ggatcgcagt ggtgagtaac 6120 
ttgatggtcg gaagaggcat aaattccgtc 6180 
acatcattgg caacgctacc tttgccabgt 6240 
ccatacaatc gatagattgt cgcacctgat 6300 
ccatataaat cagcatccat gttggaattt 6360 
tgaatatggc tcataacacc ccttgtatta 6420 
catgatgata tatttttatc ttgtgcaatg 6480 
ctttcccccc ccccccatta ttgaagcatt 6540 
atatttgaat gtatttagaa aaataaacaa 6600 
gtgccacctg acgtcbaaga aaccattatt 6660 
abcacgaggc cctttcgtc 6709 



<210> 152 
<211> 6790 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 152 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gtfccatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca tfcgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 12 00 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
9g at g99gtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 144 0 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
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gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgchgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt bcgbggbabc ccgcgtaact ggccgcagbg gbggabgbgg ggbabccbgg 1980 
gtttctggat gabcabcabc tgccgtgbtg ttggtaacat gtgggttacc gbbbacbacg 2040 
ghgttccggt ttggaccgac gctaaaacca ccctgttctg cgcbbccgac accaaagcct 2100 
acgaccgtga agttcacaac gtttgggcta cccacgcttg cgtbccgacc gacccgaacc 2160 
cgcaggaaab cgttctggaa aacgttaccg aaaacbbcaa cabgbggaaa aacgacatgg 2220 
bbgaccagab gcacgaagac abcabcbccc bgbgggacca gtcccbgaaa ccgtgcgtta 2280 
aactgacccc gctgbgcgtt accctgcact gcaccaacgc taccfctcaaa aacaacgtta 2340 
ccaacgacat gaacaaagaa atccgtaact gctccttcaa caccaccacc gaaatccgtg 24 00 
acaaaaaaca gcagggttac gcbcbgbbcb accgtccgga cahcgbtctg ctgaaagaaa 24 60 
accgtaacaa ctccaacaac tccgaabaca bccbgabcaa cbgcaacgcb bccaccabca 2520 
cccaggcbbg cccgaaagbt aactbcgacc cgabcccgat ccacbacbgc gcbccggcbg 2580 
gbbacgctab ccbgaaabgc aacaacaaaa cctbcbccgg taaaggbccg bgcaacaacg 2640 
tbbccaccgt bcagbgcacc catggbabca aaccggbbgb bbccacccag ctgctgcbga 2700 
acggbbcccb ggcbgaaaaa gaaabcabca bccgbbccga aaaccbgacc gacaacgbba 2760 
aaaccatcab cgtbcaccbg aacaaabccg ttgaaatcgb bbgcacccgb ccgaacaaca 2820 
acacccgbaa atccabgcgb atcggbccgg gbcagaccbb cbacgcbacc ggbgacatca 2880 
bcggbgacab ccgbcaggcb bacbgcaaca tcbccggbbc caaabggaac gaaaccctga 2940 
aacgbgbbaa agaaaaacbg caggaaaacb acaacaacaa caaaaccabc aaabbcgcbc 3000 
cgbccbccgg bggbgaccbg gaaabcacca cccactccbb caacbgccgb ggtgaabtcb 3060 
bcbactgcaa caccgcbagc ctgccgbgcc gbabcaaaca gabcabcaac abgbggcagg 3120 
gbgbbggbcg bgcbabgbac gcbccgccga bcgcbggbaa cabcaccbgc aaabccaaca 3180 
bcaccggbcb gctgcbggbb cgbgacggbg gbgaagacaa caaaaccgaa gaaabcbbcc 3240 
gbccgggbgg bggbaacabg aaagacaacb ggcgbbccga acbgbacaaa bacaaagtba 3300 
bcgaacbgaa accgctgggb abcgcbccga ccggbgcbaa gcbbaccgbb caggcbcgbc 3360 
agcbgcbgbc cbccabcgbb cagcagcagb ccaaccbgcb gcgbgcbabc gaagcbcagc 3420 
agcacabgcb gcagcbgacc gbbbggggba bcaaacagcb gcagacccgb gbbcbggcta 34 80 
bcgaacgfcba ccbgaaagac cagcagctcg agabcbggaa caacabgacc bggabggaab 3540 
gggaccgbga aatcbccaac bacaccgaca ccabcbaccg bcbgcbggaa gactcccaga 3600 
cccagcagga aaaaaacgaa aaagacctgc bggcbcbgga ctcctggaaa aaccbgbggb 3660 
ccbggbbcga cabcbccaac bggctgbggb acabcaaaab ctbcabcabg abcgbbggbg 3720 
gbctgabcgg bcbgcgbabc abcbtcgcbg btcbgbccab cbgaggabcc agabcbgcbg 3780 
bgccbbcbag bbgccagcca bcbgbbgbbb gccccbcccc cgtgccbbcc bbgacccbgg 3840 
aaggbgccac bcccactgbc cbbbccbaab aaaabgagga aatbgcabcg cabbgbcbga 3900 
gtaggbgbca bbcbabtcbg gggggbgggg bggggcagga cagcaagggg gaggabtggg 3960 
aagacaabag caggcabgcb ggggabgcgg bgggcbcbab gggbacccag gbgcbgaaga 4020 
abbgacccgg bbccbccbgg gccagaaaga agcaggcaca bccccbbcbc bgbgacacac 4080 
ccbgbccacg ccccbggbbc tbagbtccag ccccacbcab aggacacbca bagcbcagga 4140 
gggcbccgcc bbcaabccca cccgcbaaag bacbbggagc ggbcbcbccc bcccbcabca 4200 
gcccaccaaa ccaaaccbag ccbccaagag bgggaagaaa bbaaagcaag abaggcbabb 4260 
aagbgcagag ggagagaaaa bgccbccaac abgbgaggaa gtaabgagag aaabcabaga 4320 
abbbbaaggc cabgabbbaa ggccabcabg gccbbaabct bccgctbccb cgcbcacbga 4380 
cbcgcbgcgc bcggbcgbbc ggcbgcggcg agcggbatca gcbcacbcaa aggcggbaat 4440 
acggbbabcc acagaabcag gggabaacgc aggaaagaac afcgbgagcaa aaggccagca 4500 
aaaggccagg aaccgbaaaa aggccgcgbb gcbggcgbbb bbccabaggc bccgcccccc 4560 
tgacgagcab cacaaaaabc gacgcbcaag bcagaggbgg cgaaacccga caggacbaba 4620 
aagabaccag gcgbbbcccc cbggaagcbc ccbcgtgcgc bcbccbgbbc cgacccbgcc 4680 
gcbbaccgga baccbgbccg ccbbbctccc tbcgggaagc gbggcgcbbb cbcabagcbc 4740 
acgctgbagg babcbcagbb cggbgbaggb cgbbcgcbcc aagcbgggcb gbgbgcacga 4800 
accccccgbb cagcccgacc gcbgcgccbb abccggbaac babcgbcbbg agbccaaccc 4860 
ggbaagacac gactbabcgc cacbggcagc agccacbggt aacaggabba gcagagcgag 4920 
gbabgbaggc ggbgctacag agbbcbtgaa gbggbggccb aacbacggcb acactagaag 4980 
aacagbabbb ggbabcbgcg ctcbgcfcgaa gccagbbacc bbcggaaaaa gagbbggtag 5040 
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ctcbbgabcc ggcaaacaaa ccaccgctgg 
gattacgcgc agaaaaaaag gabctcaaga 
cgcbcagbgg aacgaaaact cacgbbaagg 
cttcacctag atccttttaa abbaaaaabg 
gtaaacttgg tctgacagtt accaatgcbb 
tcbabbbcgb bcabccabag bbgccbgacb 
tgaagaaggt gttgcbgact cabaccaggc 
gagggagcca cggbtgatga gagctttgtt 
ttgctttgcc acggaacggt ctgcgttgtc 
agcaaaagtt cgatttattc aacaaagccg 
cagtgttaca accaabtaac caabbcbgat 
bgcaatbbab tcatatcagg attatcaata 
gaaggagaaa actcaccgag gcagbbccat 
attccgactc gtccaacatc aatacaacct 
tcaagtgaga aatcaccatg agtgacgact 
abbbcbbbcc agacttgttc aacaggccag 
tcaaccaaac cgbbatbcat tcgtgattgc 
bbaaaaggac aattacaaac aggaatcgaa 
bcaacaabab tttcacctga atcaggatat 
gggatcgcag bggtgagbaa ccatgcatca 
ggaagaggca taaattccgt cagccagttt 
gcaacgctac ctttgccatg tttcagaaac 
cgatagattg tcgcacctga ttgcccgaca 
tcagcatcca tgttggaatt taatcgcggc 
ctcataacac cccttgtatt actgtttatg 
atatttttat cttgtgcaat gtaacatcag 
cccccccatt attgaagcat ttatcagggt 
bgbabbbaga aaaataaaca aabaggggbb 
gacgtctaag aaaccabbab babcabgaca 
ccctttcgtc 



tagcggtggt ttttttgttt gcaagcagca 5100 
agatcctttg atcbtbbcba cggggtctga 5160 
gabtbbggbc atgagabbab caaaaaggat 5220 
aagttttaaa tcaatctaaa gtatatatga 5280 
aatcagtgag gcacctatct cagcgabcbg 5340 
cggggggggg gggcgcbgag gtctgcctcg 5400 
cbgaabcgcc ccabcabcca gccagaaagb 5460 
gtaggbggac cagbbggbga bbbbgaacbb 5520 
gggaagatgc gbgabcbgab ccbbcaacbc 5580 
ccgtcccgbc aagbcagcgb aabgctcbgc 5640 
bagaaaaacb cabcgagcab caaabgaaac 5700 
ccabattttt gaaaaagccg btbcbgbaab 5760 
aggabggcaa gatccbggba tcggbcbgcg 5820 
abbaabbbcc ccbcgbcaaa aabaaggbta 5880 
gaatccggbg agaabggcaa aagcbbabgc 5940 
ccatbacgcb cgtcabcaaa abcacbcgca 6000 
gccbgagcga gacgaaabac gcgabcgcbg 6060 
tgcaaccggc gcaggaacac tgccagcgca 6120 
tcbbctaaba ccbggaabgc bgbbbbcccg 6180 
bcaggagbac ggabaaaabg cbbgabggbc 6240 
agbcbgacca bctcatctgt aacabcabbg 6300 
aacbcbggcg cabcgggcbb cccabacaab 6360 
bbabcgcgag cccabbtata cccababaaa 6420 
ctcgagcaag acgbbbcccg bbgaababgg 64 80 
baagcagaca gbbbbatbgb bcabgafcgab 6540 
agabbbbgag acacaacgbg gcbbbccccc 6600 
babbgbcbca bgagcggaba cababbtgaa 6660 
ccgcgcacab btccccgaaa agbgccaccb 6720 
bbaaccbaba aaaabaggcg babcacgagg 6780 

6790 



<210> 153 
<211> 6637 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV gene3 
<400> 153 

bcgcgcgbbt cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 12 0 
tbggcgggbg tcggggcbgg cbbaacbabg cggcabcaga gcagabtgba ctgagagbgc 180 
accababgcg gbgtgaaaba ccgcacagab gcgbaaggag aaaataccgc abcagabbgg 24 0 
cbabbggcca bbgcabacgb bgtabccaba bcabaababg bacatbbaba bbggcbcabg 300 
bccaacabba ccgccabgtb gacatbgabb abbgacbagb babbaabagb aabcaabbac 360 
ggggbcabba gbbcabagcc cabababgga gbtccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaatga cgbatgbbcc 480 
catagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagbabb bacggbaaac 54 0 
bgcccacbbg gcagbacabc aagbgbabca babgccaagb acgcccccba bbgacgbcaa 600 
tgacggbaaa bggcccgccb ggcabbabgc ccagbacabg acctbatggg acbbbccbac 660 
bbggcagbac abcbacgbat bagbcabcgc babbaccabg gtgabgcggb bbbggcagba 72 0 
cabcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagbcbcc accccabbga 780 
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cgtcaatggg agttbgbttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 

agctcgfctta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 

tccccgtgcc aagagtgacg taagfiaccgc ctatagactc tataggcaca cccctttggc 1080 

tcttatgcat gctatacbgt tttbggcbtg gggcctatac acccccgctt ccttatgcta 1140 

taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 

tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

tattggctat atgccaatac tctgtccttc agagactgac acggactctg tabtbbbaca 1320 

ggatggggtc ccatttatta tbtacaaatt cacatataca acaacgccgb cccccgtgcc 1380 

cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgtbccgga 1440 

catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg cbcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgfcgccg cacaaggccg bggcggbagg gtatgbgtcb 1620 

gaaaatgagc gtggagabbg ggctcgcacg gctgacgcag atggaagact taaggcagcg 168 0 

gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 

gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacataabag ctgacagact aacagactgt tcctttccat gggtcttthc 1860 

tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgcbctaga gatabcgcca 1920 

ccatgcgtgt tcgfcggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggtatcctgg 1980 

gtttctggat gatcatcatc tgccgtgttg tbggtaacab gtgggttacc gtttactacg 2 04 0 

gtgttccggt ttggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 

acgaccgtga agttcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 

cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 2220 

ttgaccagah gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgbta 2280 

aactgacccc gctgtgcgtt accctgcact gcaccaacgc tagccgtaac tgctccttca 2340 

acaccaccac cgaaatccgt gacaaaaaac agcagggtta cgctctgttc taccgtccgg 2400 

acatcgttct gctgaaagaa aaccgtaaca actccaacaa ctccgaatac atcctgatca 2460 

actgcaacgc ttccaccatc acccaggctt gcccgaaagt taacbtcgac ccgatcccga 2520 

tccactactg cgctccggct ggtbacgcba tcctgaaatg caacaacaaa acctbctccg 2580 

gtaaaggtcc gtgcaacaac gbttccaccg ttcagtgcac ccatggbatc aaaccggttg 2 64 0 

tttccaccca gctgctgctg aacgghtccc tggctgaaaa agaaatcabc atccgbbccg 2700 

aaaacctgac cgacaacgtb aaaaccabca tcgttcacct gaacaaatcc gttgaaatcg 2760 

btbgcacccg bgctagcgct tactgcaaca tctccggbtc caaabggaac gaaaccctga 2B20 

aacgtgttaa agaaaaacbg caggaaaact acaacaacaa caaaaccatc aaattcgctc 2 880 

cgtccfcccgg tggtgaccbg gaaabcacca cccactcctt caactgccgt ggtgaabbcb 2940 

tctactgcaa caccacccgb ctgttcaaca acaacgcbac cgaagacgaa accabcaccc 3000 

tgccgtgccg babcaaacag abcabcaaca tgbggcaggg tgbtggbcgt gcbatgtacg 3060 

ctccgccgat cgctggtaac atcaccbgca aatccaacab caccggtctg ctgctggttc 3120 

gtgacggtgg bgaagacaac aaaaccgaag aaabcttccg tccgggbggt ggtaacatga 3180 

aagacaactg gcgttccgaa ctgtacaaat acaaagttab cgaactgaaa ccgctgggta 3240 

tcgctccgac cggtgctaag cttaccgttc aggctcgtca gctgctgtcc tccatcgttc 3 3 00 

agcagcagtc caacctgctg cgbgctatcg aagctcagca gcacabgctg cagctgaccg 3360 

tttggggtat caaacagctg cagacccgtg ttctggctat cgaacgttac ctgaaagacc 3420 

agcagctcga gatctggaac aacatgacct ggatggaatg ggaccgbgaa atctccaact 34 80 

acaccgacac catctaccgt ctgctggaag actcccagac ccagcaggaa aaaaacgaaa 354 0 

aagacctgct ggctctggac tccbggaaaa acctgtggtc cbggbfccgac atctccaacb 3600 

ggctgbggtg aggabccaga fcctgcbgtgc cttctagttg ccagccatct gthgtfcbgcc 3660 

ccbcccccgt gccbtccbtg accctggaag gtgccactcc cacbgtccbb tcctaataaa 3720 

atgaggaaat tgcatcgcab tgtctgagta ggtgtcatbc tatbctgggg ggbggggbgg 3780 

ggcaggacag caagggggag gabbgggaag acaatagcag gcatgctggg gabgcggbgg 3 840 

gcbctatggg tacccaggtg cbgaagaatt gacccggttc cbccbgggcc agaaagaagc 3900 

aggcacatcc cctbctcbgb gacacaccct gtccacgccc ctggtbctba gtbccagccc 3960 

cactcatagg acacbcatag ctcaggaggg ctccgccfctc aatcccaccc gctaaagbac 4020 

ttggagcggt ctcbccctcc ctcatcagcc caccaaacca aaccbagccb ccaagagbgg 4 080 
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gaagaaabba aagcaagaba ggcbabtaag tgcagaggga gagaaaatgc cbccaacabg 4140 
tgaggaagta atgagagaaa tcatagaatt bbaaggccat gatttaaggc catcatggcc 42 00 
ttaatcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc 4260 
ggtatcagct cacbcaaagg cggtaatacg gbbabccaca gaatcagggg ataacgcagg 4320 
aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgtbgct 4380 
ggcgtttttc cataggctcc gcccccctga cgagcabcac aaaaatcgac gctcaagtca 4440 
gaggbggcga aacccgacag gactataaag ataccaggcg tttccccctg gaagcbcccb 4500 
cgtgcgctct cctgttccga ccctgccgct taccggatac ctgbccgcct ttctcccttc 4560 
gggaagcgtg gcgctttctc atagctcacg cbgbaggbab cbcagtbcgg tgtaggtcgt 4620 
tcgctccaag cbgggctgbg bgcacgaacc ccccgttcag cccgaccgcb gcgccbbabc 4680 
cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc 4740 
cactggtaac aggattagca gagcgaggta bgbaggcggb gctacagagb tcbbgaagbg 4800 
gbggccbaac tacggcbaca ctagaagaac agtabbbggb abctgcgcbc bgcbgaagcc 4860 
agbbaccbbc ggaaaaagag tbggbagcbc tbgabccggc aaacaaacca ccgcbggbag 4920 
cggbggbtbb bbbgbttgca agcagcagab bacgcgcaga aaaaaaggat ctcaagaaga 4980 
bcctbtgabc bbbbctacgg ggtcbgacgc bcagtggaac gaaaactcac gttaagggab 5040 
bbbggtcabg agattabcaa aaaggabcbt cacctagabc cbbttaaabb aaaaabgaag '5100 
bbbbaaabca atcbaaagta batabgagba aacttggbcb gacagtbacc aabgcbbaab 5160 
cagbgaggca cctabcbcag cgabcbgtcb abbbcgbbca tccabagbbg cctgacbcgg 5220 
999999gggg cgctgaggbc bgccbcgtga agaaggtgbb gctgactcat accaggcctg 52 80 
aabcgcccca bcabccagcc agaaagbgag ggagccacgg bbgabgagag cbttgttgta 5340 
ggbggaccag bbggbgabtb bgaacbbtbg ctbbgccacg gaacggbctg cgbbgbcggg 5400 
aagabgcgbg atcbgabccb bcaacbcagc aaaagbbcga bbbabbcaac aaagccgccg 5460 
tcccgbcaag bcagcgtaat gcbcbgccag bgbbacaacc aabtaaccaa bbcbgabbag 5520 
aaaaacbcat cgagcabcaa atgaaactgc aabbbabbca babcaggatb atcaabacca 5580 
babtbbbgaa aaagccgbbt cbgbaabgaa ggagaaaacb caccgaggca gttccabagg 5640 
abggcaagab ccbggtabcg gbcbgcgabb ccgacbcgbc caacabcaat acaacctabb 5700 
aabtbccccb cgbcaaaaat aaggbtabca agtgagaaab caccabgagb gacgacbgaa 5760 
tccggtgaga abggcaaaag ctbatgcabt tctfcfcccaga cbbgbbcaac aggccagcca 5820 
bbacgcbcgb cabcaaaatc acbcgcabca accaaaccgb batbcabtcg bgabbgcgcc 5880 
bgagcgagac gaaabacgcg abcgctgbba aaaggacaab bacaaacagg aabcgaabgc 5940 
aaccggcgca ggaacacbgc cagcgcabca acaababbtb caccbgaabc aggabatbcb 6000 
tcbaabacct ggaabgcbgb bbbcccgggg abcgcagtgg bgagtaacca bgcabcabca 6060 
ggagbacgga baaaabgcbb gatggbcgga agaggcabaa abbccgbcag ccagbbbagb 6120 
ctgaccabct cabctgbaac- abcabbggca acgcbaccbb bgccabgbbb cagaaacaac 6180 
bcbggcgcab cgggcbtccc abacaabcga tagattgtcg cacctgattg cccgacabta 6240 
bcgcgagccc abbbabaccc abataaabca gcabccabgb tggaabtbaa bcgcggccbc 6300 
gagcaagacg bbbcccgbbg aababggcbc abaacacccc bbgbabtacb gbtbabgbaa 6360 
gcagacagbb btabtgttca bgabgabata tbbbbabcbb gbgcaatgta acatcagaga 6420 
bbbbgagaca caacgbggcb ttcccccccc ccccatbabb gaagcabbba bcagggbbab 64 80 
bgbctcabga gcggabacab atbbgaabgb abttagaaaa abaaacaaab aggggbbccg 6540 
cgcacabbbc cccgaaaagt gccaccbgac gbctaagaaa ccabbabbab cabgacabba 6600 
accbabaaaa abaggcgbat cacgaggccc bbbcgtc 6637 
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tcgcgcgttt cggtgatgac ggtgaaaacc bcbgacacat gcagctcccg gagacggtca 60 
cagcttgtct gbaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
tt: g9 c g9gtg tcggggcbgg cbtaacbabg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gbgbgaaaba ccgcacagat gcgbaaggag aaaataccgc atcagattgg 240 
cbabbggcca ttgcatacgt bgtabccaba tcataatatg bacabbbaba ttggctcatg 300 
tccaacatha ccgccabgtb gacattgatt attgactagt babbaabagb aatcaafcbac 360 
ggggbcabba gbbcatagcc catatatgga gttccgcgtfc acataactta cggbaaabgg 420 
cccgcctggc bgaccgccca acgacccccg cccattgacg tcaataatga cgbatgbbcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
bgcccacbbg gcagtacatc aagtgtatca hatgccaagt acgcccccta bbgacgbcaa 600 
bgacggbaaa tggcccgcct ggcattatgc ccagtacabg accttatggg actbtccbac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagbcbcc accccattga 7 80 
cgbcaabggg agtbbgbbtb ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca bbgacgcaaa bgggcggbag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgbc agatcgcctg gagacgccat ccacgctgtt btgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcabbg gaacgcggat 1020 
tccccgtgcc aagagtgacg baagtaccgc ctatagactc babaggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccbbabgcba 1140 
taggtgatgg batagcttag ccbabaggbg bgggbbatbg accabbabbg accacbcccc 1200 
babbggtgac gatactbbcc abtacbaabc cabaacabgg ctctbbgcca caacbabcbc 1260 
babbggctab abgccaatac tcbgtcctbc agagacbgac acggacbctg tabttbtaca 1320 
ggabggggbc ccabbtabta bbtacaaabb cacababaca acaacgccgb cccccgbgcc 13 80 
cgcagbbtbb abbaaacata gcgbgggabc tccacgcgaa bcbcgggbac gbgttccgga 1440 
cabgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggtc ccabgccbcc 1500 
agcggctcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgbcb 162 0 
gaaaabgagc gbggagabtg ggcbcgcacg gcbgacgcag abggaagact baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgtb gbabbcbgat aagagbcaga ggbaactccc 1740 
gbbgcggbgc bgbbaacggb ggagggcagt gbagbcbgag cagbacbcgb bgcbgccgcg 1800 
cgcgccacca gacabaabag cbgacagacb aacagacbgt bccbbbccab gggbcbbtbc 1860 
bgcagbcacc gbcgbcgaca cgbgbgabca gafcabcgcgg ccgctcbaga gababcgcca 1920 
ccabgcgbgb tcgbggtabc ccgcgbaacb ggccgcagtg gbggabgbgg ggbabccbgg 1980 
gbbbcbggab gabcabcabc bgccgbgbbg bbggbaacab gbgggtbacc gbbbacbacg 2040 
gbgbbccggb bbggaccgac gcbaaaacca cccbgbbcbg cgcbbccgac accaaagccb 2100 
acgaccgbga agtbcacaac gbbbgggcba cccacgcbbg cgbbccgacc gacccgaacc 2160 
cgcaggaaab cgbtcbggaa aacgbbaccg aaaacbbcaa cabgtggaaa aacgacabgg 2220 
bbgaccagab gcacgaagac abcatcbccc bgtgggacca gtcccbgaaa ccgbgcgtba 2280 
aacbgacccc gcbgbgcgbb acccbgcact gcaccaacgc bagccgbaac bgcbccttca 2340 
acaccaccac cgaaatccgb gacaaaaaac agcagggbba cgcbcbgbbc baccgtccgg 24 00 
acabcgbbcb gcbgaaagaa aaccgbaaca acbccaacaa cbccgaabac abccbgabca 2460 
acbgcaacgc bbccaccabc acccaggcbt gcccgaaagb baacbbcgac ccgabcccga 2520 
bccacbacbg cgcbccggcb ggtbacgcba bccbgaaabg caacaacaaa accbbcbccg 2580 
gbaaaggtcc gbgcaacaac gbbtccaccg btcagbgcac ccabggbabc aaaccggbbg 2640 
bbbccaccca gcbgcbgcbg aacggbbccc bggcbgaaaa agaaabcabc abccgtbccg 2700 
aaaaccbgac cgacaacgbb aaaaccabca bcgtbcaccb gaacaaatcc gtbgaaabcg 2760 
bbbgcacccg bgcbagcgcb bacbgcaaca bcbccggbtc caaatggaac gaaacccbga 2820 
aacgbgbbaa agaaaaacbg caggaaaacb acaacaacaa caaaaccabc aaabbcgcbc 2880 
cgbccbccgg tggbgaccbg gaaabcacca cccacbccbb caacbgccgb ggbgaabbcb 2940 
bcbacbgcaa caccgctagc cbgccgbgcc gbatcaaaca gatcabcaac abgtggcagg 3000 
gbgbbggtcg bgcbabgbac gctccgccga bcgcbggbaa cabcaccbgc aaabccaaca 3060 
bcaccggbcb gcbgcbggbb cgbgacggbg gbgaagacaa caaaaccgaa gaaabcbbcc 3120 
gbccgggbgg bggbaacabg aaagacaacb ggcgbbccga acbgbacaaa bacaaagbba 3180 
bcgaacbgaa accgcbgggb abcgctccga ccggbgcbaa gcbbaccgbb caggcbcgbc 3240 
agctgcbgbc cbccabcgbt cagcagcagb' ccaacctgcb gcgbgcbatc gaagcbcagc 3300 
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agcacahgcb gcagctgacc gbbbggggta tcaaacagct gcagacccgt gbbcbggcba 3360 
bcgaacgtba cctgaaagac cagcagctcg agatctggaa caacafcgacc bggatggaab 3420 
gggaccgbga aahctccaac tacaccgaca ccabcbaccg tcbgctggaa gacbcccaga 3480 
cccagcagga aaaaaacgaa aaagacctgc tggcbctgga cbccbggaaa aacctgbggt 3540 
cctggttcga catctccaac tggcbgbggt acatcaaaat cttcabcatg atcgttggbg 3600 
gtcbgabcgg bctgcgbabc abcbbcgctg bbctgtccat cbgaggabcc agatcbgcbg 3660 
tgccbbcbag ttgccagcca bctgbbgbbt gcccctcccc cgbgccbtcc tbgaccctgg 3720 
aa 99tgccac tcccacbgtc ctbtccbaab aaaabgagga aattgcabcg cattgbctga 3780 
gtaggtgtca btcbattcbg gggggbgggg tggggcagga cagcaagggg gaggattggg 384 0 
aagacaatag caggcatgct ggggabgcgg tgggctctat gggbacccag gbgctgaaga 3900 
abtgacccgg tfcccbcctgg gccagaaaga agcaggcaca tccccttctc bgtgacacac 3960 
cctgbccacg cccctggttc btagbtccag ccccactcat aggacactca bagctcagga 4020 
gggctccgcc tbcaabccca cccgcbaaag tacttggagc ggtctctccc tcccbcatca 4080 
gcccaccaaa ccaaacctag ccbccaagag tgggaagaaa ttaaagcaag ataggctatb 4140 
aagtgcagag ggagagaaaa bgcctccaac atgtgaggaa gbaatgagag aaabcataga 4200 
attbbaaggc catgatttaa ggccatcabg gccttaatct tccgcttccb cgcbcactga 4260 
cbcgctgcgc bcggtcgttc ggcbgcggcg agcggbatca gctcactcaa aggcggtaat 4320 
acggtbatcc acagaabcag gggabaacgc aggaaagaac atgtgagcaa aaggccagca 4380 
aaaggccagg aaccgtaaaa aggccgcgbt gcbggcgbtt ttccataggc tccgcccccc 4440 
bgacgagcat cacaaaaatc gacgctcaag bcagaggtgg cgaaacccga caggacbata 45 00 
aagataccag gcgtttcccc ctggaagcbc cctcgtgcgc bcbcctgbbc cgacccbgcc 4560 
gcttaccgga tacctgtccg ccbttctccc ttcgggaagc gbggcgcbtt cbcabagcbc 4620 
acgctgtagg batctcagbt cggbgbaggfc cgtbcgctcc aagcbgggct gtgtgcacga 4680 
accccccgbt cagcccgacc gcbgcgccbb atccggtaac tatcgbcbtg agtccaaccc 4740 
ggbaagacac gacttabcgc cacbggcagc agccactggt aacaggatta gcagagcgag 4 800 
gbatgtaggc ggtgctacag agbbctbgaa gtggbggcct aacbacggct acacbagaag 4 860 
aacagbattb ggtatctgcg ctctgctgaa gccagttacc bbcggaaaaa gagttggbag 4920 
ctcbbgatcc ggcaaacaaa ccaccgctgg bagcggbggt ttttttgttfc gcaagcagca 4 980 
gabtacgcgc agaaaaaaag gabctcaaga agabcctbbg abctbbbcta cggggtctga 504 0 
cgctcagbgg aacgaaaact cacgbtaagg gabttbggbc atgagabbat caaaaaggab 5100 
cbbcaccbag abccbttbaa abbaaaaabg aagtttbaaa tcaatcbaaa gtabababga 5160 
gbaaacbbgg bctgacagtb accaabgcbb aabcagbgag gcacctabct cagcgatcbg 5220 
bcbabttcgb tcabccabag btgccbgact cggggggggg gggcgctgag gtcbgccbcg 52 80 
bgaagaaggt gtbgcbgacb cataccaggc cbgaatcgcc ccabcabcca gccagaaagt 5340 
gagggagcca cggbtgabga gagctbbgtb gbaggtggac cagbbggbga bbbtgaactb 540 0 
ttgctbbgcc acggaacggb ctgcgtbgbc gggaagabgc gtgabctgab ccttcaacbc 5460 
agcaaaagtt cgattbabtc aacaaagccg ccgtcccgtc aagbcagcgb aabgcbcbgc 552 0 
cagtgttaca accaabbaac caabbcbgab bagaaaaact cabcgagcab caaatgaaac 5580 
bgcaattbab tcatatcagg abbabcaata ccatatbtbt gaaaaagccg btbctgbaab 5640 
gaaggagaaa acbcaccgag gcagbtccab aggatggcaa gatccbggta bcggtcbgcg 5700 
abbccgacbc gtccaacatc aabacaaccb abbaabtbcc ccbcgbcaaa aabaaggtba 5760 
bcaagbgaga aatcaccabg agbgacgacb gaabccggbg agaabggcaa aagcttatgc 5820 
abbtctbbcc agacbbgtbc aacaggccag ccabbacgcb cgbcabcaaa abcactcgca 5880 
bcaaccaaac cgttabbcab tcgtgabbgc gcctgagcga gacgaaabac gcgabcgcbg 5940 
bbaaaaggac aatbacaaac aggaabcgaa tgcaaccggc gcaggaacac bgccagcgca 6000 
bcaacaabab bttcacctga atcaggabab bcbbcbaaba ccbggaatgc bgbbbbcccg 6060 
gggabcgcag bggbgagtaa ccabgcabca bcaggagbac ggabaaaabg cbbgabggtc 6120 
ggaagaggca baaabbccgb cagccagtbb agbctgacca tcbcatctgt aacabcabbg 6180 
gcaacgcbac cbbbgccabg bbtcagaaac aacbcbggcg cabcgggctb cccabacaab 6240 
cgabagabtg bcgcacctga btgcccgaca bbatcgcgag cccabbbaba cccababaaa 6300 
bcagcabcca bgttggaabt taatcgcggc cbcgagcaag acgbbbcccg tbgaababgg 6360 
cbcabaacac cccttgtabt acbgbtbabg baagcagaca gbbbtabbgb tcabgabgab 6420 
ababbtbtab cttgbgcaab gbaacabcag agattbbgag acacaacgbg gcbbbccccc 6480 
cccccccatb abtgaagcab bbabcagggt babbgbcbca bgagcggaba cababtbgaa 6540 
bgtattbaga aaaabaaaca aabaggggbt ccgcgcacat bbccccgaaa agbgccacct 6600 
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gacgtctaag aaaccattat tatcatgaca ttaacchata aaaataggcg tatcacgagg 6660 
ccctttcgtc SS10 

<210> 155 
<211> 6547 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 155 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgccfc ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatfca tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc I860 
tgcagtcacc gtcgtcgaca cgtgtgatca gatatcgcgg ccgctctaga gatatcgcca 1920 
ccatgcgtgt tcgtggtatc ccgcgtaact ggccgcagtg gtggatgtgg ggtatcctgg 1980 
gtttctggat gatcatcatc tgccgtgttg ttggtaacat gtgggttacc gtttactacg 2040 
gtgttccggt ttggaccgac gctaaaacca ccctgttctg cgcttccgac accaaagcct 2100 
acgaccgtga agfctcacaac gtttgggcta cccacgcttg cgttccgacc gacccgaacc 2160 
cgcaggaaat cgttctggaa aacgttaccg aaaacttcaa catgtggaaa aacgacatgg 222 0 
ttgaccagat gcacgaagac atcatctccc tgtgggacca gtccctgaaa ccgtgcgtta 2280 
aactgacccc gctgtgcgtt accctgcact gcaccaacgc tagcatcaac tgcaacgctt 2340 
ccaccatcac ccaggcttgc ccgaaagtta acttcgaccc gatcccgatc cactactgcg 2400 
ctccggctgg ttacgctatc ctgaaatgca acaacaaaac cttctccggt aaaggtccgt 2460 
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gcaacaacgt ttccaccgtt cagtgcaccc 
bgctgctgaa cggttccctg gctgaaaaag 
acaacgttaa aaccatcatc gttcacctga 
cgaacaacaa cacccgbaaa bccatgcgta 
gtgacatcat cggtgacatc cgbcaggctt 
aaaccctgaa acgtgttaaa gaaaaacbgc 
aattcgctcc gtcctccggt ggtgacctgg 
gtgaatbctb ctactgcaac accgctagcc 
tgtggcaggg tgttggbcgb gcbabgtacg 
aatccaacat caccggtcbg ctgctggttc 
aaatcttccg tccgggtggt ggtaacatga 
acaaagttat cgaactgaaa ccgcbgggta 
aggctcgtca gctgctgtcc tccatcgttc 
aagctcagca gcacatgctg cagctgaccg 
ttctggctat cgaacgttac ctgaaagacc 
ggatggaatg ggaccgtgaa atcbccaact 
actcccagac ccagcaggaa aaaaacgaaa 
acctgtggtc ctggttcgac atctccaact 
cfctctagttg ccagccatct gttgtttgcc 
gtgccactcc cactgtcctt tcctaataaa 
ggtgtcattc tattctgggg ggtggggtgg 
acaatagcag gcatgctggg gatgcggtgg 
gacccggttc ctcctgggcc agaaagaagc 
gtccacgccc ctggttctta gttccagccc 
ctccgccttc aatcccaccc gctaaagtac 
caccaaacca aacctagcct ccaagagtgg 
tgcagaggga gagaaaatgc ctccaacatg 
ttaaggccat gatttaaggc catcatggcc 
gctgcgctcg gtcgttcggc tgcggcgagc 
gttatccaca gaatcagggg ataacgcagg 
ggccaggaac cgtaaaaagg ccgcgttgct 
cgagcatcac aaaaatcgac gctcaagtca 
ataccaggcg tttccccctg gaagctccct 
taccggatac cbgbccgcct ttctcccttc 
ctgtaggtat ctcagttcgg tgtaggtcgt 
ccccgttcag cccgaccgct gcgccttatc 
aagacacgac ttatcgccac tggcagcagc 
tgtaggcggt gctacagagt tcttgaagtg 
agtabttggt atctgcgctc tgctgaagcc 
tbgatccggc aaacaaacca ccgcbggtag 
tacgcgcaga aaaaaaggat cbcaagaaga 
tcagtggaac gaaaactcac gttaagggat 
cacctagatc cbtttaaatt aaaaabgaag 
aacttggtct gacagttacc aatgcttaat 
atttcgbtca tccatagbtg cctgactcgg 
agaaggtgtt gctgactcat accaggcctg 
ggagccacgg ttgatgagag ctttgttgba 
ctttgccacg gaacggbctg cgttgtcggg 
aaaagttcga tttattcaac aaagccgccg 
tgttacaacc aattaaccaa ttctgattag 
aatttabtca babcaggatt atcaatacca 
ggagaaaacb caccgaggca gttccabagg 
ccgactcgbc caacatcaab acaaccbatt 
agtgagaaat caccatgagt gacgactgaa 
tcbbtccaga cbtgtbcaac aggccagcca 



abggbatcaa accggttgtt tccacccagc 2520 
aaatcatcat ccgttccgaa aacctgaccg 2580 
acaaatccgb bgaaatcgtt bgcacccgtc 2640 
bcggtccggg tcagaccbbc bacgctaccg 2700 
actgcaacab cbccggtbcc aaatggaacg 2760 
aggaaaacba caacaacaac aaaaccabca 2 820 
aaatcaccac ccactccttc aactgccgtg 2880 
bgccgbgccg tatcaaacag abcatcaaca 2 940 
ctccgccgat cgcbggbaac atcacctgca 3 000 
gtgacggtgg tgaagacaac aaaaccgaag 3 060 
aagacaactg gcgbbccgaa ctgtacaaat 3120 
tcgcbccgac cggbgctaag cttaccgbtc 3180 
agcagcagtc caacctgcbg cgtgcbabcg 3240 
bbtggggtat caaacagctg cagacccgtg 3300 
agcagctcga gatctggaac aacatgacct 3360 
acaccgacac catctaccgt ctgctggaag 3420 
aagacctgct ggcbctggac bcctggaaaa 3480 
ggctgtggbg aggabccaga tctgctgbgc 3540 
cctcccccgt gccbtccttg accctggaag 3600 
atgaggaaat bgcabcgcat tgtcbgagba 3660 
ggcaggacag caagggggag gattgggaag 3720 
gctctatggg tacccaggtg ctgaagaatt 3780 
aggcacatcc ccttctctgt gacacaccct 3840 
cactcatagg acactcatag ctcaggaggg 3900 
ttggagcggt ctcbcccbcc cbcatcagcc 3960 
gaagaaatta aagcaagaba ggctattaag 4020 
tgaggaagta atgagagaaa bcatagaatt 4 080 
ttaatcttcc gcbbcctcgc tcactgactc 4140 
ggtatcagcb cacbcaaagg cggtaatacg 4200 
aaagaacabg tgagcaaaag gccagcaaaa 42 60 
ggcgtttttc cataggcbcc gccccccbga 4320 
gaggtggcga aacccgacag gactataaag 4380 
cgtgcgctct cctgttccga ccctgccgct 4440 
gggaagcgtg gcgcttbctc abagctcacg 4500 
tcgctccaag ctgggctgbg bgcacgaacc 4560 
cggtaactat cgtcttgagt ccaacccggt 4620 
cactggbaac aggattagca gagcgaggba 4 680 
gtggccbaac tacggcbaca ctagaagaac 4740 
agttaccttc ggaaaaagag ttggtagctc 4800 
cggtggbbbb btbgbbbgca agcagcagat 4860 
bccttbgabc bbbtctacgg ggbcbgacgc 4920 
bttggbcatg agattatcaa aaaggatctt 4 980 
tbttaaatca atctaaagba batatgagba 5040 
cagtgaggca ccbabctcag cgatcbgtct 5100 
gggggggggg cgctgaggbc tgcctcgtga 5160 
aatcgcccca bcabccagcc agaaagbgag 5220 
ggtggaccag bbggbgabbt bgaacbbtbg 5280 
aagatgcgtg atctgatcct tcaactcagc 5340 
tcccgbcaag tcagcgbaab gctctgccag 5400 
aaaaactcab cgagcatcaa abgaaactgc 5460 
tattttbgaa aaagccgbtt ctgtaatgaa 5520 
abggcaagab ccbggtabcg gtctgcgabt 5580 
aabttccccb cgbcaaaaab aaggttatca 5640 
tccggtgaga abggcaaaag cbtatgcabt 5700 
btacgctcgb cabcaaaabc acbcgcatca 5760 
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gaaatacgcg atcgctgtta 5820 
ggaacactgc cagcgcatca 5880 
ggaatgctgt bbbcccgggg 5940 
taaaatgcht gabggbcgga 6000 
catctgtaac atcattggca 6060 
cgggcttccc atacaatcga 6120 
atttataccc atataaatca 6180 
tttcccgttg aatatggctc 6240 
ttattgttca tgatgatata 6300 
caacgtggct ttcccccccc 6360 
gcggabacab atttgaatgt 6420 
cccgaaaagt gccacctgac 64 80 
ataggcgbat cacgaggccc 6540 

6547 
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ccaaaccgb babtcabtcg bgatbgcgcc tgagcgagac 

aaggacaat tacaaacagg aatcgaatgc aaccggcgca 

caatatttt cacctgaatc aggabatbct tctaatacct 

bcgcagtgg tgagtaacca tgcatcatca ggagtacgga 

gaggcataa attccgtcag ccagtthagt ctgaccatct 

cgcbaccbb tgccatgttt cagaaacaac bctggcgcab 

agabbgbcg cacctgattg cccgacabba tcgcgagccc 

catccatgt tggaatttaa tcgcggcctc gagcaagacg 

taacacccc btgtabtact gbtbatgtaa gcagacagtt 

bbbbatcbb gtgcaabgta acatcagaga btbbgagaca 

cccattabt gaagcabtba bcagggbbab bgtcbcabga 

tbbagaaaa abaaacaaab aggggbbccg cgcacabbbc 

bcbaagaaa ccattatbat cafcgacatta accbataaaa 



<210> 156 
<211> 6679 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 156 

bcgcgcgtbb cggbgatgac ggbgaaaacc tctgacacat gcagctcccg gagacggbca 60 
cagcbbgbcb gtaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggtg 120 
ttggcgggbg tcggggctgg ctbaacbabg cggcatcaga gcagabbgba cbgagagbgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgbaaggag aaaataccgc atcagabbgg 24 0 
ctabbggcca ttgcatacgt tgbatccata tcabaababg bacatbbaba ttggcbcabg 300 
bccaacabba ccgccabgbt gacabbgatt atbgacbagt tabbaabagb aatcaabtac 360 
ggggbcabba gttcatagcc cabatabgga gtbccgcgtt acataacbta cggtaaatgg 420 
cccgcctggc bgaccgccca acgacccccg cccabbgacg tcaabaatga cgtabgbtcc 480 
cabagtaacg ccaataggga ctbbccabtg acgbcaatgg gbggagbatt bacggtaaac 540 
bgcccacbtg gcagbacabc aagbgbatca babgccaagt acgcccccta btgacgbcaa 600 
bgacggtaaa bggcccgcct ggcattabgc ccagtacatg accttatggg acbttccbac 660 
tbggcagbac abctacgbat bagbcatcgc tabtaccatg gtgatgcggb bttggcagba 720 
cabcaabggg cgbggabagc ggbbbgactc acggggabtt ccaagbctcc accccatbga 780 
cgbcaatggg agbbbgbbbt ggcaccaaaa tcaacgggac btbccaaaab gbcgbaacaa 840 
cbccgcccca btgacgcaaa bgggcggbag gcgtgbacgg bgggaggtcb abataagcag 900 
agcbcgbtta gtgaaccgtc agabcgccbg gagacgccab ccacgcbgbb ttgacctcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggab 1020 
bccccgtgcc aagagtgacg baagbaccgc ctatagactc bataggcaca ccccttbggc 1080 
bcttabgcab gcbabactgt bbtbggcbbg gggccbatac acccccgctb cctbatgcta 1140 
baggtgabgg batagctbag cctataggbg bgggbtabbg accabtabbg accactcccc 1200 
babtggtgac gabactbtcc abtacbaatc cabaacatgg cbcbbtgcca caactabctc 1260 
babtggctab atgccaatac bcbgtcctbc agagacbgac acggacbctg babbtbbaca 1320 
99 a kggggtc ccabbbatta bttacaaatt cacatataca acaacgccgt cccccgbgcc 1380 
cgcagttbbb abbaaacata gcgbgggatc tccacgcgaa tcbcgggtac gbgttccgga 144 0 
cabgggcbcb tcbccggbag cggcggagct bccacabccg agcccbggtc ccatgccbcc 15 00 
agcggcbcab ggbcgctcgg cagctccbbg ctcctaacag tggaggccag actbaggcac 1560 
agcacaabgc ccaccaccac cagtgbgccg cacaaggccg bggcggbagg gtabgbgbcb 1620 
gaaaabgagc gtggagabtg ggctcgcacg gcbgacgcag atggaagact taaggcagcg 1680 
gcagaagaag abgcaggcag ctgagtbgbt gbabbcbgat aagagtcaga ggbaacbccc 1740 
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gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagbcacc gfccgtcgaca cgtgtgatca 
ccatgcgtgt tcgtggtatc ccgcgtaact 
gttbcbggab gatcatcatc tgccgtgttg 
gtgtbccggt ttggaccgac gctaaaacca 
acgaccgtga agttcacaac gtttgggcta 
cgcaggaaat cgttcbggaa aacgtbaccg 
bbgaccagab gcacgaagac abcabcfcccc 
aactgacccc gcbgbgcgtb accctgcacb 
acaccaccac cgaaatccgt gacaaaaaac 
acabcgttcb gctgaaagaa aaccgtaaca 
achgcaacgc ttccaccatc acccaggcbb 
bccacbacbg cgcbccggcb ggbbacgcba 
gtaaaggtcc gtgcaacaac gtbbccaccg 
bttccaccca gctgctgctg aacggbbccc 
aaaaccbgac cgacaacgtt aaaaccatca 
tttgcacccg tccgaacaac aacacccgta 
tctacgctac cggtgacabc atcggtgaca 
ccaaabggaa cgaaaccctg aaacgtgtta 
acaaaaccat caaattcgct ccgtcctccg 
tcaacbgccg tggtgaattc ttctactgca 
agatcatcaa catgtggcag ggtgttggtc 
acatcacctg caaatccaac atcaccggtc 
acaaaaccga agaaatcttc cgbccgggtg 
aactgtacaa atacaaaght atcgaactga 
agcbbaccgb bcaggcbcgb cagctgctgt 
tgcgtgctat cgaagcbcag cagcacatgc 
tgcagacccg tgttctggct atcgaacgtt 
acaacatgac ctggatggaa tgggaccgtg 
gtctgctgga agactcccag acccagcagg 
actccbggaa aaaccbgtgg tccbggbbcg 
gatctgctgt gccttctagt bgccagccab 
tgaccctgga aggtgccact cccactgtcc 
attgtctgag baggbgbcab tctattctgg 
aggattggga agacaatagc aggcatgctg 
tgctgaagaa ttgacccggt tcctcctggg 
gtgacacacc ctgtccacgc ccctggttct 
agctcaggag ggctccgcct bcaabcccac 
ccctcatcag cccaccaaac caaacctagc 
taggctatta agtgcagagg gagagaaaat 
aatcabagaa ttttaaggcc atgatttaag 
gctcactgac tcgctgcgct cggtcgbbcg 
ggcggtaata cggttatcca cagaatcagg 
aggccagcaa aaggccagga accgtaaaaa 
ccgcccccct gacgagcatc acaaaaatcg 
aggactataa agataccagg cgtttccccc 
gaccctgccg cbbaccggab acctgtccgc 
bcatagcbca cgctgtaggt abcbcagbbc 
tgtgcacgaa ccccccgbtc agcccgaccg 
gtccaacccg gtaagacacg actfcatcgcc 
cagagcgagg tatgtaggcg gtgctacaga 
cactagaaga acagtatttg gtatctgcgc 
agtfcggtagc tcbtgatccg gcaaacaaac 
caagcagcag attacgcgca gaaaaaaagg 



gtagtctgag cagtactcgt tgctgccgcg 1800 
aacagactgt bccbbbccat gggbcbbbtc 1860 
gatatcgcgg ccgcbcbaga gatatcgcca 1920 
ggccgcagbg gbggabgtgg ggtatcctgg 1980 
ttggtaacat gtgggttacc gtttactacg 2040 
ccctgttctg cgcttccgac accaaagcct 2100 
cccacgcttg cgbtccgacc gacccgaacc 2160 
aaaacbbcaa cabgtggaaa aacgacabgg 2220 
bgtgggacca gbcccbgaaa ccgbgcgbba 2280 
gcaccaacgc bagccgtaac tgcbccbbca 2340 
agcagggtba cgctcbgbbc baccgbccgg 2400 
acbccaacaa ctccgaatac abcctgabca 2460 
gcccgaaagb baactbcgac ccgatcccga 2520 
tccbgaaatg caacaacaaa acctbcbccg 2580 
bbcagtgcac ccabggtabc aaaccggbtg 2640 
tggcbgaaaa agaaabcabc abccgtbccg 2700 
bcgbtcaccb gaacaaabcc gbbgaaatcg 2760 
aatccabgcg tatcggtccg ggtcagacct 2820 
bccgtcaggc bbactgcaac atctccggbb 2880 
aagaaaaacb gcaggaaaac bacaacaaca 294 0 
gtggbgaccb ggaaabcacc acccactccb 3000 
acaccgcbag cctgccgtgc cgbabcaaac 3060 
gbgctabgba cgcbccgccg abcgcbggba 3120 
tgcbgcbggt bcgbgacggb ggbgaagaca 3180 
gtggbaacab gaaagacaac bggcgbtccg 3240 
aaccgcbggg babcgcbccg accggbgcba 3300 
ccbccatcgb bcagcagcag bccaaccbgc 3360 
tgcagcbgac cgbbbggggb abcaaacagc 3420 
accbgaaaga ccagcagcbc gagabctgga 3480 
aaabcbccaa ctacaccgac accatcbacc 3540 
aaaaaaacga aaaagacctg cbggcbcbgg 3600 
acabcbccaa ctggcbgbgg bgaggabcca 3660 
cbgbtgbbbg ccccbccccc gbgccbbccb 3720 
btbccbaata aaabgaggaa abbgcatcgc 3780 
ggggtggggt ggggcaggac agcaaggggg 3 84 0 
gggabgcggt gggcbcbatg ggbacccagg 3 900 
ccagaaagaa gcaggcacab ccccbbcbct 3960 
bagbbccagc cccacbcata ggacacbcab 4020 
ccgcbaaagb acbtggagcg gtcbctcccb 4080 
cbccaagagb gggaagaaat baaagcaaga 414 0 
gcctccaaca bgbgaggaag baabgagaga 4200 
gccabcabgg ccbbaatctt ccgcbtcctc 4260 
gcbgcggcga gcggbabcag cbcacbcaaa 4 320 
ggabaacgca ggaaagaaca bgbgagcaaa 4380 
ggccgcgbbg ctggcgbbbb bccabaggcb 444 0 
acgcbcaagb cagaggtggc gaaacccgac 4500 
bggaagcbcc cbcgbgcgcb cbccbgtbcc 4560 
cbbbctcccb bcgggaagcg bggcgcbbtc 4620 
ggbgbaggbc gbbcgcbcca agcbgggctg 4680 
ctgcgccbba tccggbaacb abcgbcbtga 4740 
acbggcagca gccacbggba acaggabbag 4800 
gbbcbbgaag bggbggccba actacggcba 4860 
bcbgcbgaag ccagbbaccb bcggaaaaag 4920 
caccgcbggb agcggbggbt bbbbtgbbtg 4980 
atcbcaagaa gabccbbbga bcbbtbcbac 5040 
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ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc 5100 

aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag 5160 

tatatatgag taaacttggt ctgacagtta ccaabgctta atcagtgagg cacctatctc 5220 

agcgatctgt ctatttcgtt catccatagt tgcctgactc gggggggggg ggcgctgagg 5280 

tcbgcctcgt gaagaaggtg ttgctgactc ataccaggcc tgaatcgccc catcatccag 5340 

ccagaaagtg agggagccac ggttgatgag agctttgttg taggtggacc agttggtgat 54 00 

tttgaacttt tgctttgcca cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc 5460 

cttcaactca gcaaaagttc gatttattca acaaagccgc cgtcccgtca agtcagcgta 5520 

atgctctgcc agtgttacaa ccaattaacc aattctgatt agaaaaactc atcgagcatc 5580 

aaatgaaact gcaatttatt catatcagga ttatcaatac catatttttg aaaaagccgt 5640 

ttctgtaatg aaggagaaaa ctcaccgagg cagttccata ggatggcaag atcctggtat 5700 

cggtctgcga ttccgactcg tccaacatca atacaaccta ttaatttccc ctcgtcaaaa 5760 

ataaggttat caagbgagaa atcaccatga gtgacgactg aatccggtga gaatggcaaa 5820 

agcttatgca tttctttcca gacttgttca acaggccagc cattacgctc gtcatcaaaa 5880 

tcactcgcat caaccaaacc gttattcatt cgtgattgcg cctgagcgag acgaaatacg 5940 

cgatcgctgt taaaaggaca attacaaaca ggaatcgaat gcaaccggcg caggaacact 6000 

gccagcgcat caacaatatt ttcacctgaa tcaggatatt cttctaatac ctggaatgct 6060 

gttttcccgg ggatcgcagt ggtgagtaac catgcatcat caggagtacg gataaaatgc 6120 

ttgatggtcg gaagaggcat aaattccgtc agccagttta gtctgaccat ctcatctgta 6180 

acatcahtgg caacgctacc tttgccatgt ttcagaaaca actctggcgc atcgggcttc 6240 

ccatacaatc gatagattgt cgcacctgat tgcccgacat tatcgcgagc ccatttatac 63 00 

ccatataaat cagcatccat gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt 63 60 

tgaatatggc tcataacacc ccttgtatta ctgtttatgt aagcagacag ttttattgtt 6420 

catgatgata tatttttatc ttgtgcaatg taacatcaga gattttgaga cacaacgtgg 6480 

cfcttcccccc ccccccatta fctgaagcatt tatcagggtt attgtctcat gagcggatac 6540 

atatttgaat gtatttagaa aaataaacaa ahaggggttc cgcgcacatt tccccgaaaa 6600 

gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt 6660 

atcacgaggc cctttcgtc 6679 



<210> 157 
<211> 6438 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 157 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcght acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccatcgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 54 0 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccatbga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
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agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ^acctcca 960 
Soaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 9"°9Wfc 1020 
tccccgtg=c aaglgtgacg taagtaccgc ctatagactc tataggcaca ccccttbggc 1080 
Stt.Sgcat gctatactgt ttttggcttg gggcctatac acccccg ctt 1140 
taggtgatgg t.tagctt^ cctataggtg gggttatt accatt g ^ 

== S5 Ss =s= =3 <™ c ss 
S3ss ™s sss;:tc ^ 

Jtictct tctccggtag cggcggagct tccacatccg agcoctgg c catg ctcc IS^ 
agcggctcat ^tegc^g cage t g c cot acag tggagg^J 

ageacaatge c «ccaccac "ctcgcaci getgaegcag atggaagact taaggcagcg 1680 
gaaaatgagc gtggagattg 99Ctcgcacg * * aaqagtcaga ggtaactccc 1740 

EES 5552 SSS25 ? ^ .™ 
35= E£S= 2=S 1 ^ :r - - 

ccgctgggag aagatccgcc tgcgccccgg ^gcaagaag a040 
cgtgtgggcc agccgcgagc tggagege cgc eg gaac ccegg ^99^ ^ 

egagggctge cgccagatcc tgggecaget 99 * gtgcaccagc gcatcgagat 2160 

S3SS ?£2Z iS— ■!■ -" 9 * a9 , t ™ 

g^U : „ 9 „„ 

cgtgcag.ac atcc»g 9 gcc »9>'ffi£sca =™99C 3 tgtlcagcgc 2400 

"f 9 ' 9 "1 : 9 c" 9 gs ,=c gSScSS Itgaacacc, t 999 eggc=a 2.60 

OCt9 * 9 ° 9 ; 9 !? C9 ! C at C c tgaSgagac c.tca.cgag gaggccgccg agtgggaccg 2520 
r 9 *t 9 g 9 c"S 5SSS c =9 2.S0 

ess «™ S2sr. r,°s 

-™ g™4 £. 

egactaegtg gaccgctfcct acaagaccct 9 c 9cgccg y agaccatcct 2880 

9a ,ct 9 g.tg accg.g.cec tgc gg gea g a.c ^ 
qaaggccctg ggccccgccg ccaccui-yya ^ ^ rraacaacac 3000 

"ggfcccggc cacaaggccc gcgtgctggc egaggecatg agccaggtga ccaacagcgc ^ 

caccatcatg ctgccgcgcc ccccgcaaga agggctgetg 3120 

ctgcggcaag gagggecaca ccgcccgcaa g a s cgacagg ctaatttttt 3180 

gaagfcgcggc aaggagggee accagatgaa ggactgcacc gagega gg 
agggaagatc tggccttccc acaagggaag 9-agggaat "tcttcaga gcag ccag^ ^ 
gccaacagcc ccaccagaag ^gcttcag jtttggggaa W« cactctttgg 3360 

5ss ssss 2sss ssx. ^g^ -™ 3420 

gc^ggatc cagatctget gtgecttet :a jt^oge tgcccc ccc 3480 

ccgtgccttc «tbg«cctg 9-ggtgcc -t^ g gtggggcagg 3600 

rcagcaaggg ggaggfttgg gfagacaata geaggcatge tggggatgcg gtgggctcta 3660 
toggtaccca ggSctgaag aa t ?gacccg gttccbcctg ggccagaaag aagcaggcac 3720 
Sccctt" fJgaLca ccctgtccac gcccctggtt cttagt cca gccccjctc. 

a^agrcSt S^aga gggagagaaa .tjcctccj. ^tg^ -60 

2SS5E 5SS= ss=s ™ :s 

55S= 5S= =5 S=K SSS cc 9 ai9C t c 4200 
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cgcccccctg acgagcabca caaaaatcga 
ggacbabaaa gataccaggc gbbbcccccb 
acccbgccgc ttaccggata cctgtccgcc 

catagctcac gctgtaggta bctcagbbcg 

gtgcacgaac cccccgfctca gcccgaccgc 
tccaacccgg taagacacga cbtabcgcca 

agagcgaggt atgtaggcgg tgctacagag 

acbagaagaa cagtatttgg batcbgcgct 

gbbggbagcb cbtgabccgg caaacaaacc 

aagcagcaga ttacgcgcag aaaaaaagga 

gggbcbgacg ctcaghggaa cgaaaactca 

aaaaggabcb tcacctagat ccttttaaat 

abababgagb aaacbbggbc tgacagttac 

gcgatctgtc babbbcgbbc atccatagtt 

ctgcctcgtg aagaaggbgb bgcbgactca 

cagaaagtga gggagccacg gttgatgaga 

bbgaacbbbb gctttgccac ggaacggtct 

ttcaactcag caaaagbbcg abbbabbcaa 

tgctctgcca gbgbbacaac caatbaacca 

aatgaaactg caatttattc atatcaggat 

tctgtaatga aggagaaaac tcaccgaggc 

ggtctgcgat tccgactcgt ccaacatcaa 

taaggttatc aagtgagaaa bcaccabgag 

gcbbabgcab ttctttccag acbbgbbcaa 

cactcgcatc aaccaaaccg ttattcattc 

gatcgctgtt aaaaggacaa ttacaaacag 

ccagcgcatc aacaababtb tcaccbgaab 

bbbtcccggg gabcgcagtg gtgagbaacc 

tgatggbcgg aagaggcata aabbccgbca 

catcabtggc aacgcbaccb tbgccabgbb 

cabacaatcg abagabbgbc gcacctgabb 

cababaaabc agcabccatg tbggaabbba 

gaatatggct cataacaccc cbbgbabtac 

abgabgabab abbtbtatct tgtgcaabgb 

bbbccccccc cccccatbat bgaagcabbb 

babbtgaabg tabbbagaaa aabaaacaaa 

bgccacctga cgbcbaagaa accabtabba 

bcacgaggcc ctbbcgbc 



cgcbcaagbc agaggbggcg aaacccgaca 4260 
ggaagcbccc tcgbgcgcbc tccbgbbccg 4320 
bbtcbcccbb cgggaagcgb ggcgctbtcb 4380 
gbgtaggbcg bbcgctccaa gcbgggctgt 4440 
bgcgccbtab ccggtaacba tcgbcbtgag 4500 
cbggcagcag ccacbggbaa caggabbagc 4560 
tbcbbgaagt ggbggccbaa ctacggcbac 4620 
cbgctgaagc cagbbaccbb cggaaaaaga 4680 
accgctggba gcggbggbbb bbbbgtbbgc 4740 
bctcaagaag abccbbtgab cttbbctacg 4800 
cgbbaaggga bbbbggbcab gagabbatca 4860 
baaaaabgaa gbbbtaaatc aabcbaaagb 4920 
caabgcbtaa bcagtgaggc acctabcbca 4980 
gccbgacbcg gggggggggg gcgctgaggt 5040 
baccaggccb gaabcgcccc atcatccagc 5100 
gcbbbgbbgb aggbggacca gbbggtgabb 5160 
gcgbbgtcgg gaagabgcgb gatcbgabcc 5220 
caaagccgcc gbcccgtcaa gbcagcgbaa 5280 
abbcbgabta gaaaaactca bcgagcabca 5340 
babcaatacc abatbbbbga aaaagccgbb 5400 
agbtccabag gatggcaaga bcctggbatc 5460 
bacaaccbab baabbbcccc bcgtcaaaaa 5520 
bgacgacbga abccggbgag aatggcaaaa 5580 
caggccagcc abtacgcbcg bcatcaaaab 5640 
gtgabbgcgc cbgagcgaga cgaaabacgc 5700 
gaabcgaatg caaccggcgc aggaacacbg 57 60 
caggababtc bbcbaabacc bggaatgcbg 5820 
abgcabcabc aggagbacgg ataaaabgcb 5880 
gccagbbbag bcbgaccabc bcabcbgbaa 5940 
bcagaaacaa cbcbggcgca tcgggcbbcc 6000 
gcccgacatb abcgcgagcc cabbbabacc 6060 
atcgcggccb cgagcaagac gttbcccgbb 6120 
bgbbbabgta agcagacagb bbtabbgbbc 6180 
aacabcagag abbbbgagac acaacgbggc 6240 
atcagggbba tbgbcbcabg agcggabaca 6300 
taggggtbcc gcgcacabbb ccccgaaaag 6360 
bcabgacabb aaccbabaaa aabaggcgba 6420 

6438 



<210> 158 
<211> 6486 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR!012x/s conbaining SIV genes 
<400> 158 

tcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacat gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbbaactabg cggcabcaga gcagabbgta cbgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc atcagabtgg 240 
cbabbggcca bbgcabacgb bgbabccaba bcabaababg bacabbbata btggcbcabg 300 
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bccaacatba ccgccatgtt gacahtgatb abbgacbagb babbaabagb aatcaattac 360 
ggggbcabba gbtcabagcc. cabatatgga gttccgcgtt acataactta cggbaaabgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggaghatt tacggbaaac 540 
tgcccacttg gcagtacatc aagtgbabca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa bggcccgccb ggcabbabgc ccagtacatg accbtatggg actttcchac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gbgabgcggb tbbggcagba 720 
catcaabggg cgtggatagc ggtttgactc acggggabtt ccaagtctcc accccabtga 780 
cgtcaatggg agtttgttbt ggcaccaaaa tcaacgggac bbbccaaaab gbcgtaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb atabaagcag 900 
agcbcgbbba gbgaaccgtc agabcgccbg gagacgccafc ccacgctgbt bbgaccbcca 960 
bagaagacac cgggaccgat ccagcctccg cggccgggaa cggbgcatbg gaacgcggat 1020 
bccccgbgcc aagagtgacg baagbaccgc cbabagacbc babaggcaca ccccbbbggc 1080 
bctbabgcab gcbabacbgb btbbggcbbg gggcctabac acccccgcbb ccbbabgcba 1140 
baggbgabgg babagcbtag ccbabaggbg bgggbbabbg accabbabbg accacbcccc 1200 
babbggbgac gabactbbcc atbacbaabc cabaacabgg cbcbbbgcca caacbabcbc 1260 
battggcbafc atgccaabac bcbgbccbbc agagacbgac acggacbctg tattbbbaca 1320 
ggabggggbc ccabbtatba btbacaaatb cacababaca acaacgccgt cccccgbgcc 1380 
cgcagbbbbb atbaaacaba gcgtgggabc bccacgcgaa bcbcgggbac gbgbtccgga 1440 
cabgggctcb bcbccggtag cggcggagcb bccacabccg agcccbggbc ccabgcctcc 1500 
agcggcbcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgtcb 162 0 
gaaaabgagc gbggagatbg ggctcgcacg gcbgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagbbgbb gtattcbgat aagagbcaga ggbaactccc 1740 
gbtgcggtgc bgbbaacggb ggagggcagb gbagbcbgag cagbactcgb tgctgccgcg 1800 
cgcgccacca gacabaabag ctgacagacb aacagacbgb bccbtbccab gggbcbbtbc 1860 
bgcagbcacc gbcgtcgacg ccgccaccab gggcgbgcgc aacagcgbgc bgagcggcaa 192 0 
gaaggccgac gagcbggaga agabccgccb gcgcccgaac ggcaagaaga agbacabgcb 1980 
gaagcacgbg gbgtgggccg ccaacgagcb ggaccgcbbc ggccbggccg agagccbgct 2 04 0 
ggagaacaag gagggctgcc agaagabccb gagcgbgcbg gcgccgcbgg tgcccaccgg 2100 
cagcgagaac cbgaagagcc bgbacaacac cgbgtgcgbg atctggbgca tccacgccga 2160 
ggagaaggbg aagcacaccg aggaggccaa gcagabcgbg cagcgccacc tggbggbgga 2220 
gaccggcacc accgagacca bgcccaagac cagccgcccg accgcgccca gcagcggacg 22 80 
cggcggcaac bacccggbgc agcagabcgg cggcaacbac gbgcaccbgc cgcbgagccc 234 0 
gcgcacccbg aacgcctggg bgaagcbgab cgaggagaag aagbbcggcg ccgaggbggb 24 00 
gcccggcbbc caggccctga gcgagggcbg cacgcccbac gacabcaacc agabgctgaa 2460 
ctgcgbgggc gaccaccagg ccgccatgca gabcabccgc gacabcabca acgaggaggc 2520 
cgccgactgg gaccbgcagc acccgcagcc cgcgccgcag cagggccagc bgcgcgagcc 2580 
cagcggcagc gacabcgccg gcaccaccag cagcgtggac gagcagabcc agbggabgba 2640 
ccgccagcag aacccgabcc cggtgggcaa cabcbaccgc cgcbggabac agcbgggcct 2700 
gcagaagbgc gbgcgcatgb acaacccgac caacatcctg gacgbgaagc agggcccgaa 2760 
ggagccctbc cagagcbacg bggaccgcbb cbacaagagc cbgcgcgccg agcagaccga 2820 
cgccgccgtg aagaactgga tgacccagac ccbgctgatc cagaacgcca acccggacbg 2880 
caagcbggbg cbgaagggcc bgggcgbgaa cccgaccctg gaggagabgc bgaccgccbg 2940 
ccagggcgbg ggcggcccgg gccagaaggc ccgccbgabg gccgaggccc bgaaggaggc 3000 
ccbggcgccc gbgcccabcc cgtbcgcggc cgcccagcag cgcggcccgc gcaagcccab 3060 
caagbgctgg aacbgcggca aggagggcca cagcgcccgc cagbgccgcg cgccgcgccg 3120 
ccagggctgc bggaagbgcg gcaagabgga ccacgbgatg gccaagtgcc cggaccgcca 3180 
ggcgggbbtt bbaggccbtg gtccabgggg aaagaagccg cgcaabbtcc cgatggcbca 3240 
agbgcabcag gggcbgabgc caacbgcbcc gccagaggac ccagcbgbgg abcbgcbaaa 3300 
gaacbacabg cagbbgggca agcagcagag agaaaagcag agagaaagca gagagaagcc 3360 
btacaaggag gbgacagagg atbtgcbgca ccbcaabbcb cbcbtbggag gagaccagba 3420 
gbcacbgcab cgatcgaggg ccagcccgtg gaggbgcbtc bagaccaggc ccbggabcca 3480 
gabcbgctgb gccbbcbagb bgccagccab cbgbbgbbtg cccctccccc gbgccbbccb 3540 
bgacccbgga aggbgccacb cccacbgbcc tbbcctaaba aaabgaggaa abtgcabcgc 3600 



- 363 - 



WO 02/32943 



PCT/USQ1/25721 



abbgbcbgag taggtgtcat bcbabtctgg 
aggattggga agacaatagc aggcatgctg 
tgctgaagaa ttgacccggt tcctcctggg 
gtgacacacc ctgtccacgc ccctggttct 
agctcaggag ggctccgcct tcaatcccac 
ccctcatcag cccaccaaac caaacctagc 
taggctatta agtgcagagg gagagaaaat 
aabcabagaa bbbbaaggcc abcabggccb 
cbgcgcbcgg tcgttcggct gcggcgagcg 
tbatccacag aabcagggga taacgcagga 
gccaggaacc gtaaaaaggc cgcgttgctg 
gagcatcaca aaaatcgacg ctcaagtcag 
baccaggcgb ttccccctgg aagctccctc 
accggatacc tgtccgccbt bcbcccbbcg 
bgbaggtatc tcagtbcggt gbaggtcgtb 
cccgbbcagc ccgaccgcbg cgccbbatcc 
agacacgacb tabcgccacb ggcagcagcc 
gbaggcggbg cbacagagbb ctbgaagbgg 
gbabbtggba tcbgcgcbcb gcbgaagcca 
bgabccggca aacaaaccac cgcbggbagc 
acgcgcagaa aaaaaggabc tcaagaagab 
cagbggaacg aaaactcacg ttaagggatt 
accbagatcc bbttaaabta aaaatgaagb 
• acbtggtcbg acagbbacca abgcttaabc 
tbbcgbtcab ccabagbbgc cbgacbcggg 
gaaggbgbbg cbgactcata ccaggcctga 
gagccacggt bgatgagagc bbbgttgbag 
bbtgccacgg aacggtctgc gbtgtcggga 
aaagbbcgat bbabbcaaca aagccgccgb 
gbtacaacca atbaaccaab bctgatbaga 
abtbatbcab atcaggabta bcaataccat 
gagaaaacbc accgaggcag bbccabagga 
cgactcgbcc aacatcaaba caaccbabta 
gbgagaaabc accatgagtg acgactgaat 
cttbccagac bbgbtcaaca ggccagccab 
ccaaaccgbb atbcabbcgb gabbgcgccb 
aaggacaabb acaaacagga atcgaabgca 
caababbtbc accbgaabca ggababbcbb 
tcgcagbggb gagbaaccat gcabcabcag 
gaggcabaaa bbccgbcagc cagbbbagbc 
cgcbaccbbb gccabgbbbc agaaacaacb 
agabbgbcgc accbgabbgc ccgacabtab 
cabccabgbb ggaabbbaab cgcggccbcg 
baacaccccb bgtabtacbg btbabgtaag 
ttbtatcbbg bgcaabgtaa cabcagagab 
cccattabbg aagcabbbab cagggbbabb 
bbbagaaaaa baaacaaaba ggggbbccgc 
bcbaagaaac cabbabtabc atgacabbaa 
bbcgtc 



<210> 159 
<211> 9189 
<212> DNA 



9999tggggb ggggcaggac agcaaggggg 3660 
gggabgcggb gggcbcbabg ggtacccagg 3720 
ccagaaagaa gcaggcacab ccccbbcbcb 3780 
bagbbccagc cccactcaba ggacacbcab 3840 
ccgcbaaagt acbbggagcg gtcbcbcccb 3900 
ctccaagagb gggaagaaat baaagcaaga 3960 
gccbccaaca bgbgaggaag baabgagaga 4020 
baabcttccg cbbccbcgct cactgacbcg 4080 
gbabcagcbc acbcaaaggc ggbaabacgg 4140 
aagaacabgb gagcaaaagg ccagcaaaag 4200 
gcgbttbtcc ataggcbccg ccccccbgac 4260 
aggbggcgaa acccgacagg actataaaga 4320 
gbgcgcbcbc cbgbtccgac cctgccgcbt 4380 
ggaagcgbgg cgcbbbcbca bagcbcacgc 4440 
cgcbccaagc bgggcbgbgb gcacgaaccc 4500 
ggbaacbabc gbcbbgagbc caacccggba 4560 
acbggbaaca ggatbagcag agcgaggbab 4620 
bggccbaact acggcbacac bagaagaaca 4680 
gbbaccbtcg gaaaaagagb bggbagcbcb 4740 
ggbggtbbbb btgbbbgcaa gcagcagabt 4800 
ccbbbgatcb tbbcbacggg gtcbgacgct 4860 
bbggbcatga gabbabcaaa aaggabcbbc 4920 
bbbaaabcaa bcbaaagbat ababgagbaa 4980 
agbgaggcac cbatcbcagc gatctgbcta 5040 
gggggggggc gcbgaggbcb gccbcgtgaa 5100 
abcgccccab cabccagcca gaaagtgagg 5160 
gbggaccagb bggbgabbbt gaacbbbtgc 5220 
agabgcgbga bcbgabccbb caacbcagca. 5280 
cccgbcaagb cagcgtaabg cbcbgccagb 5340 
aaaactcabc gagcabcaaa tgaaacbgca 5400 
abbbbtgaaa aagccgbtbc bgbaabgaag 5460 
bggcaagabc cbggbabcgg bcbgcgabbc 5520 
abbbccccbc gbcaaaaaba aggbbabcaa 5580 
ccggbgagaa bggcaaaagc btabgcatbb 5640 
bacgctcgtc abcaaaabca ctcgcabcaa 5700 
gagcgagacg aaabacgcga bcgcbgtbaa 5760 
accggcgcag gaacacbgcc agcgcabcaa 5820 
cbaabaccbg gaabgcbgtb bbcccgggga 58 80 
gagbacggab aaaabgcbbg abggbcggaa 5940 
bgaccabcbc abcbgtaaca bcatbggcaa 6000 
cbggcgcabc gggcbtccca bacaatcgab 6060 
cgcgagccca tbbataccca tataaabcag 6120 
agcaagacgt bbcccgbbga atabggcbca 6180 
cagacagbbb babbgbtcab gabgababat 6240 
btbgagacac aacgbggcbt bccccccccc 6300 
gbcbcabgag cggatacata tbbgaabgba 6360 
gcacabbbcc ccgaaaagbg ccaccbgacg 6420 
ccbabaaaaa baggcgbabc acgaggcccb 64 80 

6486 



364 - 



WO 02/32943 

<213> Artificial Sequence 



PCT/US01/25721 



<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 159 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
ctattggcca ttgcatacgt tgtatccata tpataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgct acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgccfcg gagacgccat ccacgctgtt ttgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca tgggcgcccg cgccagcgtg ctgagcggcg gcgagctgga 1920 
ccgctgggag aagatccgcc tgcgccccgg cggcaagaag aagtacaagc tgaagcacat 1980 
cgtgtgggcc agccgcgagc tggagcgctt cgccgtgaac cccggcctgc tggagaccag 204 0 
cgagggctgc cgccagatcc tgggccagct gcagcccagc ctgcagaccg gcagcgagga 2100 
gctgcgcagc ctgtacaaca ccgtggccac cctgtactgc gtgcaccagc gcatcgagat 2160 
caaggacacc aaggaggccc tggacaagat cgaggaggag cagaacaaga gcaagaagaa 2220 
ggcccagcag gccgccgccg acaccggcca cagcaaccag gtgagccaga actaccccat 2280 
cgtgcagaac atccagggcc agatggtgca ccaggccatc agcccccgca ccctgaacgc 2340 
ctgggtgaag gtggtggagg agaaggcctt cagccccgag gtgatcccca tgttcagcgc 2400 
cctgagcgag ggcgccaccc cccaggacct gaacaccatg ctgaacaccg tgggcggcca 2460 

4 

ccaggccgcc atgcagatgc tgaaggagac catcaacgag gaggccgccg agtgggaccg 2520 

cgtgcacccc gtgcacgccg gccccatcgc ccccggccag atgcgcgagc cccgcggcag 2580 

cgacatcgcc ggcaccacca gcaccctgca ggagcagatc ggctggatga ccaacaaccc 264 0 

ccccatcccc gtgggcgaga tctacaagcg ctggatcatc ctgggcctga acaagatcgt 2700 

gcgcatgtac agccccacca gcatcctgga catccgccag ggccccaagg agcccttccg 2760 

cgactacgtg gaccgcttct acaagaccct gcgcgccgag caggccagcc aggaggtgaa 2 820 

gaactggatg accgagaccc tgctggtgca gaacgccaac cccgactgca agaccatcct 2 880 

gaaggccctg ggccccgccg ccaccctgga ggagatgatg accgcctgcc agggcgtggg 2940 
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cggccccggc cacaaggccc gcgtgctggc cgaggccatg agccaggtga ccaacagcgc 3000 
caccatcatg atgcagcgcg gcaacttccg caaccagcgc aagatcgtga agtgcttcaa 3060 
ctgcggcaag gagggccaca ccgcccgcaa ctgccgcgcc ccccgcaaga agggctgctg 3120 
gaagtgcggc aaggagggcc accagatgaa ggactgcacc gagcgacagg ctaataggga 3180 
agatctggcc ttcccacaag ggaaggccag ggaattttct tcagagcaga ccagagccaa 3240 
cagccccacc agaagagagc ttcagghttg gggaagagac aacaactccc tctcagaagc 3300 
aggagccgat agacaaggaa cbgbatccbt bagctbccct cagatcactc fcttggcagcg 3360 
acccctcgtc acaataaaga tagggggcca gctgaaggag gcccttctag acaccggcgc 3420 
cgacgacacc gbgctggagg agatgaacct gcccggccgc tggaagccca agatgatcgg 3480 
cggcatcggc ggcttcatca aggtgggcca gtacgaccag abcctgabcg agatctgcgg 3540 
ccacaaggcc atcggcaccg tgctggtggg ccccaccccc gtgaacatca bcggccgcaa 3600 
cctgctgacc cagatcggct gcaccctgaa cttccccatc agccccatcg agaccgtgcc 3660 
cgtgaagctg aagcccggca bggacggccc caaggtgaag cagtggcccc tgaccgagga 3720 
gaagatcaag gcccbggtgg agatctgcac cgagabggag aaggagggca agatcagcaa 3780 
gatcggcccc gagaaccccb acaacacccc cgtgbtcgcc abcaagaaga aggacagcac 3840 
caagtggcgc aagctggtgg actbccgcga gctgaacaag cgcacccagg acttctggga 3900 
ggtgcagctg ggcatccccc accccgccgg cctgaagcag aagaagagcg tgaccgtgct 3 960 
ggacgtgggc gacgcctact tcagcgtgcc cchggacaag gacttccgca agtacaccgc 4020 
cttcaccatc cccagcatca acaacgagac ccccggcatc cgcbaccagt acaacgtgct 4080 
gccccagggc tggaagggca gccccgccat cthccagtgc agcatgacca agatcctgga 414 0 
gcccttccgc aagcagaacc ccgacatcgt gatctaccag tacatggacg acctgtacgt 4200 
gggcagcgac ctggagatcg gccagcaccg caccaagatc gaggagctgc gccagcacct 4260 
gctgcgctgg ggcttcacca cccccgacaa gaagcaccag aaggagcccc ccttccbgtg 4320 
gabgggcbac gagctgcacc ccgacaagtg gaccgtgcag cccatcgtgc fcgcccgagaa 4380 
ggacagctgg accgtgaacg acatccagaa gctggtgggc aagctgaact gggccagcca 4440 
gatctacgcc ggcatcaagg tgcgccagct gtgcaagctg ctgcgcggca ccaaggccct 4500 
gaccgaggtg gtgcccctga ccgaggaggc cgagctggag ctggccgaga accgcgagat 4560 
cctgaaggag cccgtgcacg gcgbgbacba cgaccccagc aaggacctga tcgccgagat 4620 
ccagaagcag ggccagggcc agtggaccta ccagatctac caggagccct tcaagaacct 4680 
gaagaccggc aagtacgccc gcatgaaggg cgcccacacc aacgacgtga agcagctgac 4740 
cgaggccgtg cagaagatcg ccaccgagag catcgtgatc tggggcaaga cccccaagtt 4800 
caagctgccc atccagaagg agacctggga ggccbggbgg accgagtact ggcaggccac 4860 
ctggatcccc gagtgggagt tcgtgaacac ccccccccbg gtgaagcbgb ggtaccagct 492 0 
ggagaaggag cccatcatcg gcgccgagac cttctacgtg gacggcgccg ccaaccgcga 4980 
gaccaagctg ggcaaggccg gctacgtgac cgaccgcggc cgccagaagg tggtgcccct 5040 
gaccgacacc accaaccaga agaccgagct gcaggccabc cacctggccc tgcaggacag 5100 
cggcctggag gtgaacatcg tgaccgacag ccagtacgcc ctgggcatca tccaggccca 5160 
gcccgacaag agcgagagcg agchggtgag ccagatcabc gagcagctga bcaagaagga 52 2 0 
gaaggbgtac cbggcctggg tgcccgccca caagggcabc ggcggcaacg agcaggbgga 52 80 
cggccbggbg agcgccggca bccgcaaggt gctgttcctg gacggcatcg acaaggccca 5340 
ggaggagcac gagaagbacc acagcaactg gcgcgccatg gccagcgacfc bcaaccbgcc 5400 
ccccgbggtg gccaaggaga bcgbggccag cbgcgacaag bgccagcbga agggcgaggc 54 60 
catgcacggc caggbggacb gcagccccgg cabcbggcag cbggactgca cccaccbgga 552 0 
gggcaaggbg atcctggtgg ccgtgcacgb ggccagcggc bacatcgagg ccgaggbgat 5580 
ccccgccgag accggccagg agaccgccba cbtccbgctg aagcbggccg gccgcbggcc 5640 
cgbgaagacc gfcgcacaccg acaacggcag caacttcacc agcaccaccg bgaaggccgc 5700 
cbgctggbgg gccggcatca agcaggagtb cggcabcccc tacaaccccc agagccaggg 5760 
cgbgatcgag agcabgaaca aggagcbgaa gaagabcatc ggccaggtgc gcgaccaggc 582 0 
cgagcacctg aagaccgccg bgcagabggc cgbgbbcabc cacaactbca agcgcaaggg 5880 
cggcabcggc ggctacagcg ccggcgagcg cabcgtggac atcabcgcca ccgacabcca 594 0 
gaccaaggag cbgcagaagc agatcaccaa gabccagaac tbccgcgbgt acbaccgcga 6000 
cagccgcgac cccgtgtgga agggccccgc caagctgctg bggaagggcg agggcgccgb 6060 
ggbgabccag gacaacagcg acabcaaggt ggtgccccgc cgcaaggcca agatcabccg 6120 
cgactacggc aagcagatgg ccggcgacga ctgcgtggcc agccgccagg acgaggacba 6180 
- ggaatbctgc bgtgccbtcb agbtgccagc cabctgtbgt tbgccccbcc cccgtgcctb 6240 
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ccbbgaccct ggaaggtgcc actcccactg bccbbbccba ataaaabgag gaaattgcat 6300 
cgcattgtct gagbaggbgb cattctattc tggggggtgg ggtggggcag gacagcaagg 6360 
gggaggattg ggaagacaab agcaggcatg ctggggatgc ggtgggctct abgggbaccc 6420 
aggtgctgaa gaattgaccc ggtbccbccb gggccagaaa gaagcaggca catccccttc 6480 
tctgtgacac acccbgbcca cgcccctggb tcttagttcc agccccactc abaggacacb 6540 
cabagctcag gagggctccg ccttcaatcc cacccgcbaa agtacttgga gcggtctctc 6600 
cctccctcat cagcccacca aaccaaaccb agcctccaag agbgggaaga aattaaagca 6660 
agabaggcba ttaagtgcag agggagagaa aatgcctcca acatgbgagg aagtaatgag 6720 
agaaatcata gaattttaag gccatcatgg ccttaatctt ccgcttcctc gcbcacbgac 6780 
bcgcbgcgcb cggtcgttcg gcbgcggcga gcggbabcag ctcactcaaa ggcggbaaba 6840 
cggbbabcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 6900 
aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccabaggcb ccgcccccct 6960 
gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 7020 
agataccagg cgtttccccc tggaagctcc ctcgtgcgct cbccbgbbcc gaccctgccg 7080 
cttaccggat acctghccgc ctttctccct tcgggaagcg tggcgctttc tcabagcbca 7140 
cgctgtaggt abcbcagbbc ggbgbaggbc gbbcgcbcca agctgggctg tgtgcacgaa 7200 
ccccccgttc agcccgaccg ctgcgccbba bccggbaacb abcgbcbbga gtccaacccg 7260 
gbaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 7320 
babgbaggcg gbgctacaga gttcttgaag tggtggccta actacggcta cactagaaga 7380 
acagtatttg gbabctgcge tctgctgaag ccagttacct tcggaaaaag agttggtagc 7440 
tcttgatccg gcaaacaaac caccgcbggt agcggtggtt tttttgtttg caagcagcag 7500 
attacgcgca gaaaaaaagg atctcaagaa gatcchttga hcttttctac ggggtctgac 7560 
gctcagtgga acgaaaactc acgbtaaggg attttggtca tgagattatc aaaaaggatc 7620 
btcacctaga bccbbtbaaa btaaaaatga agttttaaab caatcbaaag batababgag 7680 
taaacbbggb cbgacagbba ccaabgcbba abcagtgagg cacctabcbc agcgatcbgb 7740 
cbabbbcgbb catccatagb tgcctgacbc gggggggggg ggcgcbgagg bcbgccbcgb 7800 
gaagaaggbg btgcbgacbc abaccaggcc tgaatcgccc catcabccag ccagaaagbg 7860 
agggagccac ggbbgabgag agctbbgbbg taggbggacc agtbggbgat btbgaacbbb 7920 
tgcbbbgcca cggaacggtc bgcgbbgbcg ggaagabgcg bgabcbgabc cbbcaacbca 7980 
gcaaaagbbc gabtbatbca acaaagccgc cgbcccgbca agbcagcgba abgcbcbgcc 8040 
agbgbbacaa ccaabbaacc aabbcbgabb agaaaaacbc abcgagcabc aaafcgaaact 8100 
gcaatbtabb cababcagga bbatcaabac cababbbbbg aaaaagccgb bbcbgbaabg 8160 
aaggagaaaa ctcaccgagg cagbtccaba ggabggcaag abcctggtab cggbcbgcga 8220 
bbccgacbcg bccaacabca atacaaccba btaabtbccc cbcgtcaaaa abaaggbbab 8280 
caagtgagaa abcaccabga gbgacgacbg aatccggtga gaabggcaaa agcbbatgca 8340 
bbbctbbcca gacbbgbbca acaggccagc cabbacgcbc gbcabcaaaa bcactcgcab 8400 
caaccaaacc gbbabbcabb cgbgabtgcg cctgagcgag acgaaabacg cgabcgcbgt 8460 
baaaaggaca abbacaaaca ggaatcgaab gcaaccggcg caggaacact gccagcgcab 8520 
caacaababt bbcaccbgaa bcaggatatt cbbctaabac cbggaabgct gbbbbcccgg 8580 
ggabcgcagb ggbgagbaac cabgcabcat caggagbacg gabaaaabgc btgabggbcg 864 0 
gaagaggcab aaattccgbc agccagbbba gbcbgaccab cbcabcbgba acabcatbgg 8700 
caacgctacc bbbgccabgb bbcagaaaca acbcbggcgc abcgggcbbc ccabacaabc 8760 
gabagatbgb cgcaccbgab bgcccgacab babcgcgagc ccabbtabac ccababaaab 8820 
cagcabccab gbbggaabbb aabcgcggcc bcgagcaaga cgbbbcccgb tgaababggc 8880 
bcabaacacc ccbbgbabba ctgtbbabgb aagcagacag bbbbabbgbb cabgabgaba 8940 
batbbbbabc tbgbgcaabg baacabcaga gabbbbgaga cacaacgbgg cbbtcccccc 9000 
ccccccabba tbgaagcabb babcagggbb atbgbctcat gagcggabac atabbbgaab 9060 
gtabbbagaa aaataaacaa abaggggbbc cgcgcacabb bccccgaaaa gbgccaccbg 9120 
acgtcbaaga aaccabbabb abcabgacab baacctabaa aaabaggcgb abcacgaggc 9180 
ccbbtcgbc 9189 



<210> 160 
<211> 9285 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing SIV genes 
<400> 160 

tcgcgcgttt cggtgatgac ggbgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg bcagcgggtg 120 
ttggcgggtg tcggggctgg ctbaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gbgcgaaata ccgcacagat gcgfcaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgbatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcabba gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 4 80 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt bacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat bagtcatcgc tattaccabg gtgatgcggt tttggcagta 72 0 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccabtga 780 
cgbcaabggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 300 
agctcghtta gtgaaccgtc agatcgcctg gagacgccafc ccacgctgtt ttgacctcca 960 
bagaagacac cgggaccgab ccagcctccg cggccgggaa cggtgcabtg gaacgcggat 1020 
tccccgtgcc aagagtgacg baagtaccgc cbatagactc tabaggcaca cccctbtggc 1080 
tctbatgcat gcbatactgb ttbtggcttg gggcctatac acccccgctt ccttatgcba 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattatbg accacbcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 
tattggctab atgccaabac tctgbccttc agagactgac acggactctg tabbtbbaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagbtttt atbaaacata gcgbgggatc tccacgcgaa tctcgggbac gtgtbccgga 1440 
catgggctct tctccggtag cggcggagcb tccacabccg agccctggtc ccabgcctcc 1500 
agcggctcat ggbcgctcgg cagctccbtg ctccbaacag bggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg ghabgbgbcb 162 0 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag abgcaggcag ctgagttgbt gbabbctgab aagagtcaga ggbaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagbcacc gtcgtcgacg ccgccaccat gggcgbgcgc aacagcgtgc tgagcggcaa 1920 
gaaggccgac gagctggaga agatccgcct gcgcccgaac ggcaagaaga agtacatgct 1980 
gaagcacgtg gtgtgggccg ccaacgagct ggaccgcttc ggcctggccg agagcctgct 2040 
ggagaacaag gagggctgcc agaagabcct gagcgtgctg gcgccgcbgg tgcccaccgg 2100 
cagcgagaac ctgaagagcc tgtacaacac cgtgtgcgtg atctggtgca bccacgccga 2160 
ggagaaggtg aagcacaccg aggaggccaa gcagatcgtg cagcgccacc tggtggtgga 2220 
gaccggcacc accgagacca bgcccaagac cagccgcccg accgcgccca gcagcggacg 2280 
cggcggcaac tacccggtgc agcagabcgg cggcaactac gtgcacctgc cgcbgagccc 2340 
gcgcaccctg aacgcctggg tgaagctgat cgaggagaag aagttcggcg ccgaggtggt 2400 
gcccggcttc caggccctga gcgagggctg cacgccctac gacabcaacc agabgctgaa 2460 
ctgcgtgggc gaccaccagg ccgccatgca gabcabccgc gacabcabca acgaggaggc 2520 
cgccgactgg gacctgcagc acccgcagcc cgcgccgcag cagggccagc tgcgcgagcc 2580 
cagcggcagc gacatcgccg gcaccaccag cagcgtggac gagcagatcc agbggatgta 264 0 
ccgccagcag aacccgabcc cggbgggcaa catctaccgc cgcbggatac agcbgggccb 2700 
gcagaagtgc gtgcgcatgt acaacccgac caacatcctg gacgtgaagc agggcccgaa 2760 
ggagcccttc cagagcbacg tggaccgctt cfcacaagagc ctgcgcgccg agcagaccga 2820 
cgccgccgtg aagaactgga bgacccagac cctgctgatc cagaacgcca acccggactg 2 880 
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caagctggtg ctgaagggcc tgggcgtgaa cccgaccctg gaggagatgc bgaccgcctg 2 94 0 
ccagggcgtg ggcggcccgg gccagaaggc ccgcctgatg gccgaggccc tgaaggaggc 3000 
cctggcgccc gtgcccatcc cgttcgcggc cgcccagcag cgcggcccgc gcaagcccat 3060 
caagtgctgg aactgcggca aggagggcca cagcgcccgc cagtgccgcg cgccgcgccg 3120 
ccagggctgc tggaagtgcg gcaagatgga ccacgtgahg gccaagtgcc cggaccgcca 3180 
ggcgggtttt ttaggccttg gtccatgggg aaagaagccg cgcaatttcc cgatggctca 3240 
agtgcatcag gggcfcgabgc caactgctcc gccagaggac ccagctgtgg atctgctaaa 3300 
gaactacatg cagttgggca agcagcagag agaaaagcag agagaaagca gagagaagcc 3360 
ttacaaggag gtgacagagg atttgctgca cctcaattcfc ctctttggag gagaccagta 3420 
gtcactgcat cgatcgaggg ccagcccgtg gaggtgcttc tagacaccgg cgccgacgac 3480 
agcatcgtga ccggcatcga gctgggcccg cactacaccc cgaagatcgt gggcggcatc 3540 
ggcggcttca tcaacaccgg agagtacaag aacgtggaga tcgaggtgct gggcaagcgc 3600 
atcaagggca ccaccatgac cggcgacacg cccatcaaca tcttcggccg caacctgctg 3660 
accgcccfcgg gcahgagcct gaacttcccc atcgccaagg tggagcccgt gaaggtggcc 3720 
ctgaagcccg gcaaggacgg ccccaagctg aagcagtggc ccctgagcaa ggagaagatc 37 80 
gtggccctgc gcgagatctg cgagaagatg gagaaggacg gccagctgga ggaggcccca 384 0 
cccaccaacc cctacaacac gcccaccttc gccatcaaga agaaggacaa gaacaagtgg 3 900 
cgcatgctga tcgacttccg cgagctgaac cgcgtgaccc aggacttcac cgaggtgcag 3960 
ctgggcatcc cgcaccccgc cggcctggcc aagcgcaagc gcatcaccgt gctggacatc 4020 
ggcgacgcct acttcagcat ccccctggac gaggagttcc gccagtacac cgccttcacc 4080 
ctgcccagcg tgaacaacgc cgagcccggc aagcgctaca tctacaaggt gctgccccag 4140 
ggctggaagg gcagccccgc catcttccag tacaccatgc gccacgtgct ggagcccfctc 4200 
cgcaaggcca accccgacgt gaccctggtg cagtacatgc atgacatcct gatcgccagc 4260 
gaccgcaccg acctggagca cgaccgcgtg gtgctgcaga gcaaggagct gctgaacagc 4 320 
atcggcttca gcacgcccga ggagaagttc cagaaggacc cacccctcca gtggatgggc 4 3 80 
tacgagctgt ggcccaccaa gtggaagctg cagaagatcg agctgcccca gcgcgagacc 4440 
tggaccgtga acgacatcca gaagctggtg ggcgtgctga actgggccgc ccagatctac 4500 
cccggcatca agaccaagca cctgtgccgc ctgatccgcg gcaagatgac cctgaccgag 4560 
gaggtgcagt ggaccgagat ggccgaggcc gagtacgagg agaacaagat catcctgagc 4620 
caggagcagg agggctgcta ctaccaggag ggcaagcccc tggaggccac cgtgatcaag 4680 
agccaggaca accagtggag ctacaagatc caccaggagg acaagafccct gaaggtgggc 4740 
aagttcgcca agatcaagaa cacccacacc aacggcgtgc gcctgctggc ccacgtgatc 4800 
cagaagatcg gcaaggaggc catcgtgatc tggggccagg tgcccaagtt ccacctgccc 4860 
gtggagaagg acgtgtggga gcagtggtgg accgactact ggcaggtgac ctggataccc 4 920 
gagtgggact tcatcagcac ccctcccctg gcgcgcctgg tgtfccaacct ggtgaaggac 4 980 
cccatcgagg gcgaggagac ctactacacc gacggcagct gcaacaagca gagcaaggag 5040 
ggcaaggccg gctacatcac cgaccgcggc aaggacaagg tgaaggtgct ggagcagacq 5100 
accaaccagc aggccgagct cgaggccttc cfcgatggccc tgaccgacag cggccccaag 5160 
gccaacatca tcgtggacag ccagtacgtg atgggcatca tcaccggctg ccccaccgag 52 2 0 
agcgagagcc gcctggcgaa ccagatcatc gaggagatga tcaagaagag cgagatctac 5280 
gtggcctggg tgcccgccca caagggcatc ggcggcaacc aggagatcga ccacctggtg 5340 
agccagggca tccgccaggt gctgttcctg gagaagatcg agcccgccca ggaggagcac 5400 
gacaagtacc acagcaacgt gaaggagctg gtgttcaagt tcggcctgcc ccgcatcgtg 5460 
gcccgccaga tcgtggacac ctgcgacaag tgccaccaga agggcgaggc catccacggc 5520 
caggccaaca gcgacctggg cacchggcag atggcctgca cccacctgga gggcaagatc 5580 
atcatcgtgg ccgtgcacgt ggccagcggc ttcafccgagg ccgaggtgat cccgcaggag 564 0 
accggccgcc agaccgccct gctcctgctg aagctggccg gccgctggcc catcacccac 57 00 
ctgcacaccg acaacggcgc caacttcgcc agccaggagg tgaagatggt ggcctggtgg 5760 
gccggcatcg agcacacctt cggcgtgccc tacaacccgc agagccaggg cgtggtggag 5820 
gccatgaacc accacctgaa gaaccagatc gaccgcatcc gcgagcaggc caacagcgtg 5880 
gagaccatcg tgctgatggc cgtgcactgc atgaacttca agcgccgcgg cggcatcggc 5940 
gacatgacgc ccgccgagcg cctgatcaac atgatcacca ccgagcagga gafcccagttc 6000 
cagcagagca agaacagcaa gttcaagaac ttccgcgtgt actaccgcga gggccgcgac 6060 
cagctgtgga agggccccgg cgagctgctg tggaagggcg agggcgccgc gatcctgaag 6120 
gtgggcaccg acatcaaggt ggtgccccgc cgcaaggcca agatcatcaa ggactacggc 6180 
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ggcggcaagg aggtggacag cagcagccac abggaggaca ccggcgaggc ccgcgaggtg 624 0 

gcctgagaat tcgtttaaac acgcgtgtat acggatccag atctgctgtg ccttctagtt 6300 

gccagccatc tgttgtttgc cccbcccccg tgccttcctt gaccctggaa ggbgccacbc 6360 

ccacbgbcct bbccbaabaa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt 6420 

cbabbctggg gggbggggbg gggcaggaca gcaaggggga ggabtgggaa gacaatagca 64 80 

ggcatgctgg ggabgcggbg ggctcbabgg gtacccaggt gcbgaagaab tgacccggtt 6540 

cctcctgggc cagaaagaag caggcacatc cccbbctctg tgacacaccc tgtccacgcc 6600 

ccbggt'bcbt agttccagcc ccacbcabag gacactcata gcbcaggagg gctccgcctt 6660 

caatcccacc cgctaaagta cttggagcgg tctctccctc ccbcabcagc ccaccaaacc 6720 

aaaccbagcc tccaagagtg ggaagaaatt aaagcaagat aggctatbaa gbgcagaggg 6780 

agagaaaabg ccbccaacab gtgaggaagb aabgagagaa abcabagaab bbbaaggcca 684 0 

bcatggccbb aabcbtccgc tbccbcgctc acbgacbcgc bgcgcbcggt cgbbcggcbg 6900 

cggcgagcgg tabcagctca ctcaaaggcg gtaabacggt batccacaga abcaggggab 6960 

aacgcaggaa agaacabgbg agcaaaaggc cagcaaaagg ccaggaaccg baaaaaggcc 7020 

gcgbbgcbgg cgbbbbbcca baggcbccgc ccccctgacg agcabcacaa aaabcgacgc 7080 

bcaagtcaga ggtggcgaaa cccgacagga ctabaaagab accaggcgbt bcccccbgga 7140 

agcbcccbcg bgcgcbctcc tgbtccgacc ctgccgctta ccggabacct gbccgccbbb 7200 

ctccctbcgg gaagcgbggc gcbbbctcab agcbcacgcb gtaggtabct cagtbcggtg 7260 

taggtcgbbc gcbccaagcb gggctgbgtg cacgaacccc ccgbbcagcc cgaccgcbgc 732 0 

gccbbabccg gbaacbatcg bcbbgagbcc aacccggbaa gacacgacbb abcgccacbg 7380 

gcagcagcca cbggtaacag gabbagcaga gcgaggtabg baggcggbgc bacagagbbc 744 0 

ttgaagbggb ggccbaacba cggcbacacb agaagaacag batbtggbab cbgcgcbcbg 7500 

cbgaagccag bbaccbbcgg aaaaagagbb ggtagcbctb gatccggcaa acaaaccacc 7560 

gcbggtagcg gtggbbbtbb tgbttgcaag cagcagabta cgcgcagaaa aaaaggabcb 762 0 

caagaagabc cbbtgabcbb btcbacgggg tcbgacgctc agbggaacga aaacbcacgb 7680 

baagggabbb bggtcabgag abbatcaaaa aggatcbbca ccbagabcct bbbaaabbaa. 7740 

aaabgaagbt bbaaabcaab cbaaagbaba tabgagbaaa cbbggbcbga cagtbaccaa 7800 

bgcbbaabca gbgaggcacc tabcbcagcg abcbgbctab btcgbbcabc cabagtbgcc 7860 

tgactcgggg ggggggggcg ctgaggbcbg ccbcgbgaag aaggbgbtgc bgacbcabac 7920 

caggcctgaa bcgccccabc abccagccag aaagbgaggg agccacggbb gabgagagcb 7980 

bbgbbgbagg bggaccagbb ggbgatbbbg aactbbbgcb bbgccacgga acggtctgcg 8040 

bbgbcgggaa gabgcgbgab cbgabccbbc aacbcagcaa aagbbcgabb tabbcaacaa 8100 

agccgccgbc ccgbcaagbc agcgbaabgc bcbgccagtg bbacaaccaa tbaaccaabb 8160 

cbgabtagaa aaacbcatcg agcabcaaab gaaactgcaa bbtabbcaba tcaggabbab 822 0 

caabaccaba bbbbbgaaaa agccgbbbct gbaabgaagg agaaaactca ccgaggcagb 8280 

bccabaggab ggcaagabcc bggbatcggb cbgcgabbcc gacbcgbcca acatcaabac 8340 

aaccbattaa bbbccccbcg bcaaaaabaa ggbbabcaag bgagaaabca ccatgagbga 8400 

cgacbgaabc cggbgagaab ggcaaaagcb babgcabbbc bbbccagacb bgbtcaacag 8460 

gccagccabb acgcbcgbca bcaaaabcac bcgcabcaac caaaccgbba btcabbcgbg 8520 

abbgcgccbg agcgagacga aatacgcgab cgcbgbbaaa aggacaabba caaacaggaa 8580 

bcgaatgcaa ccggcgcagg aacactgcca gcgcabcaac aababbbtca ccbgaabcag 8640 

gatabtctbc baabaccbgg aabgcbgbbb bcccggggab cgcagbggbg agbaaccabg 8700 

cabcatcagg agbacggaba aaabgcbbga bggbcggaag aggcabaaab bccgbcagcc 8760 

agbttagtcb gaccabcbca bcbgbaacab catbggcaac gcbaccbbbg ccatgbbbca 8820 

gaaacaacbc bggcgcabcg ggctbcccab acaabcgaba gabtgbcgca cctgabbgcc 8880 

cgacatbabc gcgagcccat btabacccat ataaabcagc abccabgbbg gaabbtaatc 8940 

gcggccbcga gcaagacgbb bcccgbbgaa babggctcab aacaccccbt gtabbacbgb 9000 

btatgbaagc agacagbbbb abbgbbcabg atgabatabb bbbabcbbgb gcaabgbaac 9060 

atcagagabb bbgagacaca acgbggcbbb cccccccccc ccabbabbga agcabbbatc 9120 

agggttabbg bcbcabgagc ggabacabab tbgaabgbab bbagaaaaab aaacaaabag 9180 

gggbtccgcg cacabbtccc cgaaaagbgc caccbgacgb cbaagaaacc abbabbabca 9240 

tgacatbaac ctataaaaab aggcgtabca cgaggcccbt bcgbc 9285 
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<210> 161 
<211> 7897 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 161 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 84 0 

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 

tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 

tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 

taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 

tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatctc 1260 

tattggctat atgccaatac tctgtccttc. agagactgac acggactctg tatttttaca 1320 

99atggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 

cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 

catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 

agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 

agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 

gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 

gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 

gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 

cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 

tgcagtcacc gtcgtcgaca tgagggaaga tctggccttc ccacaaggga aggccaggga 1920 

attttcttca gagcagacca gagccaacag ccccaccaga agagagcttc aggtttgggg 1980 

aagagacaac aactccctct cagaagcagg agccgataga caaggaactg tatcctttag 2040 

cttccctcag atcactcttt ggcagcgacc cctcgtcaca ataaagatag ggggccagct 2100 

gaaggaggcc cttctagaca ccggcgccga cgacaccgtg ctggaggaga tgaacctgcc 2160 

cggccgctgg aagcccaaga tgatcggcgg catcggcggc ttcatcaagg tgggccagta 2220 

cgaccagatc ctgatcgaga tctgcggcca caaggccatc ggcaccgtgc tggtgggccc 2280 

cacccccgtg aacatcatcg gccgcaacct gctgacccag atcggctgca ccctgaactt 2340 

ccccatcagc cccatcgaga ccgtgcccgt gaagctgaag cccggcatgg acggccccaa 24 00 

ggtgaagcag tggcccctga ccgaggagaa gatcaaggcc ctggtggaga tctgcaccga 24 60 

gatggagaag gagggcaaga tcagcaagat cggccccgag aacccctaca acacccccgt 2520 

gttcgccatc aagaagaagg acagcaccaa gtggcgcaag ctggtggact tccgcgagct 2580 

gaacaagcgc acccaggact tctgggaggt gcagctgggc atcccccacc ccgccggcct 264 0 

gaagcagaag aagagcgtga ccgtgctgga cgfcgggcgac gcctacttca gcgtgcccct 2700 

ggacaaggac ttccgcaagt acaccgcctt caccatcccc agcatcaaca acgagacccc 2760 
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cggcatccgc taccagtaca acgtgctgcc ccagggchgg aagggcagcc ccgccabcbb 2 820 
ccagtgcagc abgaccaaga tcctggagcc cttccgcaag cagaaccccg acatcgtgat 2880 
ctaccagtac atggacgacc tgtacgtggg cagcgaccbg gagatcggcc agcaccgcac 2 94 0 
caagatcgag gagctgcgcc agcacctgct gcgctggggc ttcaccaccc ccgacaagaa 3000 
gcaccagaag gagcccccct tcctgtggat gggctacgag ctgcaccccg acaagtggac 3060 
cgtgcagccc abcgtgctgc ccgagaagga cagctggacc gtgaacgaca tccagaagct 3120 
99tg99 ca ag ctgaactggg ccagccagat ctacgccggc ahcaaggtgc gccagctgtg 3180 
caagctgcbg cgcggcacca aggccctgac cgaggtggtg cccctgaccg aggaggccga 324 0 
gchggagctg gccgagaacc gcgagatcct gaaggagccc gtgcacggcg bgbacbacga 3300 
ccccagcaag gacctgatcg ccgagatcca gaagcagggc cagggccagt ggacctacca 3360 
gatctaccag gagcccttca agaacctgaa gaccggcaag tacgcccgca tgaagggcgc 3420 
ccacaccaac gacgtgaagc agcbgaccga ggccgtgcag aagatcgcca ccgagagcat 34 80 
cgtgatctgg ggcaagaccc ccaagttcaa gctgcccatc cagaaggaga cctgggaggc 3540 
ctggtggacc gagbacbggc aggccacctg gatccccgag tgggagttcg tgaacacccc 3600 
cccccbggtg aagctgtggt accagctgga gaaggagccc atcatcggcg ccgagacctt 3660 
ctacgbggac ggcgccgcca accgcgagac caagctgggc aaggccggct acgtgaccga 3720 
ccgcggccgc cagaaggtgg tgcccctgac cgacaccacc aaccagaaga ccgagctgca 3780 
ggccatccac ctggccctgc aggacagcgg cctggaggtg aacatcgtga ccgacagcca 3840 
gtacgccctg ggcatcatcc aggcccagcc cgacaagagc gagagcgagc tggtgagcca 3900 
gatcatcgag cagctgatca agaaggagaa ggtgtacctg gcctgggtgc ccgcccacaa 3960 
gggcatcggc ggcaacgagc aggtggacgg cctggtgagc gccggcatcc gcaaggtgct 4020 
gttcctggac ggcatcgaca aggcccagga ggagcacgag aagfcaccaca gcaactggcg 4 0 80 
cgccafcggcc agcgacttca accbgccccc cgtggtggcc aaggagatcg tggccagctg 414 0 
cgacaagtgc cagctgaagg gcgaggccat gcacggccag gtggactgca gccccggcat 4200 
ctggcagctg gactgcaccc accbggaggg caaggtgatc ctggtggccg tgcacgbggc 4260 
cagcggctac atcgaggccg aggtgatccc cgccgagacc ggccaggaga ccgcctactt 4320 
ccbgctgaag ctggccggcc gctggcccgb gaagaccgtg cacaccgaca acggcagcaa 4380 
cbbcaccagc accaccghga aggccgcctg ctggtgggcc ggcatcaagc aggagttcgg 4440 
catcccctac aacccccaga gccagggcgt gatcgagagc atgaacaagg agctgaagaa 4500 
gatcatcggc caggtgcgcg accaggccga gcacctgaag accgccgtgc agatggccgt 4560 
gttcatccac aacttcaagc gcaagggcgg catcggcggc tacagcgccg gcgagcgcat 4620 
cgtggacatc atcgccaccg acacccagac caaggagctg cagaagcaga tcaccaagat 4680 
ccagaacttc cgcgtgbacb accgcgacag ccgcgacccc gtgtggaagg gccccgccaa 474 0 
gctgctgtgg aagggcgagg gcgccgtggt gatccaggac aacagcgaca tcaaggtggt 4800 
gccccgccgc aaggccaaga tcatccgcga ctacggcaag cagatggccg gcgacgactg 4860 
cgfcggccagc cgccaggacg aggactagga atttctgctg tgccttctag ttgccagcca 492 0 
tctgttgttt gcccctcccc cgbgccbbcc bbgaccctgg aaggtgccac tcccactgtc 4980 
ctbtccbaab • aaaabgagga aattgcatcg catbgbctga gtaggtgtca ttctabbctg 5040 
99999^9999 tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct 5100 
9999atgcgg tgggctctat gggbacccag gbgctgaaga attgacccgg ttccbccbgg 5160 
gccagaaaga agcaggcaca tccccttctc tgtgacacac cctgtccacg cccctggttc 5220 
ttagttccag ccccacbcat aggacactca tagctcagga gggctccgcc ttcaatccca 5280 
cccgctaaag tacttggagc ggtctctccc bccctcatca gcccaccaaa ccaaacctag 5340 
cctccaagag tgggaagaaa ttaaagcaag ataggctatt aagtgcagag ggagagaaaa 5400 
tgcctccaac atgtgaggaa gtaatgagag aaatcabaga attttaaggc catcatggcc 5460 
ttaatcbbcc gcbbcctcgc bcacbgactc gctgcgctcg gtcgtbcggc bgcggcgagc 5520 
ggbatcagcb cacbcaaagg cggtaabacg gbtabccaca gaabcagggg ataacgcagg 5580 
aaagaacatg bgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgbbgct 564 0 
ggcgbbttbc cabaggcbcc gcccccctga cgagcabcac aaaaatcgac gctcaagtca 5700 
gaggtggcga aacccgacag gactabaaag ataccaggcg bttccccctg gaagctcccb 5760 
cgbgcgcbcb cctgttccga cccbgccgcb baccggatac ctgtccgccb tbcbcccbtc 5820 
gggaagcgbg gcgcttbcbc abagcbcacg cbgtaggbab ctcagbtcgg tgbaggtcgt 5880 
tcgcbccaag cbgggcbgbg tgcacgaacc ccccgttcag cccgaccgct gcgccbbabc 5940 
cggbaacbat cgbcbtgagt ccaacccggb aagacacgac bbabcgccac fcggcagcagc 6000 
cactggtaac aggattagca gagcgaggta bgtaggcggt gcbacagagt tcttgaagtg 6060 
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gtggcctaac tacggctaca ctagaagaac agtatttggt abcbgcgcbc bgcbgaagcc 6120 
agttaccttc ggaaaaagag bbggbagcbc ttgatccggc aaacaaacca ccgctggtag 6180 
c 9g t 99 tttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggab cbcaagaaga 6240 
bccbbbgabc tttbctacgg ggfcctgacgc tcagtggaac gaaaactcac gttaagggat 6300 
tbbggbcabg agattatcaa aaaggatcbb cacctagatc ctbbbaaabt aaaaatgaag 6360 
ttttaaatca abcbaaagta tatatgagta aacttggtct gacagttacc aabgcbtaab 6420 
cagtgaggca cctatctcag cgatcbgbct abtbcgbbca bccatagtbg cctgactcgg 6480 
9999999999 cgctgaggtc tgcctcgtga agaaggtgtt gcbgacbcab accaggcctg 6540 
aatcgcccca tcabccagcc agaaagtgag ggagccacgg ttgabgagag ctbbgbbgba 6600 
ggtggaccag bbggbgabbb tgaacttttg cbbbgccacg gaacggtctg cgttgtcggg 6660 
aagatgcgtg atcbgatcct tcaactcagc aaaagttcga bbbabbcaac aaagccgccg 6720 
tcccgtcaag tcagcgtaat gctctgccag tgttacaacc aattaaccaa ttctgattag 6780 
aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt abcaabacca 6840 
tatttbtgaa aaagccgtbb cbgtaatgaa ggagaaaact caccgaggca gtbccabagg 6900 
abggcaagab cctggbatcg gbcbgcgabt ccgactcgbc caacatcaat acaaccbabb 6960 
aabbbccccb cgbcaaaaab aaggtbabca agbgagaaab caccabgagt gacgacbgaa 7020 
bccggbgaga abggcaaaag cbbabgcabb bcbbbccaga cbbgbbcaac aggccagcca 7080 
tbacgcbcgb catcaaaatc acbcgcabca accaaaccgb babbcabbcg tgabtgcgcc 7140 
bgagcgagac gaaabacgcg abcgcbgbta aaaggacaab bacaaacagg aabcgaabgc 7200 
aaccggcgca ggaacacbgc cagcgcabca acaababbbt cacctgaabc aggatatbcb 7260 
tcbaabaccb ggaatgcbgb bbbcccgggg abcgcagtgg bgagbaacca tgcabcabca 7320 
ggagbacgga taaaabgcbb gabggbcgga agaggcabaa atbccgbcag ccagbbbagb 73 80 
cbgaccabcb cabcbgbaac abcabbggca acgcbaccbb bgccabgtbb cagaaacaac 7440 
tcbggcgcab cgggcbbccc atacaabcga bagabbgbcg cacctgatbg cccgacabba 7500 
bcgcgagccc abttabaccc ababaaabca gcatccatgb tggaabtbaa fccgcggccbc 7560 
gagcaagacg tbtcccgbbg aabatggcbc abaacacccc bbgbabbacb gbbbabgbaa 7620 
gcagacagbb bbabbgbtca bgatgababa tbbbbabcbb gtgcaabgba acabcagaga 7680 
bbbbgagaca caacgbggcb bbcccccccc ccccabbabb gaagcatbba bcagggbbab 7740 
bgbcbcabga gcggabacab abbbgaabgb abbbagaaaa abaaacaaab aggggbbccg 7800 
cgcacatbbc cccgaaaagb gccaccbgac gtcbaagaaa ccatbabbab cabgacatba 7860 
acctataaaa abaggcgtab cacgaggccc fcbbcgbc 7 897 



<210> 162 
<211> 7984 
<212> DNA 

<213> Arbificial Seguence 
<220> 

<223> plasmid pVR1012x/s conbaining SXV genes 
<40O> 162 

bcgcgcgbbb cggtgabgac ggbgaaaacc bctgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagatbgba cbgagagbgc 180 
accababgcg gtgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 24 0 
cbattggcca tbgcabacgb bgbatccaba tcabaababg bacabtbaba bbggcbcabg 300 
bccaacabba ccgccabgbb gacabbgabb abbgacbagb babbaabagb aabcaabbac 3 60 
g999tcabba gbbcabagcc cabatabgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabtgacg bcaabaabga cgbabgbtcc 4 80 
cabagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagbabb bacggbaaac 54 0 
bgcccacbtg gcagbacabc aagbgbabca babgccaagb acgcccccba ttgacgbcaa 600 
bgacggbaaa tggcccgccb ggcatbabgc ccagbacabg accbbabggg acbbbcctac 660 
bbggcagbac abcbacgtab bagbcabcgc babbaccabg gbgabgcggb bbbggcagba 720 
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catcaatggg cgtggatagc ggtttgactc 
cgtcaatggg agbbtgbbbb ggcaccaaaa 
ctccgcccca ttgacgcaaa tgggcggtag 
agctcgbbba gbgaaccgbc agatcgccbg 
tagaagacac cgggaccgat ccagcctccg 
tccccgtgcc aagagtgacg taagbaccgc 
bcbbabgcab gctatactgt ttttggcttg 
taggtgatgg tatagcttag cctataggtg 
tattggtgac gatactbbcc attactaatc 
tattggctat atgccaatac tctgtccttc 
99atggggtc ccatttatta btbacaaatb 
cgcagttttt attaaacata gcgbgggabc 
catgggctct tctccggtag cggcggagct 
agcggctcat ggbcgcbcgg cagctccttg 
agcacaatgc ccaccaccac cagtgtgccg 
gaaaabgagc gtggagattg ggcbcgcacg 
gcagaagaag atgcaggcag ctgagttgtt 
gttgcggtgc bgbbaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gtcgtcgacg ccgccaccat 
gcaatttccc catggctcaa gtgcatcagg 
cagctgtgga tctgctaaag aactacatgc 
gagaaagcag agagaagccb tacaaggagg 
tctttggagg agaccagtag bcacbgcabc 
agacaccggc gccgacgaca gcatcgtgac 
gaagatcgfcg ggcggcatcg gcggcttcat 
cgaggtgctg ggcaagcgca tcaagggcac 
cttcggccgc aacctgctga ccgccctggg 
ggagcccgtg aaggtggccc tgaagcccgg 
cctgagcaag gagaagatcg tggccctgcg 
ccagctggag gaggccccac ccaccaaccc 
gaaggacaag aacaagtggc gcatgctgat 
ggacttcacc gaggtgcagc tgggcatccc 
catcaccgtg cbggacabcg gcgacgccta 
ccagtacacc gccttcaccc tgcccagcgt 
ctacaaggtg ctgccccagg gctggaaggg 
ccacgtgctg gagcccbbcc gcaaggccaa 
tgacatcctg atcgccagcg accgcaccga 
caaggagctg ctgaacagca tcggcttcag 
acccttccag tggatgggct acgagchgtg 
gctgccccag cgcgagaccb ggaccgtgaa 
ctgggccgcc cagatctacc ccggcatcaa 
caagatgacc ctgaccgagg aggbgcagbg 
gaacaagatc atcctgagcc aggagcagga 
99 a 99 cca cc gtgatcaaga gccaggacaa 
caagatcctg aaggtgggca agttcgccaa 
cctgctggcc cacgtgatcc agaagatcgg 
gcccaagttc cacctgcccg tggagaagga 
gcaggtgacc tggatacccg agtgggactt 
gttcaacctg gtgaaggacc ccatcgaggg 
caacaagcag agcaaggagg gcaaggccgg 
gaaggtgctg gagcagacca ccaaccagca 
gaccgacagc ggccccaagg ccaacatcat 
caccggctgc cccaccgaga gcgagagccg 
caagaagagc gagatctacg bggccbgggb 



acggggabbt ccaagtctcc accccabtga 780 
tcaacgggac tttccaaaat gtcgtaacaa 840 
gcgtgtacgg tgggaggtct atataagcag 900 
gagacgccat ccacgctgtt ttgacctcca 960 
cggccgggaa cggtgcattg gaacgcggat 1020 
cbabagacbc babaggcaca ccccbtbggc 1080 
gggccbatac acccccgcbb cctbabgcba 1140 
tgggbtabbg accabtabbg accacbcccc 1200 
cataacabgg cbcbbbgcca caactabcbc 1260 
agagacbgac acggacbctg babbtbbaca 1320 
cacababaca acaacgccgb cccccgbgcc 1380 
bccacgcgaa tcbcgggbac gbgtbccgga 1440 
bccacatccg agcccbggbc ccabgcctcc 1500 
ctccbaacag tggaggccag acbbaggcac 1560 
cacaaggccg bggcggbagg gtabgtgtcb 1620 
gcbgacgcag atggaagact baaggcagcg 1680 
gbabbcbgab aagagbcaga ggbaactccc 1740 
gbagbcbgag cagbacbcgb bgcbgccgcg 1800 
aacagacbgfc bccbbbccab gggbctbbtc 1860 
gaggcctbgg tccabgggga aagaagcccc 1920 
ggcbgatgcc aacbgcbccc cgagaggacc 1980 
agbbgggcaa gcagcagaga gaaaagcaga 204 0 
bgacagagga bbbgctgcac ctcaatbcbc 2100 
gabcgagggc cagcccgbgg aggbgcbbct 2160 
cggcabcgag cbgggcccgc actacacccc 2220 
caacaccgga gagtacaaga acgbggagat 2280 
cabcabgacc ggcgacacgc ccabcaacab 2340 
cabgagccbg aactbcccca bcgccaaggt 2400 
caaggacggc cccaagcbga agcagtggcc 2460 
cgagabcbgc gagaagabgg agaaggacgg 2520 
cbacaacacg cccaccbbcg ccabcaagaa 2580 
cgacbtccgc gagcbgaacc gcgbgaccca 2640 
gcaccccgcc ggcctggcca agcgcaagcg 2700 
ctbcagcabc ccccbggacg aggagbtccg 2760 
gaacaacgcc gagcccggca agcgcbacab 2820 
cagccccgcc abcbbccagb acaccabgcg 2880 
ccccgacgtg accctggbgc agbacabgca 2940 
ccbggagcac gaccgcgtgg tgcbgcagag 3000 
cacgcccgag gagaagbbcc agaaggaccc 3060 
gcccaccaag bggaagcbgc agaagatcga 3120 
cgacabccag aagcbggbgg gcgbgcbgaa 3180 
gaccaagcac cbgbgccgcc bgabccgcgg 3240 
gaccgagatg gccgaggccg agbacgagga 3300 
gggcbgcbac taccaggagg gcaagccccb 3360 
ccagbggagc bacaagabcc accaggagga 3420 
gabcaagaac acccacacca acggcgtgcg 3480 
caaggaggcc atcgbgabcb ggggccaggb 3540 
cgbgbgggag cagtggbgga ccgacbacbg 3600 
cabcagcacc ccbccccbgg bgcgccbggt 3660 
cgaggagacc bacbacaccg acggcagcbg 3720 
cbacabcacc gaccgcggca aggacaaggt 3780 
ggccgagcbc gaggcctbcc bgabggcccb 3840 
cgtggacagc cagbacgbga bgggcabcab 3900 
cctggtgaac cagatcatcg aggagabgab 3960 
gcccgcccac aagggcabcg gcggcaacca 4 02 0 
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ggagatcgac caccbggtga gccagggcat 
gcccgcccag gaggagcacg acaagbacca 
cggcctgccc cgcatcgtgg cccgccagat 
999cgaggcc atccacggcc aggccaacag 
ccacctggag ggcaagafcca bcatcgtggc 
cgaggbgatc ccgcaggaga ccggccgcca 
ccgctggccc abcacccacc bgcacaccga 
gaagabggbg gcctggtggg ccggcatcga 
gagccagggc. gtggtggagg ccabgaacca 
cgagcaggcc aacagcgtgg agaccatcgt 
gcgccgcggc ggcatcggcg acabgacgcc 
cgagcaggag atccagttcc agcagagcaa 
ctaccgcgag ggccgcgacc agctgtggaa 
gggcgccgtg atcctgaagg tgggcaccga 
gatcatcaag gactacggcg gcggcaagga 
cggcgaggcc cgcgaggtgg cctgagaatt 
tctgctgtgc cttctagttg ccagccatct 
accctggaag gtgccactcc cactgtcctt 
tgtctgagta ggtgtcattc tattctgggg 
gattgggaag acaatagcag gcatgctggg 
ctgaagaatt gacccggttc ctcctgggcc 
gacacaccct gtccacgccc ctggttctta 
ctcaggaggg cbccgccbbc aahcccaccc 
cbcabcagcc caccaaacca aacctagcct 
ggcbatbaag tgcagaggga gagaaaatgc 
tcatagaatt ttaaggccat catggcctta 
gcgctcggtc gttcggctgc ggcgagcggt 
atccacagaa tcaggggata acgcaggaaa 
caggaaccgt aaaaaggccg cgttgctggc 
gcatcacaaa aabcgacgcb caagtcagag 
ccaggcgttt ccccctggaa gctccctcgt 
cggatacctg tccgcctttc tcccttcggg 
baggbabctc agttcggtgt aggbcgbbcg 
cgfctcagccc gaccgctgcg ccttatccgg 
acacgacbba tcgccactgg cagcagccac 
aggcggtgct acagagttct tgaagtggtg 
atttggtatc tgcgctctgc tgaagccagt 
atccggcaaa caaaccaccg ctggtagcgg 
gcgcagaaaa aaaggatctc aagaagatcc 
gtggaacgaa aacbcacgbt aagggatttt 
ctagatcctt ttaaattaaa aatgaagttt 
ttggtctgac agtbaccaab gcttaatcag 
tcgttcatcc atagttgcct gactcggggg 
aggtgttgct gactcatacc aggcctgaat 
gccacggttg atgagagctt bgbtgbaggb 
tgccacggaa cggtctgcgt tgtcgggaag 
agbbcgabtb abtcaacaaa gccgccgbcc 
bacaaccaab baaccaabbc bgabbagaaa 
bbabbcabab caggattatc aataccatat 
gaaaactcac cgaggcagtt ccataggatg 
actcgtccaa catcaataca acctattaat 
gagaaatcac catgagtgac gactgaatcc 
ttccagactt gbtcaacagg ccagccabta 
aaaccgbbab bcabbcgbga bbgcgccbga 
ggacaabbac aaacaggaab cgaabgcaac 



ccgccaggtg cbgtbcctgg agaagatcga 4080* 
cagcaacgbg aaggagcbgg bgtbcaagbb 4140 
cgbggacacc tgcgacaagb gccaccagaa 4200 
cgaccbgggc acctggcaga bggccbgcac 4 2 60 
cgtgcacgtg gccagcggcb bcabcgaggc 4320 
gaccgcccbg bbccbgctga agcbggccgg 4380 
caacggcgcc aactbcgcca gccaggaggb 4440 
gcacaccbbc ggcgbgcccb acaacccgca 4500 
ccacctgaag aaccagabcg accgcatccg 4560" 
gcbgabggcc gbgcacbgca bgaactbcaa 4 62 0 
cgccgagcgc cbgabcaaca bgatcaccac 4680 
gaacagcaag tbcaagaact bccgcgtgta 4740 
gggccccggc gagcbgctgb ggaagggcga 4 800 
catcaaggbg gbgccccgcc gcaaggccaa 4860 
ggbggacagc agcagccaca tggaggacac 4920 
cgbbbaaaca cgcgbgbaba cggatccaga 4980 
gbbgbbbgcc ccbcccccgb gccttccbbg 5040 
bcctaabaaa abgaggaaab bgcatcgcab 5100 
ggtggggbgg ggcaggacag caagggggag 5160 
gabgcggbgg gctcbabggg bacccaggtg 5220 
agaaagaagc aggcacabcc ccbbcbcbgb 5280 
gtbccagccc cactcatagg acactcabag 5340 
gcbaaagbac bbggagcggt cbctcccbcc 5400 
ccaagagbgg gaagaaabta aagcaagaba 54 6 0 
ctccaacabg bgaggaagta atgagagaaa 552 0 
atcbtccgct tcctcgcbca cbgacbcgct 5580 
atcagctcac tcaaaggcgg baabacggbb 5640 
gaacatgbga gcaaaaggcc agcaaaaggc 5700 
gtbbtbccat aggcbccgcc cccctgacga 5760 
gtggcgaaac ccgacaggac bataaagaba 5820 
gcgcbctcct gtbccgaccc tgccgcbtac 5880 
aagcgbggcg cbttcbcata gcbcacgcbg 5940 
ctccaagcbg ggcbgbgbgc acgaaccccc 6000 
baacbabcgt cbtgagtcca acccggbaag 6060 
bggbaacagg abbagcagag cgaggbabgt 612 0 
gccbaacbac ggctacacta gaagaacagb 6180 
bacctbcgga aaaagagbbg gbagcbcbbg 6240 
bggbtbbbbt gbbtgcaagc agcagabtac 6300 
btbgabcbtt bcbacggggb cbgacgcbca 6360 
ggbcabgaga bbatcaaaaa ggabcbbcac 6420 
baaatcaabc baaagbabab abgagbaaac 6480 
bgaggcaccb abctcagcga bcbgbcbabb 6540 
gggggggcgc bgaggbcbgc cbcgbgaaga 6600 
cgccccabca bccagccaga aagbgaggga 6660 
ggaccagbbg gbgabbtbga actbbbgcbb 6720 
abgcgbgabc bgatcctbca actcagcaaa 6780 
cgbcaagbca gcgbaatgcb cbgccagbgb 684 0 
aacbcabcga gcabcaaatg aaacbgcaab 6900 
bbbbgaaaaa gccgbbtcbg baabgaagga 6960 
gcaagabccb ggbabcggbc tgcgabbccg 7020 
bbcccctcgt caaaaataag gbtabcaagb 7080 
ggbgagaabg gcaaaagcbb abgcabbbcb 7140 
cgcbcgbcat caaaabcacb cgcabcaacc 7200 
gcgagacgaa abacgcgabc gcbgbbaaaa 7260 
cggcgcagga acacbgccag cgcatcaaca 7320 
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atattttcac cbgaabcagg atattcttct aatacchgga atgctgttth cccggggatc 7380 

gcagtggtga gbaaccatgc atcatcagga gtacggataa aatgcfctgat ggtcggaaga 7440 

ggcataaabt ccgtcagcca gtttagtctg accatcbcat cbgbaacabc attggcaacg 7500 

ctacctttgc cabgbbbcag aaacaacbc.b ggcgcatcgg gcttcccata caatcgatag 7560 

abbgbcgcac cbgabtgccc gacatbabcg cgagcccatt tatacccata taaatcagca 7620 

tccatgttgg aatttaatcg cggcctcgag caagacgttt cccgttgaat abggcbcaba 7680 

acaccccttg babtacbgbb tabgtaagca gacagbttba ttgttcatga tgabatatbt 7740 

ttatctbgtg caatgtaaca tcagagattt bgagacacaa cgbggctbbc cccccccccc 7800 

cattabbgaa gcattbatca gggbtabbgb cbcabgagcg gatacabatb bgaatgtatb 7860 

bagaaaaata aacaaabagg ggbbccgcgc acabtbcccc gaaaagbgcc accbgacgbc 7920 

taagaaacca bbattabcab gacatbaacc babaaaaaba ggcgtabcac gaggcccbbb 7980 

cgbc 7984 



<210> 163 
<211> 9194 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 163 

bcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
tbggcgggbg bcggggcbgg cbtaacbabg cggcabcaga gcagabbgba cbgagagbgc 180 
accababgcg gtgbgaaaba ccgcacagab gcgbaaggag aaaataccgc atcagabtgg 24 0 
ctabbggcca bbgcatacgb bgtabccaba bcabaabatg bacabbtaba tbggcbcabg 3 00 
bccaacabba ccgccabgbb gacabbgabb abbgactagb batbaabagb aabcaatbac 360 
ggggbcabba gbbcabagcc cabababgga gtbccgcgbt acabaacbba cggtaaabgg 420 
cccgccbggc tgaccgccca acgacccccg cccattgacg bcaabaabga cgbabgbbcc 480 
cabagtaacg ccaabaggga cbbbccatbg acgbcaabgg gtggagbatb bacggbaaac 540 
bgcccacbbg gcagbacabc aagbgtabca tabgccaagb acgcccccba btgacgbcaa 600 
bgacggbaaa tggcccgccb ggcabbabgc ccagbacatg acctbabggg actbbccbac 660 
bbggcagbac abcbacgbat bagbcabcgc babbaccabg gbgabgcggb bbbggcagba 720 
cabcaabggg cgbggabagc ggbbbgacbc acggggatbb ccaagbctcc accccabtga 780 
cgbcaabggg agbtbgttbt ggcaccaaaa bcaacgggac tbtccaaaab gtcgbaacaa 84 0 
cbccgcccca btgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb atataagcag 900 
agcbcgtbba gbgaaccgbc agabcgccbg gagacgccat ccacgcbgbb tbgacctcca 960 
bagaagacac cgggaccgat ccagccbccg cggccgggaa cggtgcatbg gaacgcggab 1020 
bccccgtgcc aagagbgacg baagbaccgc cbabagacbc tabaggcaca ccccbbbggc 1080 
bcbbabgcat gctabactgb bbbbggcbbg gggccbabac acccccgcbb ccbbabgcba 1140 
baggbgabgg babagctbag cctabaggbg bgggbbabbg accabbabbg accacbcccc 1200 
batbggbgac gatacbbbcc atbacbaabc cataacabgg cbctbbgcca caacbabcbc 1260 
babbggcbab atgccaatac bcbgtccbtc agagacbgac acggacbcbg babbtbtaca 1320 
ggabggggbc ccabbbabba bbbacaaabb cacababaca acaacgccgb cccccgbgcc 1380 
cgcagttbbb abbaaacaba gcgtgggabc bccacgcgaa bcbcgggtac gtgbbccgga 1440 
catgggcbcb bctccggbag cggcggagcb bccacabccg agccctggbc ccabgccbcc 1500 
agcggcbcab ggtcgcbcgg cagctccbbg ctccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgtgccg cacaaggccg bggcggbagg gbatgbgbcb 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gctgacgcag abggaagacb taaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gbabbcbgab aagagbcaga ggbaacbccc 1740 
gbbgcggtgc bgttaacggt ggagggcagb gtagbcbgag cagbacbcgb bgcbgccgcg 1800 
cgcgccacca gacataabag cbgacagacb aacagactgb bccbbtccab gggbcbbbbc 1860 
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tgcagtcacc gtcgtcgaca bgggcgcccg cgccagcgtg ctgagcggcg gcgagctgga 1920 

ccgctgggag aagatccgcc bgcgccccgg cggcaagaag aagtacaagc tgaagcacat 1980 

cgtgtgggcc agccgcgagc tggagcgctt cgccgtgaac cccggcctgc tggagaccag 2040 

cgagggctgc cgccagabcc tgggccagct gcagcccagc ctgcagaccg gcagcgagga 2100 

gctgcgcagc ctgtacaaca ccgtggccac ccbgtactgc gtgcaccagc gcatcgagat 2160 

caaggacacc aaggaggccc tggacaagat cgaggaggag cagaacaaga gcaagaagaa 2220 

ggcccagcag gccgccgccg acaccggcca cagcaaccag gtgagccaga actaccccat 2280 

cgtgcagaac atccagggcc agabggbgca ccaggccatc agcccccgca cccfcgaacgc 2340 

ctgggtgaag gtggtggagg agaaggcctt cagccccgag gtgatcccca tgttcagcgc 2400 

ccbgagcgag ggcgccaccc cccaggacct gaacaccatg ctgaacaccg tgggcggcca 24 $0 

ccaggccgcc atgcagatgc bgaaggagac catcaacgag gaggccgccg agtgggaccg 2 520 

cgtgcacccc gtgcacgccg gccccatcgc ccccggccag atgcgcgagc cccgcggcag 2580 

cgacatcgcc ggcaccacca gcaccctgca ggagcagatc ggctggatga ccaacaaccc 2640 

ccccatcccc gfcgggcgaga tctacaagcg ctggatcatc cbgggccbga acaagabcgb 2700 

gcgcabgbac agccccacca gcabccbgga catccgccag ggccccaagg agcccttccg 2760 

cgactacgtg gaccgcttct acaagaccct gcgcgccgag caggccagcc aggaggtgaa 2820 

gaactggatg accgagaccc tgctggtgca gaacgccaac cccgacbgca agaccatcct 2880 

gaaggccctg ggccccgccg ccaccctgga ggagatgatg accgcctgcc agggcgtggg 2940 

cggccccggc cacaaggccc gcgtgctggc cgaggccatg agccaggtga ccaacagcgc 3000 

caccatcatg atgcagcgcg gcaacttccg caaccagcgc aagatcgtga agtgcttcaa 3 060 

ctgcggcaag gagggccaca ccgcccgcaa ctgccgcgcc ccccgcaaga agggctgctg 3120 

gaagbgcggc aaggagggcc accagatgaa ggactgcacc gagcgacagg cbaabbbbbb 3180 

agggaagatc tggccttccc acaagggaag gccagggaat tttcttcaga gcagaccaga 3240 

gccaacagcc ccaccagaag agagcttcag gtttggggaa gagacaacaa ctccctctca 33 00 

gaagcaggag ccgatagaca aggaactgta bccbbbagcb bcccbcagat cactctttgg 3360 

cagcgacccc tcgtcacaat aaagataggg ggccagctga aggaggccct bcbagacacc 3420 

ggcgccgacg acaccgtgcb ggaggagabg aacctgcccg gccgctggaa gcccaagatg 34 80 

atcggcggca tcggcggcbt cabcaaggbg ggccagbacg accagabcct gatcgagabc 354 0 

bgcggccaca aggccatcgg caccgtgcbg gbgggcccca cccccgtgaa catcatcggc 3600 

cgcaaccbgc bgacccagab cggctgcacc ctgaacbtcc ccatcagccc catcgagacc 3 660 

gtgcccgbga agcbgaagcc cggcatggac ggccccaagg bgaagcagtg gccccbgacc 3 720 

gaggagaaga bcaaggcccb ggbggagabc bgcaccgaga bggagaagga gggcaagabc 3780 

agcaagabcg gccccgagaa ccccbacaac acccccgbgb bcgccabcaa gaagaaggac 384 0 

agcaccaagb ggcgcaagcb ggbggacbbc cgcgagctga acaagcgcac ccaggacbbc 3 900 

bgggaggbgc agcbgggcab cccccacccc gccggccbga agcagaagaa gagcgbgacc 3960 

gtgcbggacg tgggcgacgc ctacbtcagc gbgccccbgg acaaggactb ccgcaagbac 4020 

accgccbbca ccabccccag catcaacaac gagacccccg gcatccgcba ccagbacaac 4080 

gtgcbgcccc agggctggaa gggcagcccc gccatcbbcc agbgcagcab gaccaagabc 4140 

cbggagcccb bccgcaagca gaaccccgac abcgbgabcb accagtacat ggacgaccbg 42 00 

tacgbgggca gcgaccbgga gatcggccag caccgcacca agatcgagga gcbgcgccag 4260 

caccbgctgc gcbggggcbb caccaccccc gacaagaagc accagaagga gccccccbbc 4320 

cbgbggabgg gcbacgagcb gcaccccgac aagbggaccg bgcagcccat cgbgcbgccc 4 380 

gagaaggaca gcbggaccgb gaacgacatc cagaagcbgg bgggcaagcb gaacbgggcc 4440 

agccagabcb acgccggcab caaggtgcgc cagcbgbgca agcbgcbgcg cggcaccaag 4500 

gcccbgaccg aggbggtgcc cctgaccgag gaggccgagc tggagctggc cgagaaccgc 4 560 

gagabccbga aggagcccgb gcacggcgtg bactacgacc ccagcaagga ccbgatcgcc 4 620 

gagabccaga agcagggcca gggccagbgg accbaccaga bcbaccagga gccctbcaag 4680 

aaccbgaaga ccggcaagba cgcccgcatg aagggcgccc acaccaacga cgbgaagcag 4 740 

cbgaccgagg ccgbgcagaa gabcgccacc gagagcabcg bgabcbgggg caagaccccc 4 800 

aagbbcaagc bgcccabcca gaaggagacc bgggaggccb ggbggaccga gbacbggcag 4 860 

gccacctgga bccccgagtg ggagtbcgbg aacacccccc cccbggbgaa gctgtggtac 4 920 

cagcbggaga aggagcccat catcggcgcc gagaccbbct acgtggacgg cgccgccaac 4980 
cgcgagacca agcbgggcaa ggccggcbac gbgaccgacc gcggccgcca gaaggbggbg 504 0 

ccccbgaccg acaccaccaa ccagaagacc gagcbgcagg ccabccacct ggcccbgcag 510 0 

gacagcggcc bggaggbgaa cabcgbgacc gacagccagb acgccctggg catcabccag 5160 
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gcccagcccg acaagagcga gagcgagctg gtgagccaga tcatcgagca gctgatcaag 5220 
aaggagaagg tgtacctggc ctgggtgccc gcccacaagg gcatcggcgg caacgagcag 5280 
gtggacggcc tggtgagcgc cggcabccgc aaggtgctgt tcctggacgg catcgacaag 5340 
gcccaggagg agcacgagaa gtaccacagc aactggcgcg ccatggccag cgacttcaac 5400 
ctgccccccg tgghggccaa ggagabcgbg gccagctgcg acaagbgcca gctgaagggc 5460 
gaggccatgc acggccaggt ggacbgcagc cccggcatct ggcagctgga cbgcacccac 5 520 
ctggagggca aggbgabccb ggtggccgtg cacgtggcca gcggcbacab cgaggccgag 5580 
gtgatccccg ccgagaccgg ccaggagacc gcctacttcc bgcbgaagcb ggccggccgc 5640 
tggcccgtga agaccgtgca caccgacaac ggcagcaact tcaccagcac caccgtgaag 5700 
gccgccbgcb ggtgggccgg catcaagcag gagttcggca tcccctacaa cccccagagc 5760 
cagggcgtga bcgagagcab gaacaaggag ctgaagaaga tcatcggcca ggbgcgcgac 5820 
caggccgagc acctgaagac cgccgtgcag abggccgbgb tcatccacaa cttcaagcgc 5880 
aagggcggca tcggcggcta cagcgccggc gagcgcabcg tggacatcat cgccaccgac 594 0 
atccagacca aggagctgca gaagcagatc accaagatcc agaacttccg cgbgbacbac 6000 
cgcgacagcc gcgaccccgt gtggaagggc cccgccaagc tgcbgtggaa gggcgagggc 6060 
gccgtggtga tccaggacaa cagcgacatc aaggtggtgc cccgccgcaa ggccaagatc 6120 
atccgcgact acggcaagca gatggccggc gacgacbgcg bggccagccg ccaggacgag 6180 
gactaggaat tctgctgtgc cttctagttg ccagccabcb gbtgttbgcc cctcccccgt 6240 
gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa ahgaggaaat 6300 
tgcatcgcat bgbcbgagba ggtgtcattc tattctgggg ggtggggtgg ggcaggacag 6360 
caagggggag gattgggaag acaatagcag gcabgcbggg gatgcggtgg gcbcbatggg 6420 
tacccaggtg cbgaagaabb gacccggbtc cbccbgggcc agaaagaagc aggcacatcc 6480 
ccbbcbcbgt gacacaccct gtccacgccc ctggttctta gttccagccc cactcatagg 6540 
acactcatag ctcaggaggg ctccgccttc aabcccaccc gctaaagtac ttggagcggt 6600 
cbcbcccbcc cbcabcagcc caccaaacca aaccbagccb ccaagagtgg gaagaaatta 6660 
aagcaagata ggctattaag tgcagaggga gagaaaatgc ctccaacatg tgaggaagta 6720 
atgagagaaa tcatagaatt ttaaggccat catggcctha atcttccgct tcctcgctca 6780 
ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 6840 
taatacggtt atccacagaa tcaggggata acgcaggaaa gaacabgbga gcaaaaggcc 6900 
agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc ghttttccat aggctccgcc 6960 
cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 7020 
tataaagata ccaggcgbbt ccccctggaa gcbccctcgt gcgcbcbccb gttccgaccc 7080 
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg chttctcata 7140 
gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 7200 
acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 7260 
acccggtaag acacgactta bcgccacbgg cagcagccac bggbaacagg abtagcagag 7320 
cgaggbabgb aggcggtgct acagagttch bgaagbggbg gcctaactac ggctacacta 73 80 
gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 7440 
gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 7500 
agcagathac gcgcagaaaa aaaggatctc aagaagabcc tfctgatcttt tctacggggt 7560 
ctgacgctca gtggaacgaa aactcacgtt aagggatttb ggtcatgaga ttatcaaaaa 7620 
ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 7680 
atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga 7740 
tctgtctatt tcgttcatcc atagttgcct gactcggggg gggggggcgc tgaggtctgc 7800 
ctcgtgaaga aggtgttgct gactcatacc aggcctgaat cgccccatca tccagccaga 7860 
aagtgaggga gccacggttg atgagagctt tgtbgtaggt ggaccagtbg gtgatbbbga 7920 
actbbbgcbb bgccacggaa cggbctgcgt bgbcgggaag atgcgtgatc tgabccbtca 7980 
actcagcaaa agbbcgatbb atbcaacaaa gccgccgtcc cgbcaagbca gcgbaabgct 8040 
cbgccagtgb bacaaccaab baaccaabbc bgabtagaaa aacbcabcga gcabcaaabg 8100 
aaacbgcaat tbabbcabab caggabbatc aabaccatab bbbbgaaaaa gccgbbtcbg 8160 
baatgaagga gaaaacbcac cgaggcagbt ccabaggabg gcaagabccb ggtabcggbc 8220 
bgcgabbccg actcgtccaa catcaabaca accbatbaab tbccccbcgb caaaaabaag 82 80 
gbbabcaagt gagaaabcac cabgagbgac gactgaatcc ggbgagaabg gcaaaagcbb 834 0 
atgcabbbcb bbccagactb gbbcaacagg ccagccabba cgcbcgtcat caaaabcact 8400 
cgcatcaacc aaaccgbbab bcabbcgbga bbgcgccbga gcgagacgaa abacgcgatc 8460 
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gctgttaaaa ggacaattac aaacaggaat cgaatgcaac cggcgcagga acactgccag 8520 
cgcatcaaca abatbbbcac ctgaabcagg atattcttct aatacctgga atgctgthht 8580 
cccggggatc gcagbggbga gtaaccatgc atcatcagga gfcacggataa aatgcttgat 8640 
ggtcggaaga ggcataaatt ccgbcagcca gtttagbctg accatctcat ctgtaacatc 8700 
attggcaacg cbaccbbbgc catgtttcag aaacaactct ggcgcabcgg gcttcccata 8760 
caatcgatag attgtcgcac cbgabbgccc gacattatcg cgagcccatt babacccaba 8820 
baaatcagca bccabgbbgg aatttaatcg cggcctcgag caagacgbtb cccgttgaat 8880 
atggctcata acaccccttg tattactgtt tatgtaagca gacagtttta ttgttcabga 8940 
bgabababbt ttatcttgtg caatgtaaca tcagagattt tgagacacaa cgtggctttc 9000 
cccccccccc cabbabbgaa gcabbbabca gggbbabbgb cbcabgagcg gabacababb 9060 
bgaabgbabb bagaaaaaba aacaaabagg ggb.bccgcgc acabbbcccc gaaaagtgcc 9120 
accbgacgbc taagaaacca ttabbabcab gacabbaacc babaaaaaba ggcgbabcac 9180 
gaggcccbbt cgbc 9194 



<210> 164 
<211> 9194 
<212> DNA 

<213> Arbificial Sequence 



<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 164 

bcgcgcgbtb cggbgabgac ggtgaaaacc bcbgacacab gcagctcccg gagacggtca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
btggcgggbg bcggggctgg cbtaacbabg cggcabcaga gcagabbgba cbgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
ctabbggcca btgcatacgb bgtabccaba bcabaababg bacabbbaba bbggcbcabg 300 
bccaacabba ccgccabgtb gacabtgabb abbgactagb babbaatagb aabcaabbac 360 
ggggbcabba gbbcabagcc catababgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc tgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 480 
cabagbaacg ccaabaggga cbbbccabbg acgbcaatgg gbggagbabb bacggbaaac 54 0 
bgcccacbbg gcagbacafcc aagtgbabca babgccaagt acgcccccba bbgacgbcaa 600 
bgacggbaaa tggcccgcct ggcabtabgc ccagbacabg accbtabggg acbtbccbac 660 
btggcagbac abcbacgtab bagtcabcgc babbaccabg gtgabgcggb bbbggcagba 720 
cabcaabggg cgbggabagc ggbbbgactc acggggabbb ccaagbcbcc accccabbga 780 
cgbcaatggg agbbbgbbbt ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
ctccgcccca tbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb ababaagcag 900 
agcbcgbbba gbgaaccgbc agabcgccbg gagacgccab ccacgcbgbb ttgacctcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcatbg gaacgcggab 1020 
bccccgbgcc aagagbgacg baagbaccgc cbabagacbc babaggcaca ccccbbtggc 1080 
bcbbabgcab gcbabactgb bbbbggcbbg gggccbabac acccccgcbb ccbbabgcba 1140 
baggbgabgg babagctbag ccbabaggbg bgggbbabbg accabbabbg accactcccc 1200 
babbggbgac gabacbbbcc abbacbaabc cabaacabgg cbcbbbgcca caacbatcbc 1260 
tabbggcbab atgccaatac bcbgbccbbc agagacbgac acggacbctg tabbbbtaca 1320 
ggabggggbc ccabbbabba bbbacaaabb cacabataca acaacgccgb cccccgtgcc 1380 
cgcagbtbbb abbaaacata gcgtgggabc bccacgcgaa bcbcgggbac gbgbbccgga 144 0 
cabgggcbcb tcbccggbag cggcggagcb bccacabccg agcccbggbc ccabgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgbcb 1620 
gaaaabgagc gbggagatbg ggctcgcacg gcbgacgcag atggaagacb baaggcagcg 16 80 
gcagaagaag abgcaggcag cbgagbbgtb gbabtcbgab aagagbcaga ggtaacbccc 1740 
gbbgcggbgc bgbtaacggb ggagggcagb gbagbcbgag cagtactcgb bgcbgccgcg 18 0 0 
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cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gbcgbcgaca tgggcgcccg cgccagcgtg ctgagcggcg gcgagctgga 192 0 
ccgcbgggag aagabccgcc tgcgccccgg cggcaagaag aagtacaagc bgaagcacab 1980 
cgbgbgggcc agccgcgagc tggagcgctt cgccgtgaac cccggcctgc bggagaccag 2040 
cgagggctgc cgccagatcc tgggccagct gcagcccagc ctgcagaccg gcagcgagga 2100 
gcbgcgcagc ctgtacaaca ccgbggccac cctgtactgc gtgcaccagc gcatcgagat 2160 
caaggacacc aaggaggccc tggacaagat cgaggaggag cagaacaaga gcaagaagaa 2220 
ggcccagcag gccgccgccg acaccggcca cagcaaccag gtgagccaga actaccccat 2280 
cgbgcagaac atccagggcc agatggtgca ccaggccatc agcccccgca ccctgaacgc 2340 
ctgggtgaag gtggtggagg agaaggcctt cagccccgag gtgatcccca tgttcagcgc 2400 
cctgagcgag ggcgccaccc cccaggacct gaacaccatg ctgaacaccg tgggcggcca 2460 
ccaggccgcc atgcagatgc tgaaggagac catcaacgag gaggccgccg agtgggaccg 2520 
cgtgcacccc gtgcacgccg gccccatcgc ccccggccag atgcgcgagc cccgcggcag 2580 
cgacatcgcc ggcaccacca gcacccbgca ggagcagatc ggctggatga ccaacaaccc 2640 
ccccatcccc gtgggcgaga tctacaagcg cbggabcatc ctgggcctga acaagatcgt 2700 
gcgcatgtac agccccacca gcatcctgga catccgccag ggccccaagg agcccttccg 2760 
cgacbacgbg gaccgcttct acaagaccct gcgcgccgag caggccagcc aggaggtgaa 2820 
gaactggatg accgagaccc tgctggfcgca gaacgccaac cccgactgca agaccatcct 2880 
gaaggccctg ggccccgccg ccaccctgga ggagatgatg accgcctgcc agggcgtggg 2940 
cggccccggc cacaaggccc gcgtgctggc cgaggccatg agccaggtga ccaacagcgc 3000 
caccatcatg atgcagcgcg gcaacttccg caaccagcgc aagatcgtga agtgcttcaa 3060 
ctgcggcaag gagggccaca ccgcccgcaa ctgccgcgcc ccccgcaaga agggctgctg 3120 
gaagtgcggc aaggagggcc accagatgaa ggacbgcacc gagcgacagg cbaatbbbbb 3180 
agggaagatc bggccbbccc acaagggaag gccagggaab btbctbcaga gcagaccaga 3240 
gccaacagcc ccaccagaag agagcbbcag gtttggggaa gagacaacaa cbccctctca 3300 
gaagcaggag ccgatagaca aggaacbgta bccbbbagcb tcccbcagab cacbcbtbgg 3360 
cagcgacccc bcgtcacaab aaagabaggg ggccagctga aggaggccct bctagacacc 3420 
ggcgccgacg acaccgbgcb ggaggagatg aacctgcccg gccgcbggaa gcccaagatg 3480 
atcggcggca tcggcggcbb cabcaaggtg ggccagbacg accagatcct gatcgagabc 3540 
bgcggccaca aggccatcgg caccgbgctg gtgggcccca cccccgbgaa cabcabcggc 3600 
cgcaacctgc bgacccagab cggctgcacc ctgaacbbcc ccatcagccc catcgagacc 3660 
gbgcccgbga agcbgaagcc cggcatggac ggccccaagg tgaagcagtg gccccbgacc 3720 
gaggagaaga bcaaggcccb ggbggagatc bgcaccgaga bggagaagga gggcaagabc 3780 
agcaagabcg gccccgagaa cccctacaac acccccgbgb bcgccabcaa gaagaaggac 3840 
agcaccaagb ggcgcaagcb ggbggacttc cgcgagcbga acaagcgcac ccaggacbbc 3900 
tgggaggbgc agcbgggcab cccccacccc gccggcctga agcagaagaa gagcgbgacc 3960 
gtgctggacg bgggcgacgc cbacbbcagc gbgccccbgg acaaggactt ccgcaagbac 4020 
accgccbbca ccabccccag cabcaacaac gagacccccg gcatccgcta ccagbacaac 4080 
gtgcbgcccc agggcbggaa gggcagcccc gccabcbbcc agbgcagcab gaccaagabc 4140 
ctggagcccb bccgcaagca gaaccccgac atcgbgabcb accagbacab ggaccaccbg 4200 
bacgbgggca gcgaccbgga gabcggccag caccgcacca agabcgagga gctgcgccag 4260 
caccbgcbgc gctggggcbb caccaccccc gacaagaagc accagaagga gccccccbbc 4320 
cbgbggabgg gctacgagcb gcaccccgac aagtggaccg bgcagcccab cgbgcbgccc 43 80 
gagaaggaca gctggaccgb gaacgacabc cagaagcbgg bgggcaagcb gaacbgggcc 444 0 
agccagabcb acgccggcab caaggbgcgc cagcbgbgca agcbgcbgcg cggcaccaag 4500 
gcccbgaccg aggtggbgcc ccbgaccgag gaggccgagc bggagcbggc cgagaaccgc 4560 
gagabccbga aggagcccgb gcacggcgfcg bacbacgacc ccagcaagga ccbgatcgcc 4 620 
gagabccaga agcagggcca gggccagbgg accbaccaga bcbaccagga gcccbbcaag 4680 
aaccbgaaga ccggcaagba cgcccgcabg aagggcgccc acaccaacga cgbgaagcag 4740 
cbgaccgagg ccgbgcagaa gabcgccacc gagagcabcg bgatcbgggg caagaccccc 4800 
aagbbcaagc bgcccabcca gaaggagacc bgggaggccb ggbggaccga gbactggcag 4 860 
gccaccbgga bccccgagbg ggagtbcgtg aacacccccc cccbggtgaa gctgbggtac 4.920 
cagcbggaga aggagcccab cabcggcgcc gagacctbct acgbggacgg cgccgccaac 4980 
cgcgagacca agcbgggcaa ggccggcbac gtgaccgacc gcggccgcca gaaggbggbg 5040 
ccccbgaccg acaccaccaa ccagaagacc gagctgcagg ccabccaccb ggcccbgcag 5100 
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gacagcggcc tggaggtgaa catcgtgacc gacagccagt acgccctggg cabcabccag 5160 
gcccagcccg acaagagcga gagcgagctg gtgagccaga tcatcgagca gctgatcaag 5220 
aaggagaagg tgtacctggc ctgggtgccc gcccacaagg gcatcggcgg caacgagcag 5280 
gtggacggcc tggtgagcgc cggcatccgc aaggtgcbgb tcctggacgg catcgacaag 5340 
gcccaggagg agcacgagaa gtaccacagc aacfcggcgcg ccabggccag cgacttcaac 5400 
ctgccccccg tggtggccaa ggagatcgtg gccagctgcg acaagtgcca gctgaagggc 5460 
' gaggccatgc acggccaggt ggactgcagc cccggcatct ggcagctggc afcgcacccac 552 0 
ctggagggca aggtgabccb ggtggccgtg cacgtggcca gcggctacat cgaggccgag 5580 
gtgatccccg ccgagaccgg ccaggagacc gcctacttcc bgctgaagcb ggccggccgc 5640 
tggcccghga agaccgtgca caccgacaac ggcagcaact tcaccagcac caccgtgaag 5700 
gccgcctgct ggtgggccgg catcaagcag gagttcggca tcccctacaa cccccagagc 5760 
cagggcgtga tcgagagcat gaacaaggag ctgaagaaga tcatcggcca ggtgcgcgac 5820 
caggccgagc accbgaagac cgccgbgcag abggccgbgb tcatccacaa cttcaagcgc 5880 
aagggcggca tcggcggcta cagcgccggc gagcgcatcg bggacabcab cgccaccgac 594 0 
atccagacca aggagctgca gaagcagatc accaagatcc agaacttccg cgbgtacbac 6000 
cgcgacagcc gcgaccccgt gtggaagggc cccgccaagc tgctgtggaa gggcgagggc 6060 
gccgtggtga tccaggacaa cagcgacatc aaggbggbgc cccgccgcaa ggccaagatc 6120 
atccgcgact acggcaagca gatggccggc gacgactgcg tggccagccg ccaggacgag 6180 
gactaggaat bcbgcbgbgc cttctagttg ccagccatct gttgtttgcc cctcccccgt 6240 
gccbbcctbg accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat 6300 
fcgcatcgcat bgbcbgagba ggbgbcabbc babbctgggg ggtggggtgg ggcaggacag 6360 
caagggggag gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggg 6420 
tacccaggtg ctgaagaatt gacccggttc ctcctgggcc agaaagaagc aggcacatcc 64 80 
ccbbcbcbgb gacacaccct gtccacgccc ctggttctta gbbccagccc cactcatagg 6540 
acactcatag cbcaggaggg ctccgccttc aatcccaccc gctaaagtac ttggagcggt 6600 
ctctccctcc cbcabcagcc caccaaacca aacctagcct ccaagagtgg gaagaaabta 6660 
aagcaagata ggcbabtaag tgcagaggga gagaaaatgc ctccaacatg tgaggaagta 6720 
atgagagaaa bcabagaabb btaaggccat cabggccbba atcttccgct tcctcgctca 6780 
cbgactcgcb gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 6840 
baabacggbt atccacagaa bcaggggaba acgcaggaaa gaacatgtga gcaaaaggcc 6900 
agcaaaaggc caggaaccgb aaaaaggccg cgttgctggc gbtbbbccab aggctccgcc 6960 
cccctgacga gcatcacaaa aatcgacgct caagbcagag gtggcgaaac ccgacaggac 7020 
bataaagaba ccaggcgttt cccccbggaa gctccctcgt gcgctctcct gbbccgaccc 7080 
bgccgcbtac cggatacctg tccgccbbbc tcccttcggg aagcgbggcg cbtbctcata 7140 
gctcacgctg baggtabcbc agttcggbgt aggtcgbbcg cbccaagctg ggctgbgbgc 7200 
acgaaccccc cgttcagccc gaccgcbgcg ccbtabccgg taactatcgb cttgagbcca 7260 
acccggbaag acacgacbba tcgccacbgg cagcagccac tggbaacagg abbagcagag 7320 
cgaggtatgb aggcggbgct acagagbtcb tgaagbggtg gccbaactac ggcbacacta 7380 
gaagaacagt atbtggbabc tgcgcbcbgc tgaagccagb bacctbcgga aaaagagtbg 7440 
gbagctcbbg abccggcaaa caaaccaccg cbggbagcgg bggbtbbbtt gttbgcaagc 7500 
agcagattac gcgcagaaaa aaaggabcbc aagaagatcc bbbgabcttb bctacggggb 7560 
cbgacgctca gbggaacgaa aacbcacgbb aagggattbb ggbcabgaga bbabcaaaaa 7620 
ggatcttcac cbagabccbb bbaaabbaaa aatgaagtbt baaatcaatc taaagtabab 7680 
atgagtaaac bbggtcbgac agbbaccaab gcbbaabcag bgaggcaccb abctcagcga 774 0 
bcbgbcbabb bcgbbcatcc abagbbgccb gacbcggggg gggggggcgc tgaggtcbgc 7800 
ctcgtgaaga aggtgbtgct gactcatacc aggccbgaat cgccccabca bccagccaga 7860 
aagtgaggga gccacggbtg abgagagcbb bgbbgtaggb ggaccagttg gtgabbbbga 7920 
acbttbgcbt bgccacggaa cggtctgcgb bgtcgggaag atgcgtgatc bgabccttca 7980 
acbcagcaaa agbbcgabtt atbcaacaaa gccgccgbcc cgbcaagtca gcgbaabgcb 8040 
cbgccagbgb bacaaccaab baaccaatbc bgabtagaaa aactcabcga gcatcaaatg 8100 
aaacbgcaab btabbcabab caggabbabc aabaccabab bbbbgaaaaa gccgbtbcbg 8160 
baatgaagga gaaaacbcac cgaggcagtb ccabaggabg gcaagatcct ggbatcggbc 8220 
bgcgabtccg acbcgtccaa catcaabaca accbabbaat ttccccbcgb caaaaabaag 8280 
gttabcaagt gagaaabcac cabgagbgac gacbgaabcc ggtgagaatg gcaaaagcbb 8340 
atgcabbbct tbccagacbb gbbcaacagg ccagccabba cgcbcgtcab caaaatcacb 8400 
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cgcatcaacc aaaccgbtab tcattcgtga hhgcgcctga gcgagacgaa abacgcgatc 8460 
gctgttaaaa ggacaattac aaacaggaat cgaatgcaac cggcgcagga acactgccag 8520 
cgcatcaaca ababbbbcac ctgaatcagg abattcbbct aatacctgga abgcbgtbbb 8580 
cccggggatc gcagbggtga gtaaccatgc atcabcagga gbacggabaa aabgcbbgab 8640 
ggbcggaaga ggcataaabb ccgtcagcea gbbtagtcbg accabcbcab cbgtaacatc 8700 
abbggcaacg cbaccbbtgc cabgbtbcag aaacaacbcb ggcgcabcgg gcbtcccata 8760 
caatcgatag abbgbcgcac cbgabbgccc gacabbabcg cgagcccabb babacccaba 8820 
taaabcagca bccabgbbgg aabtbaabcg cggccbcgag caagacgbbb cccgbbgaab 8880 
abggcbcata acacccctbg babtacbgbb babgbaagca gacagtbbba bbgtbcabga 8940 
bgatababtb bbabcbbgbg caabgbaaca bcagagatbt bgagacacaa cgbggcbbbc 9000 
cccccccccc cabbabbgaa gcatbbabca gggbtabbgt cbcabgagcg gabacababb 9060 
bgaabgbabb bagaaaaaba aacaaabagg ggbbccgcgc acabbbcccc gaaaagbgcc 9120 
accbgacgbc baagaaacca bbabbabcab gacabbaacc babaaaaaba ggcgbabcac 9180 
gaggcccbtb cgbc 9194 



<210> 165 
<211> 12411 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 165 

tcgcgcgbtb cggbgabgac ggbgaaaacc bctgacacat gcagcbcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggtg 120 
tbggcgggtg bcggggcbgg cbbaacbabg cggcatcaga gcagabtgba cbgagagbgc 180 
accabatgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
cbatbggcca bbgcabacgt bgbabccaba bcabaababg bacabbtaba bbggcbcabg 300 
bccaacabta ccgccabgbb gacatbgatb abbgacbagb babbaatagb aabcaatbac 360 
ggggbcabta gbtcabagcc cabababgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 480 
cabagtaacg ccaabaggga cbbtccabbg acgbcaabgg gtggagbabb bacggbaaac 540 
bgcccacbbg gcagbacabc aagbgtabca babgccaagb acgcccccba bbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbbatggg acbbbccbac 660 
ttggcagbac abcbacgtab bagtcabcgc batbaccabg gbgatgcggb bbbggcagba 720 
cabcaabggg cgbggatagc ggbbbgactc acggggattb ccaagbcbcc accccabbga 780 
cgbcaabggg agbbbgbbbb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
cbccgcccca bbgacgcaaa tgggcggbag gcgbgbacgg bgggaggbcb ababaagcag 900 
agcbcgbbba gbgaaccgbc agabcgcctg gagacgccab ccacgcbgbb ttgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabtg gaacgcggab 1020 
bccccgbgcc aagagtgacg baagbaccgc ctabagacbc babaggcaca ccccbbtggc 1080 
bcbbabgcat gcbabactgb btbbggcbtg gggcctatac acccccgctb ccbbabgcta 1140 
baggbgabgg babagcbbag ccbabaggtg bgggbbabbg accabbabbg accacbcccc 1200 
babbggbgac gabactbbcc abbacbaatc cataacatgg cbcbbbgcca caacbabcbc 1260 
babbggcbab abgccaabac bcbgtccbbc agagacbgac acggacbcbg babbbbbaca 1320 
ggabggggbc ccatbtabta bbbacaaabb cacababaca acaacgccgb cccccgtgcc 1380 
cgcagbbbbb abbaaacata gcgbgggatc bccacgcgaa bcbcgggbac gbgbbccgga 1440 
cabgggcbcb tcbccggbag cggcggagcb tccacabccg agcccbggtc ccabgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggtagg gbabgbgtcb 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagact taaggcagcg 1680 
gcagaagaag abgcaggcag ctgagbbgtb gbabbcbgab aagagbcaga ggbaactccc 1740 
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gbbgcggbgc tgttaacggt ggagggcagt gbagbctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgb tccbbbccab gggtcttttc 1860 
tgcagtcacc gtcgtcgaca tgggcgcccg cgccagcgtg ctgagcggcg gcgagctgga 1920 
ccgctgggag aagatccgcc tgcgccccgg cggcaagaag aagtacaagc tgaagcacat 1980 
cgtgtgggcc agccgcgagc tggagcgctt cgccgtgaac cccggcctgc tggagaccag 2040 
cgagggctgc cgccagatcc tgggccagct gcagcccagc ctgcagaccg gcagcgagga 2100 
gctgcgcagc ctgtacaaca ccgtggccac ccbgbacbgc gtgcaccagc gcatcgagat 2160 
caaggacacc aaggaggccc tggacaagat cgaggaggag cagaacaaga gcaagaagaa 2220 
ggcccagcag gccgccgccg acaccggcca cagcaaccag gbgagccaga actaccccat 2280 
cgtgcagaac ahccagggcc agatggtgca ccaggccatc agcccccgca ccctgaacgc 2340 
ctgggtgaag gtggtggagg agaaggcctt cagccccgag gtgatcccca tgttcagcgc 2400 
cctgagcgag ggcgccaccc cccaggacct gaacaccabg ctgaacaccg tgggcggcca 2460 
ccaggccgcc abgcagatgc tgaaggagac catcaacgag gaggccgccg agtgggaccg 2 520 
cgtgcacccc gtgcacgccg gccccatcgc ccccggccag atgcgcgagc cccgcggcag 2580 
cgacatcgcc ggcaccacca gcaccctgca ggagcagatc ggctggatga ccaacaaccc 2640 
ccccatcccc gtgggcgaga tctacaagcg ctggatcatc ctgggcctga acaagatcgt 2700 
gcgcabgbac agccccacca gcatcctgga catccgccag ggccccaagg agcccttccg 2760 
cgactacgtg gaccgcttct acaagacccb gcgcgccgag caggccagcc aggaggtgaa 2820 
gaactggatg accgagaccc tgctggtgca gaacgccaac cccgactgca agaccatcct 2880 
gaaggccctg ggccccgccg ccaccctgga ggagatgabg accgcctgcc agggcgtggg 2 940 
cggccccggc cacaaggccc gcgtgctggc cgaggccatg agccaggtga ccaacagcgc 3000 
caccatcatg atgcagcgcg gcaacttccg caaccagcgc aagatcgtga agbgcbbcaa 3060 
ctgcggcaag gagggccaca ccgcccgcaa ctgccgcgcc ccccgcaaga agggcbgcfcg 3120 
gaagbgcggc aaggagggcc accagatgaa ggactgcacc gagcgacagg cbaatbbbtt. 3180 
agggaagatc bggcctbccc acaagggaag gccagggaab tttcttcaga gcagaccaga 3240 
gccaacagcc ccaccagaag agagcttcag gtttggggaa gagacaacaa cbcccbcbca 3 300 
gaagcaggag ccgatagaca aggaactgta tcctttagct tccctcagat cactctttgg 3360 
cagcgacccc tcgbcacaab aaagataggg ggccagcbga aggaggccch tctagacacc 3420 
ggcgccgacg acaccgtgct ggaggagatg aaccbgcccg gccgctggaa gcccaagabg 3480 
atcggcggca bcggcggcbb catcaaggtg ggccagbacg accagabccb gatcgagatc 354 0 
tgcggccaca aggccatcgg caccgtgctg gtgggcccca cccccgtgaa catcatcggc 3600 
cgcaacctgc tgacccagab cggctgcacc cbgaacbbcc ccatcagccc catcgagacc 3660 
gbgcccgbga agctgaagcc cggcatggac ggccccaagg tgaagcagtg gcccctgacc 3720 
gaggagaaga bcaaggcccb ggtggagatc tgcaccgaga tggagaagga gggcaagatc 37 8 0 
agcaagatcg gccccgagaa cccctacaac acccccgtgt bcgccabcaa gaagaaggac 3840 
agcaccaagt ggcgcaagcb ggtggacttc cgcgagcbga acaagcgcac ccaggacttc 3900 
tgggaggtgc agctgggcat cccccacccc gccggcctga agcagaagaa gagcgtgacc 3960 
gtgctggacg bgggcgacgc ctacttcagc gbgcccctgg acaaggactb ccgcaagtac 4020 
accgccttca ccatccccag cabcaacaac gagacccccg gcatccgcta ccagtacaac 4080 
gtgctgcccc agggctggaa gggcagcccc gccatcttcc agtgcagcat gaccaagatc 414 0 
ctggagccct bccgcaagca gaaccccgac abcgbgabct accagtacat ggacgacctg 4200 
tacgbgggca gcgaccbgga gatcggccag caccgcacca agabcgagga gctgcgccag 4260 
cacctgcfcgc gcbggggctt caccaccccc gacaagaagc accagaagga gccccccbbc 4320 
cbgbggabgg gcbacgagcb gcaccccgac aagbggaccg bgcagcccab cgtgcbgccc 4380 
gagaaggaca gcbggaccgb gaacgacabc cagaagcbgg tgggcaagcb gaactgggcc 4440 
agccagabcb acgccggcat caaggbgcgc cagcbgbgca agcbgcbgcg cggcaccaag 4500 
gcccbgaccg aggbggbgcc cctgaccgag gaggccgagc bggagcbggc cgagaaccgc 4560 
gagatccbga aggagcccgb gcacggcgbg tacbacgacc ccagcaagga ccbgatcgcc 4620 
gagabccaga agcagggcca gggccagbgg accbaccaga bcbaccagga gcccbbcaag 4 680 
aaccbgaaga ccggcaagba cgcccgcabg aagggcgccc acaccaacga cgbgaagcag 474 0 
cbgaccgagg ccgbgcagaa gabcgccacc gagagcabcg tgabcbgggg caagaccccc 4800 
aagbbcaagc tgcccatcca gaaggagacc bgggaggccb ggbggaccga gtactggcag 4860 
gccaccbgga bccccgagbg ggagbbcgbg aacacccccc cccbggbgaa gcbgbggbac 4920 
cagctggaga aggagcccat cabcggcgcc gagaccbbcb acgtggacgg cgccgccaac 4980 
cgcgagacca agcbgggcaa ggccggcbac gbgaccgacc gcggccgcca gaaggbggtg 5040 
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cccctgaccg acaccaccaa ccagaagacc gagctgcagg ccatccacct ggccctgcag 5100 
gacagcggcc tggaggtgaa catcgtgacc gacagccagt acgccctggg catcatccag 5160 
gcccagcccg acaagagcga gagcgagctg gtgagccaga tcabcgagca gcbgatcaag 5220 
aaggagaagg tgtacctggc ctgggtgccc gcccacaagg gcatcggcgg caacgagcag 5280 
gtggacggcc tggtgagcgc cggcatccgc aaggtgctgt tcctggacgg catcgacaag 5340 
gcccaggagg agcacgagaa gtaccacagc aactggcgcg ccatggccag cgacttcaac 5400 
ctgccccccg tggtggccaa ggagabcgbg gccagctgcg acaagbgcca gctgaagggc 5460 
gaggccatgc acggccaggb ggactgcagc cccggcabct ggcagcbgga cbgcacccac 5520 
ctggagggca aggtgatcct ggtggccgtg cacghggcca gcggctacat cgaggccgag 5580 
gtgatccccg ccgagaccgg ccaggagacc gccbacbbcc bgctgaagct ggccggccgc 5640 
tggcccgtga agaccgtgca caccgacaac ggcagcaact tcaccagcac caccgtgaag 5700 
gccgcctgct ggbgggccgg catcaagcag gagttcggca tcccctacaa cccccagagc 5760 
cagggcgtga tcgagagcat gaacaaggag ctgaagaaga tcatcggcca ggtgcgcgac 5 82 0 
caggccgagc acctgaagac cgccgtgcag atggccgtgt tcatccacaa chtcaagcgc 5880 
aagggcggca tcggcggcta cagcgccggc gagcgcatcg tggacatcat cgccaccgac 5940 
atccagacca aggagcbgca gaagcagatc accaagatcc agaacttccg cgtgtactac 6000 
cgcgacagcc gcgaccccgt gtggaagggc cccgccaagc tgctgtggaa gggcgagggc 6060 
gccgtggtga bccaggacaa cagcgacabc aaggtggtgc cccgccgcaa ggccaagatc 6120 
atccgcgact acggcaagca gatggccggc gacgactgcg bggccagccg ccaggacgag 6180 
gactaggaat tcacgcgtcg agcatgcatc bagggcggcc aatbccgccc cbcbcccbcc 6240 
ccccccccba acgtbacbgg ccgaagccgc ttggaabaag gccggbgbgc gbbtgbcbat 6300 
abgbgabttt ccaccatabb gccgtctttb ggcaatgtga gggcccggaa acctggcccb 6360 
gbcbtcbtga cgagcattcc taggggfcctb tccccbcbcg ccaaaggaab gcaaggbctg 6420 
bbgaabgtcg bgaaggaagc agttccbctg gaagctbcbb gaagacaaac aacgbctgba 64 80 
gcgaccctbb gcaggcagcg gaacccccca cctggcgaca ggtgccbctg cggccaaaag 6540 
ccacgbgbab aagabacacc bgcaaaggcg gcacaacccc agtgccacgb tgbgagbtgg 6600 
abagbtgbgg aaagagtcaa abggcbcbcc tcaagcgbab bcaacaaggg gcbgaaggab 6660 
gcccagaagg baccccabbg babgggabcb gabcbggggc ctcggbgcac atgcbbbaca 6720 
bgbgbtbagb cgaggbbaaa aaaacgtcta ggccccccga accacgggga cgtggbbbtc 6780 
cbbbgaaaaa cacgabgata agcbbgccac aacccgggat cctctagaac accatgcgcg 6840 
bgaaggagaa gtaccagcac cbgbggcgct ggggcbggcg ctggggcacc abgcbgctgg 6900 
gcabgcbgab gabcbgcagc gccaccgaga agcbgbgggb gaccgbgbac bacggcgbgc 6960 
ccgbgtggaa ggaggccacc accacccbgt bcbgcgccag cgacgccaag gccbacgaca 7020 
ccgaggbgca caacgtgbgg gccacccacg ccbgcgtgcc caccgacccc aacccccagg 7080 
aggtggtgct ggbgaacgbg accgagaact tcgacatgtg gaagaacgac atggbggagc 7140 
agatgcacga ggacabcabc agccbgbggg accagagccb gaagcccbgc gbgaagcbga 7200 
cccccctgbg cgbgagcctg aagbgcaccg accbgaagaa cgacaccaac accaacagca 7260 
gcagcggccg cabgabcabg gagaagggcg agabcaagaa cbgcagcttc aacabcagca 7320 
ccagcabccg cggcaaggtg cagaaggagt acgccbbcbb cbacaagcbg gacabcabcc 73 80 
ccabcgacaa cgacaccacc agcbacagcc bgaccagcbg caacaccagc gbgatcaccc 7440 
aggcctgccc caaggbgagc bbcgagccca bccccatcca cbacbgcgcc cccgccggcb 7500 
tcgccatccb gaagtgcaag gacaagaagb tcaacggcaa gggccccfcgc accaacgtga 7560 
gcaccgtgca gbgcacccac ggcatccgcc ccgbggbgag cacccagcbg ctgctgaacg 7620 
gcagccbggc cgaggaggag gbggbgabcc gcagcgccaa cbbcgccgac aacgccaagg 7680 
tgatcatcgb gcagctgaac gagagcgbgg agabcaacbg cacccgcccc aacaacaaca 7740 
cccgcaagag cabccacatc ggccccggcc gcgccttcba caccaccggc gagabcabcg 7 800 
gcgacabccg ccaggcccac bgcaaccbga gccgcgccaa gtggaacgac acccbgaaca 7860 
agabcgbgab caagctgcgc gagcagbbcg gcaacaagac cabcgbgbbc aagcacagca 7920 
gcggcggcga ccccgagabc gbgacccaca gcbbcaacbg cggcggcgag btcbtctact 7 980 
gcaacagcac ccagctgbbc aacagcaccb ggbbcaacag caccbggagc accgagggca 8040 
gcaacaacac cgagggcagc gacaccabca cccbgcccbg ccgcabcaag cagabcatca 8100 
acabgbggca gaaggbgggc aaggccatgb acgccccccc cabcagcggc cagatccgcb 8160 
gcagcagcaa cabcaccggc ctgcbgcbga cccgcgacgg cggcaacagc aacaacgaga 8220 
gcgagabcbb ccgccbgggc ggcggcgaca tgcgcgacaa cbggcgcagc gagctgbaca 82 80 
agtracaaggt ggbgaagabc gagccccbgg gcgbggcccc caccaaggcc aagcgccgcg 834 0 
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bggtgcagcg cgagaagcgc gccgtgggca tcggcgccct gbbccbgggc bbccbgggcg 8400 
ccgccggcag caccatgggc gccgccagca bgacccbgac cgtgcaggcc cgccagcbgc 84.60 
tgagcggcat cgtgcagcag cagaacaacc tgctgcgcgc catcgaggcc cagcagcacc 8520 
tgctgcagct gaccgbgtgg ggcatcaagc agctgcaggc ccgcatcctg gccgtggagc 8580 
gctacctgaa ggaccagcag ctgctgggca tctggggctg cagcggcaag cttatctgca 8640 
ccaccgccgt gccctggaac gccagcbgga gcaacaagag cctggagcag atctggaacc 8700 
acaccacctg gatggagtgg gaccgcgaga tcaacaacta caccagcctg atccacagcc 8760 
tgatcgagga gagccagaac cagcaggaga agaacgagca ggagctgctg gagctggaca 8820 
agtgggccag cctgtggaac tggttcaaca tcaccaactg gctgtggtac atcaagctgt 8880 
tcatcatgat cgtgggcggc ctggtgggcc tgcgcabcgb gttcgccgtg ctgagcatcg 8940 
tgaaccgcgt gcgccagggc tacagccccc tgagcttcca gacccacctg cccatccccc 9000 
gcggccccga ccgccccgag ggcatcgagg aggagggcgg cgagcgcgac cgcgaccgca 9060 
gcatccgcct ggtgaacggc agcctggccc tgabcbggga cgacctgcgc agcctgtgcc 912 0 
tgttcagcta ccaccgcctg cgcgacctgc tgctgatcgt gacccgcatc gbggagctgc 5180 
tgggccgccg cggctgggag gccctgaagt actggtggaa cctgctgcag tacbggagcc 9240 
aggagcbgaa gaacagcgcc gbgagcctgc tgaacgccac cgccabcgcc gbggccgagg 9300 
gcaccgaccg cgbgabcgag gbggbgcagg gcgcctgccg cgccabccgc cacabccccc 9360 
gccgcatccg ccagggccbg gagcgcabcc bgcbgbgagg abccagabcb gctgbgccbb 9420 
ctagbbgcca gccatcbgbb gbbtgcccct cccccgbgcc btccbbgacc cbggaaggbg 9480 
ccacbcccac bgbccbbbcc baataaaabg aggaaabbgc atcgcabtgb cbgagbaggb 9540 
gbcabtctab bcbggggggt ggggtggggc aggacagcaa gggggaggab bgggaagaca 9600 
abagcaggca bgcbggggab gcggbgggct ctatgggbac ccaggbgcbg aagaatbgac 9660 
ccggbtcctc cbgggccaga aagaagcagg cacabccccb bcbcbgbgac acaccctgbc 9720 
cacgccccbg gbbcttagbb ccagccccac tcataggaca ctcabagcbc aggagggcbc 9780 
cgccbbcaab cccacccgcb aaagbactbg gagcggbctc bcccbcccbc abcagcccac 984 0 
caaaccaaac cbagcctcca agagbgggaa gaaabtaaag caagabaggc battaagbgc 9900 
agagggagag aaaatgccbc caacatgtga ggaagbaatg agagaaatca bagaatbbta 9960 
aggccatcab ggccbbaabc bbccgctbcc bcgcbcactg acbcgcbgcg cbcggbcgbb 10020 
cggcbgcggc gagcggbabc agcbcacbca aaggcggbaa bacggbbabc cacagaabca 10080 
ggggataacg caggaaagaa cabgbgagca aaaggccagc aaaaggccag gaaccgbaaa 1014 0 
aaggccgcgb bgcbggcgbb bbbccabagg ctccgccccc cbgacgagca tcacaaaaab 10200 
cgacgcbcaa gbcagaggbg gcgaaacccg acaggacbat aaagatacca ggcgbtbccc 10260 
ccbggaagcb cccbcgbgcg cbcbccbgbt ccgacccbgc cgcbbaccgg ataccbgbcc 10320 
gcctbtcbcc cbbcgggaag cgbggcgcbt bcbcabagcb cacgctgtag gtabctcagb 103 80 
bcggbgbagg bcgbbcgctc caagctgggc bgtgbgcacg aaccccccgb bcagcccgac 10440 
cgctgcgccb batccggtaa cbatcgtctb gagbccaacc cggtaagaca cgacbbabcg 10500 
ccacbggcag cagccacbgg baacaggabb agcagagcga ggbabgtagg cggbgcbaca 10560 
gagbbcbbga agbggbggcc baacbacggc tacacbagaa gaacagbabb tggbabcbgc 10620 
gcbcbgctga agccagbbac cbbcggaaaa agagbbggta gcbcbbgabc cggcaaacaa 10680 
accaccgcbg gbagcggbgg bbtbbbbgbb bgcaagcagc agabbacgcg cagaaaaaaa 1074 0 
ggabcbcaag aagabccbbb gabcbtbbcb acggggtcbg acgcbcagbg gaacgaaaac 10800 
tcacgbbaag ggabbbbggb cabgagatta bcaaaaagga tcbtcaccba gabccbbbta 10860 
aabbaaaaab gaagbbbbaa atcaabcbaa agbabatabg agbaaactbg gbcbgacagb 10920 
baccaabgcb baabcagbga ggcaccbabc bcagcgabcb gbctattbcg btcabccaba 10980 
gbbgccbgac bcgggggggg ggggcgcbga ggbcbgccbc gtgaagaagg bgbbgcbgac 1104 0 
bcabaccagg cctgaabcgc cccatcabcc agccagaaag bgagggagcc acggbbgabg 11100 
agagcbtbgb bgbaggbgga ccagbbggbg abbtbgaacb bbbgcbbbgc cacggaacgg 11160 
tcbgcgbbgb cgggaagabg cgbgabctga bcctbcaacb cagcaaaagb bcgabbbabb 11220 
caacaaagcc gccgbcccgb caagbcagcg baabgcbcbg ccagtgbbac aaccaabbaa 11280 
ccaabbcbga bbagaaaaac bcabcgagca bcaaabgaaa ctgcaabbba bbcababcag 11340 
gabbabcaab accababbbb bgaaaaagcc gbbbcbgtaa bgaaggagaa aacbcaccga 11400 
ggcagbbcca baggabggca agabccbggb abcggbcbgc gabbccgacb cgbccaacab 11460 
caabacaacc babbaabbbc cccbcgbcaa aaabaaggbb atcaagbgag aaatcaccab 11520 
gagbgacgac bgaabccggb gagaabggca aaagcbbabg cabbbcbbbc cagacbbgbb 11580 
caacaggcca gccabtacgc fccgbcabcaa aafccacbcgc atcaaccaaa ccgbbabbca 1164 0 
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ttcgtgattg cgcctgagcg agacgaaata cgcgatcgct gbbaaaagga caattacaaa 11700 
caggaatcga atgcaaccgg cgcaggaaca ctgccagcgc atcaacaata ttttcacctg 11760 
aatcaggata bbcbbcbaat acctggaatg ctgttttccc ggggatcgca gbggbgagba 1182 0 
accatgcatc atcaggagba cggataaaab gcbbgatggb cggaagaggc abaaabtccg 11880 
tcagccagtt tagtctgacc atctcatctg taacabcabb ggcaacgcta cctbbgccab 11940 
gtttcagaaa caacbctggc gcatcgggct tcccatacaa tcgatagatt gtcgcacctg 12000 
attgcccgac, attatcgcga gcccabtbat acccatataa atcagcatcc abgbbggaab 12060 
ttaatcgcgg cctcgagcaa gacgtttccc gttgaatatg gctcataaca ccccttgtab 12120 
tactgtttat gtaagcagac agbbtbabbg ttcatgatga bababbbbta bcbtgbgcaa 12180 
tgbaacabca gagabbtbga gacacaacgt ggctbtcccc ccccccccat babtgaagca 1224 0 
bttatcaggg bbatbgtctc abgagcggab acatabbbga atgbabttag aaaaataaac 12300 
aaabaggggb tccgcgcaca tttccccgaa aagtgccacc bgacgbctaa gaaaccatba 12360 
tbabcabgac abtaacctat aaaaabaggc gtabcacgag gcccbbtcgb c 12411 



<210> 166 
<211> 9170 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HXV genes 
<400> 166 

tcgcgcgbbb cggtgabgac ggbgaaaacc bctgacacat gcagcbcccg gagacggtca 60 
cagcbtgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
ttggcgggbg tcggggcbgg ctbaacbatg cggcabcaga gcagabtgba ctgagagtgc 180 
accatatgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaataccgc abcagabbgg 240 
cbabbggcca btgcabacgb tgbabccata bcataatabg bacabbbata bbggcbcabg 300 
tccaacabba ccgccabgbb gacabtgatb abtgacbagb tabbaabagb aabcaabbac 360 
ggggbcabba gbbcabagcc catababgga gbtccgcgtb acataacbba cggbaaabgg 420 
cccgcctggc bgaccgccca acgacccccg cccatbgacg tcaabaabga cgbabgttcc 480 
cabagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagbabb bacggbaaac 540 
bgcccacbtg gcagtacabc aagtgfcatca batgccaagb acgcccccba ttgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbtabggg acbbtccbac 660 
tbggcagbac abcbacgbab bagbcatcgc babbaccabg gbgabgcggb bbtggcagba 720 
cabcaabggg cgtggabagc ggbbbgacbc acggggabtb ccaagbcbcc accccabbga 780 
cgbcaabggg agbtbgbbbb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
cbccgcccca ttgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb ababaagcag 900 
agcbcgbbba gbgaaccgbc agabcgccbg gagacgccat ccacgcbgbb bbgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcatbg gaacgcggab 1020 
tccccgbgcc aagagbgacg baagbaccgc cbabagacbc tabaggcaca ccccbbtggc 1080 
tcbbatgcab gcbabactgb btbbggcbbg gggcctatac acccccgcbb ccbbabgcba 1140 
taggtgatgg tabagcbbag ccbabaggbg bgggbtabbg accabbabbg accacbcccc 1200 
tabtggtgac gabacttbcc abbacbaabc cabaacabgg cbcbbbgcca caacbabcbc 1260 
babtggcbab abgccaatac bcbgtcctbc agagacbgac acggactcbg tabbbbtaca 1320 
ggabggggtc ccatbbabba tbbacaaatt cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbtbtb abbaaacaba gcgbgggabc bccacgcgaa bcbcgggtac gbgbbccgga 1440 
cabgggcbct bcbccggbag cggcggagcb bccacabccg agcccbggbc ccatgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgtgbct 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gbattcbgab aagagbcaga ggbaacbccc 1740 
gtbgcggbgc tgbbaacggb ggagggcagb gbagbcbgag cagbacbcgt bgcbgccgcg 1800 
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cgcgccacca gacataatag ctgacagact aacagactgt bccbbbccab gggbctbbbc 1860 

bgcagbcacc gtcgtcgaca tgggcgcccg cgccagcgtg cfcgagcggcg gcgagctgga 1920 

ccgc.tgggag aagatccgcc tgcgccccgg cggcaagaag aagtacaagc tgaagcacat 1980 

cgtgtgggcc agccgcgagc tggagcgcbb cgccgtgaac cccggcctgc bggagaccag 204 0 

cgagggcbgc cgccagatcc tgggccagct gcagcccagc ctgcagaccg gcagcgagga 2100 

gctgcgcagc ctgtacaaca ccgtggccac cctgtactgc gtgcaccagc gcatcgagat 2160 

caaggacacc aaggaggccc tggacaagat cgaggaggag cagaacaaga gcaagaagaa 2220 

ggcccagcag gccgccgccg acaccggcca cagcaaccag gtgagccaga acbaccccat 2280 

cghgcagaac atccagggcc agatggtgca ccaggccatc agcccccgca ccctgaacgc 2340 

ctgggtgaag gtggbggagg agaaggcctt cagccccgag gtgabcccca tgttcagcgc 2400 

cctgagcgag ggcgccaccc cccaggacct gaacaccatg ctgaacaccg tgggcggcca 2460 

ccaggccgcc atgcagatgc tgaaggagac catcaacgag gaggccgccg agbgggaccg 2520 

cgtgcacccc gtgcacgccg gccccatcgc ccccggccag atgcgcgagc cccgcggcag 2580 

cgacatcgcc ggcaccacca gcaccctgca ggagcagatc ggctggatga ccaacaaccc 2640 

ccccatcccc gtgggcgaga tctacaagcg cbggatzcabc ctgggcctga acaagatcgt 2700 

gcgcatgtac agccccacca gcatcctgga catccgccag ggccccaagg agccctbccg 2760 

cgacbacgbg gaccgcttct acaagaccct gcgcgccgag caggccagcc aggaggtgaa 2820 

gaactggatg accgagaccc tgctggtgca gaacgccaac cccgactgca agaccatcct 2880 

gaaggccctg ggccccgccg ccaccctgga ggagatgatg accgcctgcc agggcgtggg 2940 

cggccccggc cacaaggccc gcgtgctggc cgaggccatg agccaggtga ccaacagcgc 3000 

caccatcatg atgcagcgcg gcaacttccg caaccagcgc aagabcgtga agtgcttcaa 3060 

ctgcggcaag gagggccaca ccgcccgcaa ctgccgcgcc ccccgcaaga agggctgctg 3120 

gaagtgcggc aaggagggcc accagatgaa ggactgcacc gagcgacagg ctaataggga 3180 

agatctggcc btcccacaag ggaaggccag ggaabtbbct tcagagcaga ccagagccaa 324 0 

cagccccacc agaagagagc btcaggtbtg gggaagagac aacaactccc bctcagaagc 3 300 

aggagccgab agacaaggaa ctgbabcctt tagcttccct cagatcacbc bttggcagcg 3360 

acccctcgtc acaabaaaga tagggggcca gctgaaggag gcccttcbag acaccggcgc 3420 

cgacgacacc gtgcbggagg agatgaacct gcccggccgc bggaagccca agatgabcgg 34 80 

cggcatcggc ggctbcatca aggtgggcca gtacgaccag atccbgatcg agatcbgcgg 354 0 

ccacaaggcc abcggcaccg bgcbggtggg ccccaccccc gbgaacatca bcggccgcaa 3600 

cctgctgacc cagabcggct gcacccbgaa ctbccccatc agccccatcg agaccgbgcc 3660 

cgbgaagcbg aagcccggca bggacggccc caaggbgaag cagbggcccc tgaccgagga 3 720 

gaagatcaag gcccbggtgg agabcbgcac cgagatggag aaggagggca agatcagcaa 3780 

gabcggcccc gagaacccct acaacacccc cgbgbtcgcc abcaagaaga aggacagcac 3840 

caagtggcgc aagcbggtgg actbccgcga gcbgaacaag cgcacccagg acbtcbggga 3900 

ggtgcagctg ggcatccccc accccgccgg cctgaagcag aagaagagcg tgaccgtgcb 3960 

ggacgbgggc gacgcctact bcagcgtgcc ccbggacaag gactbccgca agbacaccgc 4020 

cbtcaccabc cccagcabca acaacgagac ccccggcabc cgcbaccagt acaacgbgct 4080 

gccccagggc bggaagggca gccccgccat ctbccagtgc agcabgacca agatcctgga 4140 

gcccbbccgc aagcagaacc ccgacabcgb gabctaccag bacabggacc acctgbacgt 4200 

gggcagcgac cbggagatcg gccagcaccg caccaagatc gaggagcbgc gccagcaccb 4260 

gctgcgctgg ggctbcacca cccccgacaa gaagcaccag aaggagcccc ccbbccbgbg 4320 

gatgggcbac gagctgcacc ccgacaagtg gaccgbgcag cccatcgtgc bgcccgagaa 4380 

ggacagctgg accgbgaacg acabccagaa gcbggtgggc aagcbgaact gggccagcca 4 44 0 

gatctacgcc ggcatcaagg bgcgccagct gtgcaagctg ctgcgcggca ccaaggccct 4500 

gaccgaggbg gtgccccbga ccgaggaggc cgagcbggag cbggccgaga accgcgagab 4560 

ccbgaaggag cccgtgcacg gcgtgtacba cgaccccagc aaggaccbga tcgccgagab 4620 
ccagaagcag ggccagggcc agtggaccba ccagatctac caggagccct bcaagaaccb 4 680 

gaagaccggc aagbacgccc gcabgaaggg cgcccacacc aacgacgtga agcagcbgac 474 0 

cgaggccgbg cagaagabcg ccaccgagag catcgbgabc bggggcaaga cccccaagbt 4800 

caagcbgccc abccagaagg agaccbggga ggccbggtgg accgagbact ggcaggccac 4860 

ctggatcccc gagtgggagb bcgbgaacac ccccccccbg gtgaagctgb ggbaccagcb 4 920 
ggagaaggag cccatcatcg gcgccgagac cbbcbacgbg gacggcgccg ccaaccgcga 4980 
gaccaagctg ggcaaggccg gctacgtgac cgaccgcggc cgccagaagg bggtgcccct 5040 
gaccgacacc accaaccaga agaccgagcb gcaggccatc caccbggccc bgcaggacag 5100 
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cggcctggag gtgaacatcg tgaccgacag ccagtacgcc ctgggcatca tccaggccca 5160 

gcccgacaag agcgagagcg agcbggtgag ccagatcatc gagcagctga tcaagaagga 522 0 

gaaggtgtac ctggcctggg tgcccgccca caagggcatc ggcggcaacg agcaggtgga 5280 

cggcctggtg agcgccggca tccgcaaggt gcbgbbccbg gacggcatcg acaaggccca 5340 

ggaggagcac gagaagtacc acagcaactg gcgcgccatg gccagcgact tcaacctgcc 5400 

ccccgtggtg gccaaggaga tcgtggccag cbgcgacaag tgccagctga agggcgaggc 54 60 

cabgcacggc caggtggact gcagccccgg catctggcag ctggcatgca cccacctgga 5520 

gggcaaggtg atcctggtgg ccgbgcacgb ggccagcggc tacatcgagg ccgaggtgat 55 80 

ccccgccgag accggccagg agaccgccba cttcctgctg aagctggccg gccgctggcc 5640 

cgtgaagacc gtgcacaccg acaacggcag caactbcacc agcaccaccg tgaaggccgc 5700 

ctgctggtgg gccggcatca agcaggagtt cggcatcccc tacaaccccc agagccaggg 5760 

cgtgatcgag agcatgaaca aggagctgaa gaagatcatc ggccaggtgc gcgaccaggc 5820 

cgagcacctg aagaccgccg bgcagatggc cgtgttcatc cacaacbbca agcgcaaggg 5880 

cggcatcggc ggcbacagcg ccggcgagcg cabcgbggac atcatcgcca ccgacatcca 5940 

gaccaaggag ctgcagaagc agatcaccaa gatccagaac bbccgcgbgb actaccgcga 6000 

cagccgcgac cccgtgtgga agggccccgc caagctgctg tggaagggcg agggcgccgb 6060 

ggbgabccag gacaacagcg acatcaaggb ggtgccccgc cgcaaggcca agatcatccg 6120 

cgactacggc aagcagatgg ccggcgacga ctgcgtggcc agccgccagg acgaggacta 6180 

ggaabbcbbg ctgtgccttc tagttgccag ccabcbgbbg tttgcccctc ccccgtgcct 6240 

tccttgaccc bggaaggbgc cacbcccacb gtcctttcct aataaaatga ggaaattgca 63 00 

bcgcabbgbc bgagbaggbg tcattctatt ctggggggtg gggtggggca gcacagcaag 63 60 

ggggaggatt gggaagacaa tagcaggcat gctggggatg cggtgggcbc babgggbacc 6420 

caggtgctga agaabbgacc cggttcctcc tgggccagaa agaagcaggc acabccccbb 64 80 

ctctgtgaca cacccbgtcc acgccccbgg htcttagttc cagccccact cataggacac 6540 

tcatagctca ggagggctcc gccttcaatc ccacccgcba aagbacbbgg agcggtctct 6600 

cccbcccbca tcagcccacc aaaccaaacc bagccbccaa gagtgggaag aaattaaagc 6660 

aagataggct attaagtgca gagggagaga aaatgcctcc aacatgtgag gaagtaatga 6720 

gagaaabcab agaattbcbt ccgcttccbc gctcacbgac bcgctgcgcb cggbcgbtcg 6780 

gctgcggcga gcggbabcag ctcactcaaa ggcggbaaba cggbbatcca cagaatcagg 6840 

ggataacgca ggaaagaaca bgtgagcaaa aggccagcaa aaggccagga accgbaaaaa 6900 

ggccgcgbtg cbggcgbttb bccabaggct ccgccccccb gacgagcabc acaaaaabcg 6960 

acgctcaagb cagaggbggc gaaacccgac aggactabaa agabaccagg cgtbbccccc 7020 

bggaagcbcc cbcgbgcgcb cbccbgbbcc gacccbgccg cbbaccggat accbgbccgc 7080 

cbtbcbcccb bcgggaagcg bggcgcbbbc bcaabgcbca cgcbgbaggt abcbcagttc 7140 

ggbgbaggfcc gbtcgctcca agcbgggctg bgbgcacgaa ccccccgbbc agcccgaccg 7200 

cbgcgccbba bccggbaacb abcgbcbtga gbccaacccg gbaagacacg acbbabcgcc 7260 

acbggcagca gccacbggfca acaggabbag cagagcgagg babgtaggcg gbgcbacaga 7320 

gbbcbbgaag bggtggccba acbacggcba cacbagaagg acagbatbbg gbabcbgcgc 7380 

tctgcbgaag ccagbbaccb bcggaaaaag agbbggbagc bcbbgabccg gcaaacaaac 7440 

caccgcbggt agcggbggtb tbbbtgbtbg caagcagcag abbacgcgca gaaaaaaagg 7500 

atcbcaagaa gabccbbbga bcbbbbcbac ggggbcbgac gcbcagfcgga acgaaaacbc 7560 

acgbbaaggg abbbbggbca tgagabbatc aaaaaggabc bbcaccbaga bccbbbbaaa 7620 

bbaaaaabga agbbbbaaab caabcbaaag babababgag baaacbbggb cbgacagbba 7680 

ccaabgcbba abcagbgagg caccbabcbc agcgabcbgb ctabbbcgbb cabccabagb 7740 

bgccbgacbc cggggggggg gggcgcbgag gbcbgccbcg bgaagaaggb gbbgcbgacb 7800 

cabaccaggc cbgaabcgcc ccabcabcca gccagaaagb gagggagcca cggbbgabga 7860 

gagcbbbgbb gbaggbggac cagbbggtga bbbbgaacbb tbgcbbbgcc acggaacggb 7920 

cbgcgbbgbc gggaagabgc gbgabcbgab ccbbcaacbc agcaaaagbb cgabbtatbc 7980 

aacaaagccg ccgbcccgbc aagbcagcgb aabgcbcbgc cagbgbbaca accaabbaac 8040 

caabbcbgab bagaaaaacb cabcgagcab caaabgaaac bgcaabbtab tcabatcagg 8100 

abbabcaaba ccababbbbt gaaaaagccg bbbcbgbaab gaaggagaaa acbcaccgag 8160 

gcagbtccab aggatggcaa gatcctggba bcggbcbgcg abtccgacbc gtccaacabc 8220 

aabacaaccb abbaabbbcc ccbcgbcaaa aabaaggbba bcaagbgaga aabcaccabg 8280 

agbgacgacb gaabccggbg agaabggcaa aagctbabgc abbbcbbtcc agacbbgbbc 8340 

aacaggccag ccabbacgct cgbcabcaaa abcacbcgca bcaaccaaac cgbbabbcab 84 00 



- 388 - 



WO 02/32943 PCT/US01/25721 

tcgtgattgc gccbgagcga gacgaaatac gcgatcgctg ttaaaaggac aabbacaaac 8460 
aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat tttcacctga 8520 
atcaggatat tcttctaata cctggaatgc bgtbbbcccg gggatcgcag tggtgagtaa 85B0 
ccatgcatca tcaggagtac ggataaaatg cbbgabggbc ggaagaggca baaabbccgb 8640 
cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac ctttgccatg B700 
tbbcagaaac aactctggcg catcgggcbt cccatacaat cgatagattg bcgcaccbga 8760 
ttgcccgaca bbabcgcgag cccabtbaba cccababaaa tcagcatcca tgttggaatt 8820 
taatcgcggc ctcgagcaag acgtttcccg ttgaatatgg ctcataacac ccctbgbabb 8880 
acbgbbtabg taagcagaca gtbbbabtgb bcabgabgat atatttttat ctbgbgcaab 8940 
gtaacatcag agattttgag acacaacgbg gctttccccc cccccccabt attgaagcat 9000 
btabcagggb tattgtctca tgsgcggata cababbtgaa bgbabbbaga aaaataaaca 9060 
aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccatbab 9120 
tatcatgaca ttaacctata aaaataggcg tatcacgagg cccbtbcgtc 9170 



<210> 167 
<211> 9280 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining SIV genes 
<400> 167 

tcgcgcgttb cggtgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbbgbct ghaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 12 0 
bbggcgggbg bcggggcbgg ctbaactabg cggcabcaga gcagabbgba cbgagagtgc 180 
accabatgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagatbgg 240 
cbabtggcca btgcatacgb bgbabccata tcabaababg bacabbbaba bbggcbcabg 300 
bccaacabta ccgccabgbb gacabtgabt abbgacbagb babbaabagt aabcaabbac 360 
ggggbcabba gbbcabagcc cabatatgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabtgacg tcaataabga cgbabgtbcc 480 
cabagbaacg ccaabaggga cttbccabbg acgtcaabgg gtggagbabb bacggbaaac 540 
bgcccacbbg gcagbacabc aagbgbabca tabgccaagb acgcccccta bbgacgbcaa 600 
bgacggbaaa bggcccgcct ggcabbabgc ccagbacabg acctbabggg acbtbccbac 660 
btggcagtac abcbacgtab bagbcatcgc babbaccabg gtgabgcggb bbbggcagba 720 
catcaabggg cgbggabagc ggbbbgacbc acggggabbb ccaagbcbcc accccabbga 780 
cgtcaabggg agtbbgbtbb ggcaccaaaa bcaacgggac ttbccaaaab gbcgbaacaa 840 
cbccgcccca bbgacgcaaa tgggcggbag gcgtgbacgg bgggaggbct atabaagcag 900 
agcbcgbbba gtgaaccgbc agabcgccbg gagacgccab ccacgcbgtb btgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabbg gaacgcggab 1020 
bccccgbgcc aagagtgacg taagbaccgc cbabagacbc babaggcaca ccccbbbggc 1080 
tctbabgcat gcbabacbgb bttbggcbtg gggccbabac acccccgctb ccbbabgcba 1140 
baggtgatgg babagctbag cctataggbg bgggttabtg accabbattg accacbcccc 12 00 
babbggbgac gabactbbcc abtactaatc cabaacabgg cbcbbtgcca caacbatcbc 1260 
battggctab atgccaabac bcbgbcctbc agagacbgac acggacbctg babbbbbaca 1320 
ggabggggtc ccabbtabba bbbacaaabb cacababaca acaacgccgt cccccgbgcc 1380 
cgcagbbttb atbaaacaba gcgbgggabc bccacgcgaa bcbcgggbac gbgbtccgga 1440 
catgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggtc ccabgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgbcb 1620 
gaaaabgagc gtggagabbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbb gbatbcbgab aagagbcaga ggbaacbccc 1740 
gbtgcggtgc bgbtaacggb ggagggcagt gbagbcbgag cagbacbcgb bgcbgccgcg 180 0 
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cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcbbbtc 1860 
bgcagbcacc gtcgtcgacg ccgccaccat gggcgtgcgc aacagcgtgc tgagcggcaa 1920 
gaaggccgac gagctggaga agabccgcct gcgcccgaac ggcaagaaga aghacahgch 1980 
gaagcacgtg gtgtgggccg ccaacgagct ggaccgcttc ggcctggccg agagcctgct 2040 
ggagaacaag gagggctgcc agaagatcct gagcgtgctg gcgccgctgg bgcccaccgg 2100 
cagcgagaac ctgaagagcc tgtacaacac cgtgtgcgtg abcbggtgca tccacgccga 2160 
ggagaaggtg aagcacaccg aggaggccaa gcagatcgtg cagcgccacc bggbggbgga 2220 
gaccggcacc accgagacca tgcccaagac cagccgcccg accgcgccca gcagcggacg 2280 
cggcggcaac tacccggtgc agcagatcgg cggcaactac gtgcacctgc cgctgagccc 2340 
gcgcaccctg aacgccbggg tgaagctgab cgaggagaag aagttcggcg ccgaggtggt 24 00 
gcccggcbbc caggccctga gcgagggctg cacgccctac gacatcaacc agatgctgaa 2460 
cbgcgbgggc gaccaccagg ccgccabgca gabcabccgc gacatcatca acgaggaggc 2520 
cgccgactgg gacctgcagc acccgcagcc cgcgccgcag cagggccagc tgcgcgagcc 2580 
cagcggcagc gacatcgccg gcaccaccag cagcgtggac gagcagabcc agtggatgta 2 640 
ccgccagcag aacccgatcc cggtgggcaa catctaccgc cgcbggatac agctgggcct 2700 
gcagaagtgc gtgcgcatgt acaacccgac caacatcctg gacgtgaagc agggcccgaa 2760 
ggagcccbtc cagagctacg tggaccgctt ctacaagagc ctgcgcgccg agcagaccga 2 820 
cgccgccgtg aagaactgga tgacccagac cctgctgatc cagaacgcca acccggactg 2 880 
caagctggtg ctgaagggcc tgggcgtgaa cccgaccctg gaggagatgc tgaccgcctg 2940 
ccagggcgtg ggcggcccgg gccagaaggc ccgcctgatg gccgaggccc tgaaggaggc 3000 
cctggcgccc gtgcccatcc cgttcgcggc cgcccagcag cgcggcccgc gcaagcccat 3060 
caagtgctgg aactgcggca aggagggcca cagcgcccgc cagbgccgcg cgccgcgccg 3120 
ccagggcbgc bggaagbgcg gcaagabgga ccacgbgatg gccaagbgcc cggaccgcca 3180 
ggcgggtagg ccbbggbcca bggggaaaga agccgcgcaa ttbcccgabg gcbcaagtgc 3240 
abcaggggcb gabgccaacb gctccgccag aggacccagc tgtggatctg cbaaagaact 3300 
acatgcagbb gggcaagcag cagagagaaa agcagagaga aagcagagag aagccbbaca 3360 
aggaggbgac agaggabbbg cbgcacctca atbcbctctb tggaggagac cagbagtcac 3420 
bgcabcgabc gagggccagc ccgbggaggb gcbtcbagac accggcgccg acgacagcat 34 80 
cgtgaccggc atcgagctgg gcccgcacta caccccgaag abcgbgggcg gcatcggcgg 3540 
cbtcatcaac accggagagt acaagaacgt ggagatcgag gtgctgggca agcgcatcaa 3 600 
gggcaccabc abgaccggcg acacgcccat caacatcttc ggccgcaacc bgcbgaccgc 3660 
ccbgggcabg agccbgaact tccccabcgc caaggbggag cccgtgaagg tggccctgaa 372 0 
gcccggcaag gacggcccca agcbgaagca gbggccccbg agcaaggaga agabcgbggc 3780 
cctgcgcgag abcbgcgaga agabggagaa ggacggccag cbggaggagg ccccacccac 3840 
caacccctac aacacgccca ccbtcgccat caagaagaag gacaagaaca agbggcgcat 3900 
gctgatcgac bbccgcgagc tgaaccgcgb gacccaggac bbcaccgagg tgcagctggg 3960 
cabcccgcac cccgccggcc bggccaagcg caagcgcabc accgtgcbgg acatcggcga 402 0 
cgccbacbbc agcabccccc bggacgagga gbbccgccag bacaccgccb tcacccbgcc 4080 
cagcgtgaac aacgccgagc ccggcaagcg cbacabcbac aaggbgcbgc cccagggcbg 4140 
gaagggcagc cccgccabct tccagtacac cabgcgccac gbgcbggagc ccttccgcaa 4200 
ggccaacccc gacgbgaccc bggtgcagba cabgcabgac abccbgabcg ccagcgaccg 4260 
caccgaccbg gagcacgacc gcgtggbgct gcagagcaag gagcbgctga acagcabcgg 4320 
cttcagcacg cccgaggaga agtbccagaa ggacccaccc ttccagtgga tgggctacga 4380 
gcbgtggccc accaagbgga agctgcagaa gabcgagcbg ccccagcgcg agaccbggac 444 0 
cgbgaacgac atccagaagc tggbgggcgb gcbgaacbgg gccgcccaga tctaccccgg 4500 
cabcaagacc aagcacctgb gccgccbgat ccgcggcaag atgacccbga ccgaggaggt 4560 
gcagtggacc gagatggccg aggccgagta cgaggagaac aagabcabcc tgagccagga 462 0 
gcaggagggc bgcbactacc aggagggcaa gccccbggag gccaccgtga tcaagagcca 4 680 
ggacaaccag tggagcbaca agatccacca ggaggacaag atccbgaagg bgggcaagbt 4 74 0 
cgccaagatc aagaacaccc acaccaacgg cgbgcgcctg cfcggcccacg bgatccagaa 4800 
gatcggcaag gaggccatcg tgabctgggg ccaggtgccc aagbbccacc tgcccgtgga 4860 
gaaggacgtg bgggagcagt ggtggaccga ctactggcag gtgacctgga tacccgagbg 4 920 
ggacttcabc agcaccccbc ccctggbgcg ccbggbgbtc aaccbggtga aggaccccab 4980 
cgagggcgag gagaccbact acaccgacgg cagcbgcaac aagcagagca aggagggcaa 5040 
ggccggcbac abcaccgacc gcggcaagga caaggbgaag gtgcbggagc agaccaccaa 510 0 
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ccagcaggcc gagctcgagg ccttcctgat ggccctgacc gacagcggcc ccaaggccaa 5160 
catcatcgtg gacagccagt acgtgatggg catcatcacc ggctgcccca ccgagagcga 5220 
gagccgcctg gtgaaccaga tcatcgagga gatgatcaag aagagcgaga tctacgtggc 5280 
ctgggtgccc gcccacaagg gcatcggcgg caaccaggag atcgaccacc tggtgagcca 5340 
gggcatccgc caggtgctgt tcctggagaa gatcgagccc gcccaggagg agcacgacaa 5400 
gtaccacagc aacgtgaagg agcbggbgbb caagttcggc ctgccccgca bcgtggcccg 5460 
ccagabcgbg gacacctgcg acaagtgcca ccagaagggc gaggccabcc acggccaggc 552 0 
caacagcgac ctgggcacct ggcagatggc ctgcacccac ctggagggca agatcabcab 5580 
cgtggccgtg cacgtggcca gcggctfccat cgaggccgag gtgabcccgc aggagaccgg 5640 
ccgccagacc gccctgttcc tgctgaagct ggccggccgc tggcccatca cccacctgca 5700 
caccgacaac ggcgccaact tcgccagcca ggaggtgaag atggtggcct ggtgggccgg 5760 
catcgagcac accttcggcg tgccctacaa cccgcagagc cagggcgtgg tggaggccat 5820 
gaaccaccac ctgaagaacc agatcgaccg catccgcgag caggccaaca gcgtggagac 5880 
cabcgbgctg atggccgtgc acbgcatgaa cttcaagcgc cgcggcggca tcggcgacat 594 0 
gacgcccgcc gagcgcctga tcaacatgat caccaccgag caggagatcc agttccagca 6000 
gagcaagaac agcaagtbca agaacttccg cgfcgtactac cgcgagggcc gcgaccagcb 6060 
gtggaagggc cccggcgagc bgctgtggaa gggcgagggc gccgtgahcc tgaaggtggg 6120 
caccgacabc aaggtggtgc cccgccgcaa ggccaagabc atcaaggact acggcggcgg 6180 
caaggaggtg gacagcagca gccacatgga ggacaccggc gaggcccgcg aggbggccbg 624 0 
agaabbcgbb taaacacgcg tgtatacgga tccagatctg cbgbgccbbc bagbbgccag 6300 
ccatctgttg tttgcccctc ccccgtgcct tccttgaccc bggaaggbgc cactcccact 6360 
gtcctttcct aataaaatga ggaaatbgca tcgcattgtc tgagtaggtg bcatbcbabb 6420 
ctggggggtg gggbggggca ggacagcaag ggggaggatt gggaagacaa bagcaggcat 6480 
gctggggabg cggtgggctc tatgggtacc caggtgctga agaattgacc cggttcctcc 6540 
tgggccagaa agaagcaggc acabccccbt ctctgtgaca caccctgtcc acgcccctgg 6600 
ttcttagbbc cagccccact cataggacac tcabagcbca ggagggctcc gccttcaatc 6660 
ccacccgcta aagtacttgg agcggtctct cccbcccbca bcagcccacc aaaccaaacc 6720 
tagcctccaa gagtgggaag aaabbaaagc aagataggct attaagtgca gagggagaga 6780 
aaatgcctcc aacabgbgag gaagbaabga gagaaabcat agaabbbbaa ggccatcabg 684 0 
gccbbaabcb bccgcbbccb cgcbcacbga cbcgcbgcgc bcggbcgbbc ggctgcggcg 6900 
agcggtatca gctcactcaa aggcggtaab acggbbabcc acagaatcag gggabaacgc 6960 
aggaaagaac abgbgagcaa aaggccagca aaaggccagg aaccgbaaaa aggccgcgtb 7020 
gcbggcgtbb bbccabaggc bccgcccccc bgacgagcab cacaaaaabc gacgctcaag 7080 
tcagaggtgg cgaaacccga caggacbaba aagataccag gcgbbbcccc cbggaagcbc 7140 
ccbcgbgcgc bcbcctgbtc cgacccbgcc gcttaccgga bacctgbccg ccbttcbccc 7200 
bbcgggaagc gbggcgcbtb cbcabagcbc acgctgbagg babcbcagbb cggtgbaggb 7260 
cgbtcgcbcc aagcbgggct gbgbgcacga accccccgbb cagcccgacc gcbgcgccbb 7320 
abccggbaac tabcgbcbtg agtccaaccc ggtaagacac gactbabcgc cactggcagc 7380 
agccactggt aacaggabta gcagagcgag gbabgtaggc ggbgcbacag agbbctbgaa 744 0 
gtggbggcct aactacggcb acacbagaag aacagtabbb ggbatcbgcg ctctgcbgaa 7500 
gccagbtacc tbcggaaaaa gagbtggbag cbcbbgabcc ggcaaacaaa ccaccgctgg 7560 
tagcggbggb bbbbtbgbbb gcaagcagca gabbacgcgc agaaaaaaag gabctcaaga 7620 
agabccbbbg abcbbbbcba cggggbcbga cgcbcagbgg aacgaaaacb cacgbbaagg 7680 
gabtbbggtc abgagatbab caaaaaggat cbtcacctag abcctbbtaa abtaaaaabg 7740 
aagbbbbaaa bcaabcbaaa gbabababga gbaaacbbgg bcbgacagbb accaabgcbb 7800 
aabcagtgag gcacctabcb cagcgabcbg bcbabbbcgb bcabccabag bbgccbgacb 7860 
cggggggggg gggcgcbgag gbcbgccbcg bgaagaaggb gbbgcbgacb cabaccaggc 7920 
cbgaabcgcc ccabcabcca gccagaaagb gagggagcca cggbbgabga gagctttgbb 7980 
gtaggbggac cagbbggtga btbbgaacbb bbgcbbbgcc acggaacggb cbgcgbbgbc 8040 
gggaagabgc gtgabcbgab ccbbcaacbc agcaaaagbb cgabbbabtc aacaaagccg 8100 
ccgbcccgtc aagbcagcgt aabgctctgc cagbgtbaca accaabtaac caabbcbgab 8160 
tagaaaaacb cabcgagcab caaabgaaac bgcaatbbab bcababcagg attabcaaba 8220 
ccababbtbb gaaaaagccg btbcbgtaab gaaggagaaa acbcaccgag gcagbtccab 8280 
aggabggcaa gabccbggba bcggbcbgcg abbccgacbc gbccaacabc aatacaacct 8340 
abbaabbbcc cctcgbcaaa aabaaggtba tcaagbgaga aatcaccabg agtgacgact 84 00 
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gaatccggtg agaatggcaa aagcbtabgc abbtctbbcc agactbgbtc aacaggccag 8460 
ccattacgct cgtcatcaaa abcacbcgca tcaaccaaac cgtbabtcab tcgbgattgc 8520 
gcctgagcga gacgaaatac gcgatcgctg bbaaaaggac aahtacaaac aggaabcgaa 85B0 
bgcaaccggc gcaggaacac tgccagcgca tcaacaatat tttcacctga atcaggatat 8640 
tcttctaata cctggaatgc tgttttcccg gggatcgcag bggtgagbaa ccabgcatca 8700 
tcaggagtac ggataaaatg ctbgatggtc ggaagaggca taaattccgt cagccagtbb 8760 
agtctgacca tctcatctgt aacatcattg gcaacgctac ctttgccatg tttcagaaac 8820 
aactctggcg catcgggctt cccatacaat cgatagattg bcgcaccbga ttgcccgaca 8880 
ttatcgcgag cccatttata cccatataaa tcagcatcca tgttggaatt taatcgcggc 8940 
ctcgagcaag acgtttcccg ttgaatatgg ctcataacac cccttgtatt acbgttbatg 9000 
taagcagaca gtbtbatbgt tcatgatgat atatttttat cttgtgcaab gtaacatcag 9060 
agattttgag acacaacgtg gctttccccc cccccccatt abtgaagcat bbatcagggb 9120 
batbgtctca bgagcggaba catatttgaa tgtatttaga aaaataaaca aataggggtt 9180 
ccgcgcacat btccccgaaa agtgccacct gacgtctaag aaaccabbab tabcabgaca 9240 
ttaacctata aaaataggcg tatcacgagg ccctbbcgbc 9280 



<210> 168 
<211> 9166 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plaemid pVR1012x/s containing HIV genes 
<400> 168 

bcgcgcgttb cggtgatgac ggtgaaaacc tcbgacacab gcagcbcccg gagacggtca 60 
cagcttgbct gbaagcggab gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
btggcgggtg tcggggcbgg cttaactabg cggcatcaga gcagattgta ctgagagtgc 180 
accabatgcg gtgbgaaaba ccgcacagat gcgbaaggag aaaataccgc atcagabbgg 240 
ctattggcca ttgcatacgt tgtabccata tcataatabg tacabbbata ttggctcatg 300 
bccaacatba ccgccatgtt gacatbgabb attgactagb tatbaatagt aabcaabtac 360 
ggggbcatba gbbcabagcc cababatgga gttccgcgbt acataactta cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg tcaabaatga cgtatgttcc 480 
catagtaacg ccaataggga cbbbccatbg acgbcaabgg gbggagtabb bacggtaaac 54 0 
tgcccactbg gcagtacatc aagbgtatca batgccaagt acgcccccta ttgacgbcaa 600 
bgacggbaaa tggcccgccb ggcatbabgc ccagbacabg accbtatggg acttbccbac 660 
tbggcagtac abctacgtab bagbcatcgc tabbaccatg gbgabgcggb tttggcagta 720 
catcaabggg cgbggatagc ggbbbgacbc acggggabbt ccaagtctcc accccabbga 780 
cgbcaatggg agbttgttbt ggcaccaaaa tcaacgggac ttbccaaaab gtcgtaacaa 840 
cbccgcccca bbgacgcaaa bgggcggtag gcgbgtacgg tgggaggtcb atabaagcag 900 
agcbcgbbta gbgaaccgtc agabcgcctg gagacgccat ccacgcbgbb btgaccbcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggtgcabbg gaacgcggat 1020 
tccccgbgcc aagagbgacg taagtaccgc cbabagacbc babaggcaca ccccbbbggc 1080 
tctbatgcab gcbabacbgb tbbtggcttg gggccbabac acccccgctb cctbatgcba 1140 
taggbgatgg babagctbag ccbabaggbg bgggbbattg accabtatbg accacbcccc 1200 
battggtgac gatacbtbcc attacbaabc cabaacabgg ctcbbtgcca caactabctc 1260 
babbggctab atgccaabac tcbgtccttc agagacbgac acggactcbg babttbbaca 1320 
ggatggggtc ccabttabba tttacaaabt cacatabaca acaacgccgt cccccgbgcc 13 80 
cgcagtbttt attaaacata gcgtgggabc tccacgcgaa tcbcgggtac gtgbbccgga 1440 
catgggcbcb tcbccggbag cggcggagcb bccacabccg agccctggbc ccabgcctcc 1500 
agcggctcat ggtcgcbcgg cagcbccbtg cbcctaacag tggaggccag acttaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggtagg gbabgbgbct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag abggaagacb taaggcagcg 1680 



- 392 - 



WO 02/32943 



PCT/US01/25721 



gcagaagaag atgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagt 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gtcgtcgaca tgggcgcccg 
cgcctgggag cgcatcaagc tgaagcccgg 
ggtgtgggcc agccgcgagc tggagcgctt 
cgagggctgc aagcagatca tgaagcagct 
gctgatcagc ctgcacaaca ccgbggccac 
gcgcgacacc aaggaggccc tggacaagat 
gacccagcag gccgaggccg ccgacaaggg 
gaacctgcag ggccagatgg tgcaccagcc 
gaaggtggtg gaggagaagg ccttcagccc 
cgagggcgcc accccccagg acctgaacac 
cgccatgcag atgctgaagg acaccatcaa 
ccccgtgcac gccggccccg tggcccccgg 
cgccggcacc accagcaccc tgcaggagca 
gcccgtgggc gacatctaca agcgctggat 
gtacagcccc gtgagcatcc tggacatcaa 
cgtggaccgc ttcttcaaga ccctgcgcgc 
gatgaccgac accctgctgg tgcagaacgc 
cctgggcccc ggcgccagcc tggaggagat 
cggccacaag gcccgcgtgc tggccgaggc 
gatgcagcgc agcaactgca agggccccaa 
ggagggccac ctggcccgca actgccgcgc 
caaggagggc caccaggtga aggactgcac 
ctggccttcc cacaagggaa ggccagggaa 
cccaccagaa gagagcttca ggtttgggga 
gccgatagac aaggaactgt atcctttagc 
ctcgtcacaa taaagatagg gggccagctg 
gacaccgtgc tggaggagat gaacctgccc 
atcggcggct tcatcaaggt gggccagtac 
aaggccatcg gcaccgtgct ggtgggcccc 
ctgacccaga tcggctgcac cctgaacttc 
aagctgaagc ccggcatgga cggccccaag 
atcaaggccc tggtggagat ctgcaccgag 
ggccccgaga acccctacaa cacccccgtg 
tggcgcaagc tggtggactt ccgcgagctg 
cagctgggca tcccccaccc cgccggcctg 
gtgggcgacg cctacttcag cgtgcccctg 
accatcccca gcatcaacaa cgagaccccc 
cagggctgga agggcagccc cgccatcttc 
ttccgcaagc agaaccccga catcgtgatc 
agcgacctgg agatcggcca gcaccgcacc 
cgctggggct tcaccacccc cgacaagaag 
ggctacgagc tgcaccccga caagtggacc 
agctggaccg tgaacgacat ccagaagctg 
tacgccggca tcaaggtgcg ccagctgtgc 
gaggtggtgc ccctgaccga ggaggccgag 
aaggagcccg tgcacggcgt gtactacgac 
aagcagggcc agggccagtg gacctaccag 
accggcaagt acgcccgcat gaagggcgcc 
gccgtgcaga agatcgccac cgagagcatc 
ctgcccatcc agaaggagac ctgggaggcc 
atccccgagt gggagttcgt gaacaccccc 
aaggagccca tcatcggcgc cgagaccttc 



gtattctgat aagagtcaga ggtaactccc 1740 
gtagtctgag cagtactcgt tgctgccgcg 1800 
aacagactgt tcctttccat gggtcttttc 1860 
cgccagcatc ctgcgcggcg gcaagctgga 1920 
cggcaagaag cactacatga tgaagcacct 1980 
cgccctggac cccggcctgc tggagaccag 2040 
gcagcccgcc ctgcagaccg gcaccaagga 2100 
cctgtactgc gtgcacgaga agatcgacgt 2160 
caaggaggag cagaacaaga gccagcagaa 2220 
caaggtgagc cagaactacc ccatcgtgca 2280 
catcagcgcc cgcaccctga acgcctgggt 2340 
cgaggtgatc cccatgttca ccgccctgag 2400 
catgctgaac accgtgggcg gccaccaggc 24 60 
cgaggaggcc gccgagtggg accgcctgca 2520 
ccagatgcgc gagccccgcg gcagcgacat 25 BO 
gatcacctgg atgaccaaca acccccccgt 2640 
catcctgggc ctgaacaaga tcgtgcgcat 2 700 
gcagggcccc aaggagccct tccgcgacta 27 60 
cgagcaggcc acccaggacg tgaagaactg 2 820 
caaccccgac tgcaagacca tcctgcgcgc 2880 
gatgaccgcc tgccagggcg tgggcggccc 2940 
catgagcaag gtgaacaaca ccaacatcat 3000 
gcgcaccatc aagtgcttca actgcggcaa 3 060 
cccccgcaag aagggctgct ggaagtgcgg 3120 
cgagcgacag gctaattttt tagggaagat 3180 
ttttcttcag agcagaccag agccaacagc 3240 
agagacaaca actccctctc agaagcagga 3300 
ttccctcaga tcactctttg gcagcgaccc 3360 
aaggaggccc ttctagacac cggcgccgac 3420 
ggccgctgga agcccaagat gatcggcggc 3480 
gaccagatcc tgatcgagat ctgcggccac 354 0 
acccccgtga acatcatcgg ccgcaacctg 3600 
cccatcagcc ccatcgagac cgtgcccgtg 3660 
gtgaagcagt ggcccctgac cgaggagaag 3720 
atggagaagg agggcaagat cagcaagatc 3780 
ttcgccatca agaagaagga cagcaccaag 3840 
aacaagcgca cccaggactt ctgggaggtg 3900 
aagcagaaga agagcgtgac cgtgctggac 3960 
gacaaggact tccgcaagta caccgccttc 4020 
ggcatccgct accagtacaa cgtgctgccc 4 080 
cagtgcagca tgaccaagat cctggagccc 414 0 
taccagtaca tggaccacct gtacgtgggc 4200 
aagatcgagg agctgcgcca gcacctgctg 4260 
caccagaagg agcccccctt cctgtggatg 4320 
gtgcagccca tcgtgctgcc cgagaaggac 4380 
gtgggcaagc tgaactgggc cagccagatc 4440 
aagctgctgc gcggcaccaa ggccctgacc 4 500 
ctggagctgg ccgagaaccg cgagatcctg 4560 
cccagcaagg acctgatcgc cgagatccag 4620 
atctaccagg agcccttcaa gaacctgaag 4680 
cacaccaacg acgtgaagca gctgaccgag 474 0 
gtgatctggg gcaagacccc caagttcaag 4 800 
tggtggaccg agtactggca ggccacctgg 4 860 
cccctggtga agctgtggta ccagctggag 4920 
tacgtggacg gcgccgccaa ccgcgagacc 4980 
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aagcbgggca aggccggcba cgbgaccgac 
gacaccacca accagaagac cgagctgcag 
cbggaggbga acatcgtgac cgacagccag 
gacaagagcg agagcgagct ggtgagccag 
gtgtacctgg ccbgggbgcc cgcccacaag 
cbggbgagcg ccggcatccg caaggbgctg 
gagcacgaga agtaccacag caactggcgc 
gtggtggcca aggagatcgt ggccagctgc 
cacggccagg tggactgcag ccccggcatc 
aaggtgatcc tggtggccgt gcacgtggcc 
gccgagaccg gccaggagac cgcctacttc 
aagaccgtgc acaccgacaa cggcagcaac 
tggtgggccg gcatcaagca ggagttcggc 
atcgagagca tgaacaagga gctgaagaag 
caccbgaaga ccgccgtgca gatggccgtg 
atcggcggct acagcgccgg cgagcgcatc 
aaggagctgc agaagcagat caccaagabc 
cgcgaccccg tgtggaaggg ccccgccaag 
atccaggaca acagcgacab caaggtggtg 
tacggcaagc agatggccgg cgacgactgc 
bbcbbgcbgb gccttctagt tgccagccat 
tgaccctgga aggtgccact cccactgtcc 
attgtctgag taggtgbcab tcbabtctgg 
aggattggga agacaatagc aggcatgctg 
tgctgaagaa tbgacccggt bccbccbggg 
gtgacacacc ctgtccacgc ccctggttct 
agctcaggag ggcbccgccb tcaatcccac 
ccctcatcag cccaccaaac caaaccbagc 
taggctatta agtgcagagg gagagaaaat 
aabcabagaa btbcbbccgc bbccbcgctc 
cggcgagcgg tatcagctca ctcaaaggcg 
aacgcaggaa agaacatgtg agcaaaaggc 
gcgttgctgg cgtttbbcca taggcbccgc 
bcaagbcaga ggtggcgaaa cccgacagga 
agctccctcg tgcgctctcc tgttccgacc 
ctcccttcgg gaagcgbggc gctttctcaa 
taggtcgttc gctccaagct gggctgtgfcg 
gccttatccg gtaactatcg tcttgagtcc 
gcagcagcca cbggbaacag gattagcaga 
bbgaagtggb ggcctaacta cggctacact 
ctgaagccag tbaccbbcgg aaaaagagtt 
gctggtagcg gbggbbbbbb tgtttgcaag 
caagaagatc ctttgatctt ttctacgggg 
baagggabbb bggbcabgag abbatcaaaa 
aaabgaagbb tbaaabcaab cbaaagbaba 
tgcbbaabca gbgaggcacc tatcbcagcg 
bgacbccggg gggggggggc gcbgaggbcb 
ccaggccbga abcgccccab catccagcca 
bbbgbbgbag gbggaccagb tggbgabtbb 
gtbgbcggga agabgcgbga bcbgabcctb 
aagccgccgb cccgbcaagt cagcgtaatg 
bctgattaga aaaacbcabc gagcabcaaa 
bcaabaccab abbbbbgaaa aagccgbttc 
btccabagga tggcaagabc cbggbabcgg 
caaccbabba abbbccccbc gbcaaaaata 



cgcggccgcc agaaggbggb gcccctgacc 5040 
gccabccacc bggccctgca ggacagcggc 5100 
bacgccctgg gcabcabcca ggcccagccc 5160 
abcabcgagc agcbgabcaa gaaggagaag 5220 
ggcatcggcg gcaacgagca ggtggacggc 52 80 
tbccbggacg gcatcgacaa ggcccaggag 5340 
gccabggcca gcgactbcaa cctgcccccc 5400 
gacaagbgcc agcbgaaggg cgaggccatg 5460 
bggcagcbgg cabgcaccca ccbggagggc 5520 
agcggcbaca bcgaggccga ggtgabcccc 5580 
ctgctgaagc bggccggccg cbggcccgbg 5640 
bbcaccagca ccaccgbgaa ggccgccbgc 5700 
abccccbaca acccccagag ccagggcgbg 5760 
abcabcggcc aggbgcgcga ccaggccgag 5820 
bbcabccaca acbbcaagcg caagggcggc 5880 
gbggacabca bcgccaccga cabccagacc 5940 
cagaacbbcc gcgbgtacba ccgcgacagc 6000 
cbgcbgbgga agggcgaggg cgccgtggbg 6060 
ccccgccgca aggccaagab cabccgcgac 6120 
gbggccagcc gccaggacga ggacbaggaa 6180 
cbgbbgbbbg ccccbccccc gtgccbbccb 6240 
bbbccbaata aaatgaggaa abbgcatcgc 6300 
ggggtggggt ggggcagcac agcaaggggg 6360 
gggatgcggb gggcbcbabg ggbacccagg 6420 
ccagaaagaa gcaggcacab ccccbbcbct 6480 
tagbbccagc cccacbcaba ggacacbcab 6540 
ccgcbaaagb acbbggagcg gbcbcbccct 6600 
cbccaagagb gggaagaaat baaagcaaga 6660 
gccbccaaca bgtgaggaag baatgagaga 672 0 
actgacbcgc bgcgcbcggb cgbbcggcbg 6780 
gbaabacggb babccacaga abcaggggat 6840 
cagcaaaagg ccaggaaccg baaaaaggcc 6900 
cccccbgacg agcabcacaa aaatcgacgc 6960 
cbabaaagat accaggcgbb bcccccbgga 7020 
cbgccgcbba ccggabaccb gbccgccbbt 7080 
bgcbcacgct gtaggbabcb cagbtcggbg 7140 
cacgaacccc ccgbbcagcc cgaccgcbgc 7200 
aacccggbaa gacacgactb abcgccacbg 7260 
gcgaggbabg baggcggbgc tacagagbbc 7320 
agaaggacag babbbggbab cbgcgcbcbg 7380 
ggtagcbcbb gabccggcaa acaaaccacc 7440 
cagcagabba cgcgcagaaa aaaaggabcb 7500 
bcbgacgcbc agbggaacga aaacbcacgb 7560 
aggabctbca ccbagabccb bbbaaabbaa 7620 
tabgagbaaa cbbggbcbga cagbbaccaa 7680 
atctgbctab tbcgbbcabc cabagbbgcc 7740 
gccbcgbgaa gaaggbgbbg cbgacbcata 7800 
gaaagbgagg gagccacggb bgabgagagc 7860 
gaacbbbbgc bbbgccacgg aacggbcbgc 7920 
caacbcagca aaagbbcgab bbabbcaaca 7980 
cbcbgccagb gbbacaacca abbaaccaat 8040 
bgaaacbgca abbtabbcab abcaggabba 8100 
bgtaabgaag gagaaaacbc accgaggcag 8160 
bcbgcgabbc cgacbcgbcc aacabcaaba 8220 
aggbtabcaa gbgagaaabc accabgagbg 8280 
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acgactgaat ccggbgagaa bggcaaaagc bbabgcabbb ctttccagac ttgttcaaca 8340 
ggccagccat tacgctcgtc atcaaaatca ctcgcatcaa ccaaaccgbb attcattcgt 8400 
gattgcgcct gagcgagacg aaatacgcga tcgctgttaa aaggacaatt acaaacagga 8460 
atcgaatgca accggcgcag gaacactgcc agcgcatcaa caababbbbc acctgaatca 8520 
ggatabbcbt cbaabaccbg gaatgcbgtt ttcccgggga tcgcagtggt gagtaaccat 8580 
gcabcabcag gagtacggat aaaatgcttg atggtcggaa gaggcataaa ttccgtcagc 8640 
cagtttagtc tgaccatctc atctgtaaca tcattggcaa cgctaccttt gccatgtttc 8700 
agaaacaact ctggcgcatc gggcttccca bacaabcgab agabbgbcgc acctgatbgc 8760 
ccgacabbat cgcgagccca tttataccca tataaatcag catccatgtt ggaatttaab 8820 
cgcggccbcg agcaagacgb tbcccgtbga ababggcbca baacaccccb bgtabbacbg 8880 
bttatgbaag cagacagbbb batbgttcab gabgababab bbttabcttg bgcaabgtaa 8940 
cabcagagab bbbgagacac aacgbggcbb tccccccccc cccatbatbg aagcabttab 9000 
cagggbbabb gbcbcabgag cggabacaba tbbgaabgba bbbagaaaaa Caaacaaaba 9060 
ggggtbccgc gcacattbcc ccgaaaagbg ccaccbgacg bctaagaaac cabbabbabc 9120 
abgacabbaa ccbataaaaa taggcgbabc acgaggcccb bbcgbc 9166 



<210> 169 
<211> 9169 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 169 

tcgcgcgbbb cggbgabgac ggbgaaaacc bcbgacacat gcagctcccg gagacggbca 60 
cagcbbgbcb gbaagcggab gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 12 0 
bbggcgggbg bcggggctgg cbbaacbabg cggcabcaga gcagabbgta ctgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaabaccgc abcagabbgg 240 
ctabbggcca bbgcabacgb bgbabccaba bcabaababg bacabbbaba bbggcbcabg 300 
bccaacabta ccgccatgbb gacabbgabb abbgacbagt babbaabagb aabcaabbac 360 
ggggbcabba gbbcatagcc cabababgga gbbccgcgbt acabaactba cggtaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaabaabga cgbabgtbcc 480 
cabagbaacg ccaabaggga cbbbccabbg acgbcaabgg gtggagbabb bacggbaaac 54 0 
bgcccacbbg gcagtacabc aagbgbabca babgccaagb acgcccccba bbgacgbcaa 600 
tgacggbaaa bggcccgccb ggcabbabgc ccagbacabg accbbabggg acbbbccbac 660 
bbggcagbac abcbacgbab tagtcabcgc babbaccabg gbgabgcggb bbbggcagba 720 
cabcaabggg cgbggabagc ggbttgacbc acggggabbb ccaagbcbcc accccabbga 780 
cgbcaabggg agbbbgbbbb ggcaccaaaa bcaacgggac bbbccaaaab gbcgbaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb ababaagcag 900 
agctcgbbta gbgaaccgbc agabcgccbg gagacgccab ccacgcbgbb bbgaccbcca 960 
tagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcatbg gaacgcggat 1020 
bccccgbgcc aagagbgacg taagbaccgc cbabagacbc bataggcaca ccccbbtggc 1080 
bcbbabgcab gcbabacbgb bbbbggcbbg gggccbatac acccccgctb cctbabgcba 1140 
baggbgabgg babagcbbag cctabaggtg bgggbbattg accatbatbg accacbcccc 12 00 
babbggbgac gabacbbbcc abbacbaabc cataacatgg ctcbbbgcca caactabcbc 1260 
babbggcbab abgccaatac bctgbccbbc agagacbgac acggacbcbg batbbbbaca 1320 
ggabggggbc ccatbbabba bbtacaaabb cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbbbbb abbaaacaba gcgbgggatc bccacgcgaa bctcgggbac gbgbtccgga 1440 
cabgggcbcb bcbccggbag cggcggagcb bccacabccg agcccbggbc ccatgccbcc 1500 
agcggcbcab ggbcgcbcgg cagcbccbtg cbccbaacag bggaggccag acbtaggcac 1560 
agcacaatgc ccaccaccac cagbgbgccg cacaaggccg bggcggtagg gbabgbgbcb 1620 
gaaaabgagc gbggagatbg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
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gcagaagaag atgcaggcag ctgagttgtt 
gttgcggtgc tgttaacggt ggagggcagb 
cgcgccacca gacataatag ctgacagact 
tgcagtcacc gtcgtcgaca tgggcgcccg 
cgcctgggag aagatccgcc tgcgccccgg 
gabcbgggcc agccgcgagc tggaccgctt 
cgacggctgc cagcagatca tggagaagct 
gabcaagagc ctgbbcaaca ccgtggccac 
gaaggacacc aaggaggccc tggacaagat 
gacccagcag gccgccgccg acaccggcaa 
cgtgcagaac gcccagggcc agatgatcca 
ctgggtgaag gtgatcgagg agaaggccbt 
cctgagcgag ggcgccaccc cccaggacct 
ccaggccgcc atgcagatgc tgaaggacac 
cctgcacccc gtgcacgccg gccccatccc 
cgacatcgcc ggcaccacca gcacccccca 
ccccatcccc gtgggcgaca tctacaagcg 
gcgcatgtac agccccgbga gcatcctgga 
cgactacgtg gaccgcbbcb tcaagacctt 
gaactggatg accgacaccc tgctggtgca 
gcgcgccctg ggccccggcg ccaccctgga 
cggccccggc cacaaggccc gcgtgctggc 
catcatgatg cagcgcggca acttcaaggg 
caaggagggc caccbggccc gcaactgccg 
cggcaaggag ggccaccaga tgaaggactg 
gatctggcct tcccacaagg gaaggccagg 
agccccacca gaagagagcb tcaggtttgg 
ggagccgata gacaaggaac bgtatcctbt 
cccctcgtca caataaagat agggggccag 
gacgacaccg tgctggagga gatgaaccbg 
ggcatcggcg gcttcatcaa ggtgggccag 
cacaaggcca tcggcaccgt gctggtgggc 
ctgctgaccc agatcggctg caccctgaac 
gtgaagctga agcccggcab ggacggcccc 
aagatcaagg ccctggtgga gabcbgcacc 
atcggccccg agaaccccta caacaccccc 
aagtggcgca agctggtgga cbbccgcgag 
gtgcagctgg gcatccccca ccccgccggc 
gacgbgggcg acgccbacbb cagcgbgccc 
ttcaccatcc ccagcatcaa caacgagacc 
ccccagggct ggaagggcag ccccgccabc 
cccttccgca agcagaaccc cgacatcgtg 
ggcagcgacc tggagatcgg ccagcaccgc 
ctgcgctggg gcttcaccac ccccgacaag 
atgggctacg agctgcaccc cgacaagtgg 
gacagctgga ccgtgaacga catccagaag 
abctacgccg gcatcaaggt gcgccagctg 
accgaggtgg tgcccctgac cgaggaggcc 
ctgaaggagc ccgtgcacgg cgtgtactac 
cagaagcagg gccagggcca gbggacctac 
aagaccggca agtacgcccg catgaagggc 
gaggccgtgc agaagatcgc caccgagagc 
aagcbgccca tccagaagga gacctgggag 
tggatccccg agbgggagbb cgtgaacacc 
gagaaggagc ccatcatcgg cgccgagacc 



gtattctgat aagagtcaga ggtaactccc 1740 
gtagtctgag cagtactcgt bgcbgccgcg 1800 
aacagactgt bccbtbccat gggtcttttc I860 
cgccagcgtg ctgagcggcg gcaagttcga 1920 
cggcaagaag aagtaccgca tgaagcacct 1980 
cgccctgaac cccagcctgc tggagaccgc 2040 
gcagcccgcc ctgaagaccg gcaccgagga 2100 
cctgbacbgc gtgcaccagc gcahcgacgt 2160 
cgaggagatc aagaacaaga gcaagcagaa 2220 
cagcagcaac gtgagccaga actaccccat 2280 
ccagagcctg agcccccgca ccctgaacgc 2340 
cagccccgag gtgafccccca tgttcagcgc 2400 
gaacatgatg cbgaacabcg bgggcggcca 2460 
catcaacgag gaggccgccg agtgggaccg 2520 
ccccggccag atgcgcgagc cccgcggcag 2580 
ggagcagatc ggctggatga ccggcaaccc 2640 
ctggatcatc ctgggcctga acaagatcgt 2700 
catcaagcag ggccccaagg agcccttccg 2760 
ccgcgccgag caggccaccc aggacgtgaa 2 820 
gaacgccaac cccgactgca aggccabccb 2880 
ggagatgatg accgcctgcc agggcgbggg 2940 
cgaggccatg agccaggtgc agcacaccaa 3000 
ccagaagcgc atcaagtgct tcaactgcgg 3 060 
cgccctgcgc aagaagggct gctggaagtg 3120 
caccgagcga caggctaatt ttttagggaa 3180 
gaattttctt cagagcagac cagagccaac 3240 
ggaagagaca acaactccct ctcagaagca 33 00 
agctbcccbc agatcactct ttggcagcga 3360 
ctgaaggagg cccttctaga caccggcgcc 3420 
cccggccgct ggaagcccaa gatgatcggc 3480 
tacgaccaga tcctgatcga gatctgcggc 3540 
cccacccccg tgaacatcat cggccgcaac 3600 
btccccabca gccccatcga gaccgtgccc 3660 
aaggtgaagc agtggcccct gaccgaggag 3 720 
gagabggaga aggagggcaa gatcagcaag 37 80 
gtgttcgcca tcaagaagaa ggacagcacc 3 840 
ctgaacaagc gcacccagga cttctgggag 3900 
ctgaagcaga agaagagcgt gaccgtgctg 3960 
ctggacaagg acttccgcaa gtacaccgcc 4020 
cccggcatcc gctaccagta caacgtgctg 4080 
ttccagtgca gcatgaccaa gatcctggag 4140 
abcbaccagt acatggacca ccbgbacgtg 4200 
accaagatcg aggagctgcg ccagcacctg 4260 
aagcaccaga aggagccccc cttcctgtgg 4320 
accgbgcagc ccatcgtgct gcccgagaag 4380 
ctggtgggca agcbgaacbg ggccagccag 444 0 
bgcaagcbgc bgcgcggcac caaggccctg 4500 
gagcbggagc bggccgagaa ccgcgagatc 4560 
gaccccagca aggaccbgat cgccgagatc 4 620 
cagabcbacc aggagcccbt caagaacctg 4680 
gcccacacca acgacgbgaa gcagctgacc 4740 
atcgbgabcb ggggcaagac ccccaagbtc 4800 
gcctggbgga ccgagtacbg gcaggccacc 4860 
cccccccbgg bgaagcbgtg gbaccagctg 4920 
ttcbacgbgg acggcgccgc caaccgcgag 4980 
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accaagcbgg gcaaggccgg ctacgtgacc gaccgcggcc gccagaaggt ggtgcccctg 5040 
accgacacca ccaaccagaa gaccgagctg caggccatcc accbggcccb gcaggacagc 5100 
ggcctggagg tgaacatcgt gaccgacagc cagtacgccc tgggcatcat ccaggcccag 5160 
cccgacaaga gcgagagcga gctggtgagc cagatcatcg agcagctgab caagaaggag 522 0 
aaggtgtacc tggcctgggt gcccgcccac aagggcatcg gcggcaacga gcaggtggac 5280 
ggcctggtga gcgccggcat ccgcaaggtg cbgbbccbgg acggcatcga caaggcccag 5340 
gaggagcacg agaagtacca cagcaactgg cgcgccatgg ccagcgacth caaccbgccc 5400 
cccgtggtgg ccaaggagat cgtggccagc tgcgacaagt gccagctgaa gggcgaggcc 54 60 
atgcacggcc aggtggactg cagccccggc atctggcagc tggcatgcac ccacctggag 5520 
ggcaaggtga bcctggbggc cgtgcacgtg gccagcggct acatcgaggc cgaggtgatc 55 80 
cccgccgaga ccggccagga gaccgccbac ttccbgcbga agcbggccgg ccgctggccc 564 0 
gtgaagaccg tgcacaccga caacggcagc aacttcacca gcaccaccgt gaaggccgcc 5700 
tgctggtggg ccggcatcaa gcaggagbbc ggcatcccct acaaccccca gagccagggc 5760 
gtgatcgaga gcatgaacaa ggagctgaag aagatcatcg gccaggtgcg cgaccaggcc 5820 
gagcacctga agaccgccgt gcagatggcc gtgttcatcc acaacbbcaa gcgcaagggc 5880 
ggcatcggcg gctacagcgc cggcgagcgc atcgtggaca tcatcgccac cgacatccag 594 0 
accaaggagc tgcagaagca gatcaccaag atccagaact tccgcgtgta cbaccgcgac 6000 
agccgcgacc ccgtgtggaa gggccccgcc aagctgctgt ggaagggcga gggcgccgtg 6060 
gtgatccagg acaacagcga catcaaggtg gbgccccgcc gcaaggccaa gatcatccgc 6120 
gactacggca agcagatggc cggcgacgac tgcgtggcca gccgccagga cgaggactag 6180 
gaattcttgc tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6240 
ccttgaccct ggaaggtgcc actcccactg tcctttccba ataaaatgag gaaattgcat 6300 
cgcabbgbct gagbaggtgt cabbcbabtc tggggggtgg ggtggggcag cacagcaagg 6360 
gggaggabbg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atgggtaccc 642 0 
aggtgctgaa gaattgaccc ggbbccbccb gggccagaaa gaagcaggca catccccttc 6480 
tctgtgacac accctgtcca cgcccctggt tcttagttcc agccccactc ataggacact 6540 
cabagcbcag gagggctccg ccbbcaabcc cacccgctaa agtacttgga gcggtcbcbc 6600 
cctccctcat cagcccacca aaccaaacct agcctccaag agbgggaaga aattaaagca 6660 
agataggcta btaagtgcag agggagagaa aabgccbcca acatgtgagg aagtaatgag 6720 
agaaatcata gaabbtcbbc cgcttcctcg . cbcacbgacb cgctgcgctc ggtcgttcgg 6780 
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggbbabccac agaatcaggg 6B40 
gataacgcag gaaagaacab gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 6900 
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 6960 
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtbtccccct 7020 
ggaagctccc bcgtgcgctc bccbgttccg acccbgccgc btaccggata ccbgtccgcc 7080 
tbbctcccbb cgggaagcgt ggcgcbbbcb caatgcbcac gcbgbaggta tcbcagtbcg 7140 
gbgbaggbcg tbcgctccaa gcbgggcbgt gbgcacgaac cccccgbbca gcccgaccgc 7200 
bgcgccbbab ccggbaacba bcgbctbgag bccaacccgg taagacacga ctbatcgcca 7260 
cbggcagcag ccacbggbaa caggattagc agagcgaggb abgtaggcgg bgctacagag 7320 
bbcbbgaagt ggtggccbaa ctacggcbac acbagaagga cagbatttgg tabcbgcgct 7380 
cbgctgaagc cagttacctb cggaaaaaga gbtggbagct cbbgatccgg caaacaaacc 7440 
accgctggta gcggbggbtb bbtbgtttgc aagcagcaga bbacgcgcag aaaaaaagga 7500 
bcbcaagaag abccbbbgab cbbbbctacg gggbcbgacg cbcagtggaa cgaaaactca 7560 
cgbtaaggga tbtbggtcat gagabbabca aaaaggatcb tcacctagat ccttbbaaab 7620 
baaaaatgaa gttbtaaatc aabcbaaagt atatabgagb aaacttggtc tgacagbtac 7680 
caatgctbaa tcagbgaggc accfcabctca gcgabcbgtc tattbcgbtc atccatagbb 774 0 
gcctgacfccc gggggggggg ggcgcbgagg bctgccbcgb gaagaaggbg btgcbgacbc 7800 
abaccaggcc bgaabcgccc catcabccag ccagaaagbg agggagccac ggbtgabgag 7860 
agcbbtgbbg baggbggacc agtbggtgat bttgaacbbb bgctbbgcca cggaacggbc 7920 
bgcgbbgbcg ggaagabgcg tgabcbgabc cttcaacbca gcaaaagtbc gabtbabbca 7980 
acaaagccgc cgbcccgbca agtcagcgba abgctcbgcc agbgtbacaa ccaabtaacc 8040 
aabtcbgabb agaaaaactc atcgagcabc aaabgaaacb gcaattbabt cababcagga 8100 
bbatcaabac cababttttg aaaaagccgb ttcbgbaabg aaggagaaaa cbcaccgagg 8160 
cagtbccata ggatggcaag abcctggtab cggtcbgcga fctccgactcg bccaacabca 8220 
abacaaccta bbaatbtccc cbcgbcaaaa ataaggbbat caagbgagaa abcaccabga 8280 
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gbgacgacbg aatccggtga gaatggcaaa agcttatgca tttctttcca gactbgbbca 8340 

acaggccagc cattacgctc gbcatcaaaa tcactcgcat caaccaaacc gttattcatt 8400 

cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt fcaaaaggaca attacaaaca 8460 

ggaatcgaat gcaaccggcg caggaacact gccagcgcab caacaatabt btcaccbgaa 8520 

tcaggatatt ctbctaabac ctggaatgct gbbbbcccgg ggabcgcagb ggtgagtaac 8580 

cabgcabcab caggagtacg gabaaaabgc ttgatggtcg gaagaggcat aaattccgtc 8640 

agccagttta gbcbgaccab ctcatctgta acatcabbgg caacgctacc tttgccatgt 8700 

ttcagaaaca actcbggcgc atcgggcttc ccatacaatc gabagabbgt cgcacctgat 8760 

tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat gttggaatht 8820 

aatcgcggcc tcgagcaaga cgbbbcccgb bgaatatggc bcataacacc ccttgbatta 8880 

cbgtbbabgb aagcagacag btbbatbgbb catgatgata babtbbtatc btgtgcaatg 8940 

taacahcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta tbgaagcatb 9000 

tabcagggbb abtgbcbcab gagcggatac atatttgaat gtatttagaa aaataaacaa 9060 

ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga aaccattatt 9120 

abcatgacat baaccbataa aaabaggcgb abcacgaggc ccbtbcgbc 9169 



<210> 170 
<211> 9792 
<212> DNA 

<213> Arbificial Sequence 

r 
* 

<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 170 

tcgcgcgttb cggbgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 

cagctbgbct gtaagcggat gccgggagca gacaagcccg bcagggcgcg tcagcgggbg 120 

bbggcgggbg bcggggcbgg cbbaactatg cggcabcaga gcagabtgba ctgagagbgc 180 

accababgcg gbgtgaaaba ccgcacagat gcgtaaggag aaaataccgc atcagabbgg 240 

ctabbggcca bbgcabacgb bgtabccaba tcabaatabg bacattbaba bbggcbcabg 300 

bccaacabba ccgccatgbb gacabbgatb abbgacbagb babbaatagt aabcaabtac 360 

ggggbcatba gbbcabagcc catababgga gtbccgcgbt acataacbba cggbaaatgg 420 

cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbatgttcc 480 

cabagbaacg ccaabaggga cbbbccabtg acgtcaabgg gbggagbabb bacggbaaac 540 

bgcccactbg gcagbacabc aagbgtatca babgccaagb acgcccccta ttgacgbcaa 600 

tgacggbaaa bggcccgccb ggcabbatgc ccagbacabg acctbabggg actbbccbac 660 

tbggcagbac abcbacgtat bagbcabcgc babbaccabg gbgabgcggb btbggcagba 720 

catcaatggg cgtggatagc ggbbbgactc acggggabbb ccaagtcbcc accccabbga 780 

cgbcaatggg agbbtgbbtt ggcaccaaaa bcaacgggac tbbccaaaat gtcgbaacaa 840 

cbccgcccca bbgacgcaaa bgggcggtag gcgbgbacgg tgggaggbcb atataagcag 900 

agcbcgbbba gbgaaccgbc agabcgccbg gagacgccab ccacgctgbb btgacctcca 960 

bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcatbg gaacgcggab 1020 

bccccgtgcc aagagbgacg baagbaccgc ctabagacbc babaggcaca cccctbbggc 1080 

bcbtatgcab gctabacbgb btbtggcbtg gggcctabac acccccgcbb ccbbabgcba 1140 

baggbgabgg batagcbtag cctabaggtg bgggbbabbg accabtabbg accactcccc 1200 

babbggtgac gatacbbtcc atbacbaabc cataacabgg cbcbbbgcca caactabctc 1260 

babbggcbab abgccaabac bcbgbcctbc agagacbgac acggacbcbg babbbbbaca 1320 

ggatggggbc ccatbbabba bfctacaaabb cacababaca acaacgccgb cccccgbgcc 1380 

cgcagbbbtb abbaaacaba gcgbgggabc bccacgcgaa bcbcgggbac gbgbtccgga 1440 

catgggcbcb tcbccggbag cggcggagcb bccacabccg agcccbggtc ccabgccbcc 1500 

agcggcbcab ggbcgcbcgg cagcbcctbg cbcctaacag bggaggccag acttaggcac 1560 

agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggtagg gbabgbgbcb 1620 

gaaaabgagc gbggagabbg ggcbcgcacg gctgacgcag abggaagacb baaggcagcg 1680 
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gcagaagaag atgcaggcag ctgagttgtt gbabbcbgab aagagbcaga ggbaactccc 1740 
gtbgcggbgc bgbbaacggb ggagggcagt gtagtctgag cagtactcgt bgcbgccgcg 1800 
cgcgccacca gacataatag cbgacagacb aacagacbgb tcctttccat gggbcbtbbc 1860 
tgcagtcacc gtcgtcgaca tgggcgcccg cgccagcgtg ctgagcggcg gcgagcbgga 1920 
ccgctgggag aagatccgcc tgcgccccgg cggcaagaag aagtacaagc tgaagcacat 1980 
cgtgtgggcc agccgcgagc tggagcgcbt cgccgtgaac cccggcctgc bggagaccag 2040 
cgagggctgc cgccagatcc tgggccagcb gcagcccagc ctgcagaccg gcagcgagga 2100 
gctgcgcagc ctgtacaaca ccgtggccac cctgtactgc gbgcaccagc gcatcgagat 2160 
caaggacacc aaggaggccc tggacaagat cgaggaggag cagaacaaga gcaagaagaa 222 0 
ggcccagcag gccgccgccg acaccggcca cagcaaccag gtgagccaga actaccccat 22 80 
cgtgcagaac atccagggcc agatggtgca ccaggccatc agcccccgca ccctgaacgc 2340 
ctgggtgaag gtggfcggagg agaaggccbt cagccccgag gtgatcccca tgttcagcgc 2400 
cctgagcgag ggcgccaccc cccaggacct gaacaccatg ctgaacaccg tgggcggcca 24 60 
ccaggccgcc atgcagatgc tgaaggagac catcaacgag gaggccgccg agtgggaccg 2520 
cgtgcacccc gtgcacgccg gccccatcgc ccccggccag atgcgcgagc cccgcggcag 2580 
cgacatcgcc ggcaccacca gcaccctgca ggagcagatc ggctggatga ccaacaaccc 2640 
ccccatcccc gtgggcgaga tctacaagcg cbggabcabc ctgggcctga acaagatcgt 2700 
gcgcatgtac agccccacca gcatcctgga cabccgccag ggccccaagg agcccttccg 27 60 
cgacbacgbg gaccgcttct acaagaccct gcgcgccgag caggccagcc aggaggtgaa 2820 
gaactggatg accgagaccc tgctggtgca gaacgccaac cccgactgca agaccatcct 2880 
gaaggccctg ggccccgccg ccaccctgga ggagatgatg accgcctgcc agggcgtggg 2940 
cggccccggc cacaaggccc gcgbgcbggc cgaggccatg agccaggtga ccaacagcgc 3 0 00 
caccabcabg atgcagcgcg gcaacttccg caaccagcgc aagabcgbga agtgcbbcaa 3060 
ctgcggcaag gagggccaca ccgcccgcaa ctgccgcgcc ccccgcaaga agggctgctg 3120 
gaagtgcggc aaggagggcc accagatgaa ggactgcacc gagcgacagg ctaataggga 3180 
agatctggcc tbcccacaag ggaaggccag ggaattttct tcagagcaga ccagagccaa 3240 
cagccccacc agaagagagc ttcaggtttg gggaagagac aacaactccc bcbcagaagc 3300 
aggagccgat agacaaggaa ctgtatcctt bagcbbcccb cagabcacbc tttggcagcg 3360 
acccctcgtc acaataaaga hagggggcca gctgaaggag gcccttctag acaccggcgc 3420 
cgacgacacc gbgcbggagg agatgaacct gcccggccgc tggaagccca agabgatcgg 34 8 0 
cggcatcggc ggcttcatca aggtgggcca gfcacgaccag abccbgabcg agatctgcgg 3540 
ccacaaggcc atcggcaccg tgctggtggg ccccaccccc gtgaacatca tcggccgcaa 3600 
cctgctgacc cagatcggct gcaccctgaa cttccccatc agccccatcg agaccgtgcc 3660 
cgtgaagcbg aagcccggca tggacggccc caaggtgaag cagtggcccc tgaccgagga 372 0 
gaagatcaag gcccbggbgg agatctgcac cgagabggag aaggagggca agatcagcaa 3780 
gatcggcccc gagaacccct acaacacccc cgtgttcgcc atcaagaaga aggacagcac 384 0 
caagtggcgc aagctggtgg acttccgcga gctgaacaag cgcacccagg acttctggga 3900 
ggfcgcagctg ggcatccccc accccgccgg cctgaagcag aagaagagcg tgaccgtgct 3960 
ggacgtgggc gacgcctact tcagcgtgcc cctggacaag gacttccgca agtacaccgc 4020 
cttcaccatc cccagcatca acaacgagac ccccggcatc cgctaccagt acaacgtgcb 4080 
gccccagggc fcggaagggca gccccgccat cttccagtgc agcatgacca agatcctgga 414 0 
gcccttccgc aagcagaacc ccgacatcgt gatctaccag tacatggacc acctgtacgt 4200 
gggcagcgac ctggagatcg gccagcaccg caccaagatc gaggagcbgc gccagcacct 4260 
gctgcgctgg ggcbbcacca cccccgacaa gaagcaccag aaggagcccc ccttcctgtg 432 0 
gatgggctac gagctgcacc ccgacaagtg gaccgtgcag cccatcgbgc bgcccgagaa 4380 
ggacagcbgg accgbgaacg acabccagaa gcbggbgggc aagcbgaacb gggccagcca 444 0 
gabcbacgcc ggcabcaagg bgcgccagcb gtgcaagctg cbgcgcggca ccaaggcccb 450 0 
gaccgaggtg gbgccccbga ccgaggaggc cgagcbggag cbggccgaga accgcgagab 4560 
ccbgaaggag cccgbgcacg gcgbgbacba cgaccccagc aaggaccbga bcgccgagab 462 0 
ccagaagcag ggccagggcc agtggaccba ccagabcbac caggagcccb bcaagaaccb 4680 
gaagaccggc aagbacgccc gcabgaaggg cgcccacacc aacgacgbga agcagctgac 474 0 
cgaggccgbg cagaagabcg ccaccgagag cabcgbgatc bggggcaaga cccccaagtb 4800 
caagctgccc abccagaagg agaccbggga ggccbggbgg accgagbacb ggcaggccac 4860 
cbggabcccc gagbgggagb bcgbgaacac cccccccctg gbgaagcbgb ggbaccagcb 4920 
ggagaaggag cccabcabcg gcgccgagac cbbcbacgbg gacggcgccg ccaaccgcga 4980 
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gaccaagctg ggcaaggccg gcbacgbgac 
gaccgacacc accaaccaga agaccgagcb 
cggcctggag gtgaacatcg tgaccgacag 
gcccgacaag agcgagagcg agctggtgag 
gaaggbgtac ctggccbggg tgcccgccca 
cggccbggbg agcgccggca tccgcaaggt 
ggaggagcac gagaagtacc acagcaactg 
ccccgtggtg gccaaggaga tcgbggccag 
catgcacggc caggtggact gcagccccgg 
gggcaaggtg atcctggtgg ccgtgcacgt 
ccccgccgag accggccagg agaccgccta 
cgtgaagacc gtgcacaccg acaacggcag 
ctgctggtgg gccggcatca agcaggagbb 
cgtgatcgag agcatgaaca aggagctgaa 
cgagcacctg aagaccgccg tgcagatggc 
cggcatcggc ggctacagcg ccggcgagcg 
gaccaaggag ctgcagaagc agatcaccaa 
cagccgcgac cccgtgtgga agggccccgc 
ggtgatccag gacaacagcg acatcaaggt 
cgactacggc aagcagatgg ccggcgacga 
gggcggcaag tggagcaaga gcagcgbgab 
ccgcgccgag cccgccgccg acggcgbggg 
cgccatcacc agcagcaaca ccgccgccaa 
ggaggaggag gaggtgggct tccccgtgac 
caaggccgcc gtggacctga gccactbccb 
ccacagccag cgccgccagg acatcctgga 
ccccgactgg cagaactaca cccccggccc 
gbgcbacaag ctggtgcccg tggagcccga 
caccagccbg ctgcaccccg tgagcctgca 
ggagtggcgc ttcgacagcc gccbggccbt 
gtacttcaag aactgctgag gatcaattct 
tgtttgcccc tcccccgtgc cttccttgac 
ctaataaaat gaggaaattg cabcgcabbg 
bggggtgggg cagcacagca agggggagga 
tgcggtgggc tctatgggta cccaggbgcb 
aaagaagcag gcacatcccc btcbctgtga 
bccagcccca ctcataggac actcatagct 
baaagbacbb ggagcggtct ctccctccct 
aagagtggga agaaattaaa gcaagatagg 
ccaacatgtg aggaagtaat gagagaaatc 
actcgctgcg ctcggtcgtt cggctgcggc 
tacggttatc cacagaatca ggggataacg 
aaaaggccag gaaccgtaaa aaggccgcgt 
ctgacgagca tcacaaaaat cgacgcbcaa 
aaagabacca ggcgbttccc ccbggaagct 
cgctbaccgg abaccbgtcc gcctttcbcc 
cacgcbgbag gtabctcagt bcggtgbagg 
aaccccccgb tcagcccgac cgcbgcgcct 
cggbaagaca cgacbbatcg ccacbggcag 
ggbabgbagg cggbgcbaca gagbbcbbga 
ggacagbatb tggbabctgc gcbcbgcbga 
gcbcbbgabc cggcaaacaa accaccgcbg 
agattacgcg cagaaaaaaa ggabcbcaag 
acgctcagbg gaacgaaaac tcacgtbaag 
tctbcaccba ^atccbttba aabbaaaaab 



cgaccgcggc cgccagaagg tggtgcccct 504 0 
gcaggccatc cacctggccc tgcaggacag 5100 
ccagbacgcc cbgggcabca tccaggccca 5160 
ccagabcatc gagcagcbga bcaagaagga 5220 
caagggcabc ggcggcaacg agcaggbgga 5280 
gctgbbccbg gacggcabcg acaaggccca 5340 
gcgcgccatg gccagcgacb bcaacctgcc 5400 
cfcgcgacaag bgccagctga agggcgaggc 5460 
catctggcag cbggcabgca cccacctgga 5520 
ggccagcggc bacatcgagg ccgaggbgab 5580 
ctbccbgctg aagctggccg gccgcbggcc 5640 
caacttcacc agcaccaccg tgaaggccgc 5700 
cggcabcccc tacaaccccc agagccaggg 5760 
gaagabcatc ggccaggtgc gcgaccaggc 5820 
cgbgbbcatc cacaacttca agcgcaaggg 5880 
catcgbggac atcabcgcca ccgacatcca 5940 
gabccagaac btccgcgtgb acbaccgcga 6000 
caagctgcbg tggaagggcg agggcgccgb 6060 
ggtgccccgc cgcaaggcca agatcatccg 6120 
cbgcgtggcc agccgccagg acgaggacat 6180 
cggctggccc gccgtgcgcg agcgcatgcg 6240 
cgccgtgagc cgcgaccbgg agaagcacgg 63 00 
caacgccgcc bgcgccbggc tggaggccca 6360 
cccccaggtg ccccbgcgcc ccatgaccba 6420 
gaaggagaag ggcggccbgg agggcctgat 64 80 
ccbgbggatc baccacaccc agggcbacbb 6540 
cggcgbgcgc baccccctga ccbtcggcbg 6600 
caaggbggag gaggccaaca agggcgagaa 6660 
cggcabggac gaccccgagc gcgaggbgcb 672 0 
ccaccacgfcg gcccgcgagc bgcaccccga 6780 
bgctgbgcct tctagttgcc agccatctgt 6840 
cctggaaggt gccactccca ctgbccttbc 6900 
bcbgagtagg tgbcatbcta ttcbgggggg 6960 
tbgggaagac aatagcaggc atgctgggga 7020 
gaagaatbga cccggttcct ccbgggccag 7080 
cacacccbgt ccacgccccb ggbtcttagb 7140 
caggagggct ccgcctbcaa tcccacccgc 7200 
cabcagccca ccaaaccaaa ccbagcctcc 7260 
cbabbaagbg cagagggaga gaaaabgccb 7320 
abagaatbbc ttccgcbbcc tcgcbcactg 7380 
gagcggtabc agcbcacbca aaggcggbaa 7440 
caggaaagaa catgtgagca aaaggccagc 7500 
tgcbggcgtt ttbccabagg ctccgccccc 7560 
gbcagaggtg gcgaaacccg acaggacbab 7620 
cccbcgtgcg cbcbccbgtb ccgaccctgc 7680 
cbbcgggaag cgbggcgctb bctcaabgct 7740 
tcgbbcgcbc caagcbgggc bgbgbgcacg 7800 
babccggtaa ctatcgbctb gagbccaacc 7860 
cagccacbgg baacaggatt agcagagcga 792 0 
agbggbggcc baacbacggc bacacbagaa 79B0 
agccagtbac cbbcggaaaa agagbbggba 8040 
gbagcggbgg ttbtbbbgbb bgcaagcagc 8100 
aagabcctbt gatcbfcbtcb acggggtcbg 8160 
ggattttggb cabgagatba fccaaaaagga 8220 
gaagbbtbaa abcaabcbaa agbabatabg 82B0 
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agtaaacbbg gtcbgacagb taccaatgct 
gtcbatttcg ttcatccata gttgcctgac 
cgtgaagaag gfcgttgctga cbcabaccag 
gtgagggagc cacggttgat gagagctttg 
tbtbgcbbtg ccacggaacg gtctgcgttg 
tcagcaaaag ttcgabtbat tcaacaaagc 
gccagtgtta caaccaatta accaattctg 
actgcaattt attcatatca ggattatcaa 
atgaaggaga aaacbcaccg aggcagttcc 
cgattccgac tcgtccaaca bcaatacaac 
tatcaagtga gaaatcacca tgagtgacga 
gcatbtctbb ccagacttgt tcaacaggcc 
catcaaccaa accgttattc attcgtgatt 
tgttaaaagg acaattacaa acaggaatcg 
catcaacaat attttcacct gaatcaggat 
cggggabcgc agtgghgagt aaccabgcab 
tcggaagagg cabaaattcc gbcagccagb 
tggcaacgct acctttgcca tgtttcagaa 
atcgatagat bgbcgcacct gattgcccga 
aatcagcatc catgttggaa tttaatcgcg 
ggctcataac accccbbgta ttactgttta 
atatatbtbt abcttgbgca atgtaacatc 
ccccccccca tbattgaagc atttatcagg 
aatgtattta gaaaaataaa caaatagggg 
ctgacgtcta agaaaccatt attatcatga 
ggcccbtbcg tc 
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taatcagtga ggcacctatc tcagcgatct 8340 
tccggggggg gggggcgctg aggtcbgccb 8400 
gcctgaatcg ccccatcafcc cagccagaaa 8460 
ttgtaggtgg accagttggt gattttgaac 8520 
tcgggaagat gcgbgabcbg atccbbcaac 8580 
cgccgtcccg bcaagtcagc gtaatgctcfc 8640 
attagaaaaa ctcatcgagc atcaaatgaa 8700 
taccatattt ttgaaaaagc cgbtbcbgta 8760 
ataggatggc aagatcctgg batcggbctg 8820 
ctattaattt cccctcgbca aaaataaggt 8880 
ctgaatccgg tgagaatggc aaaagcttat 8940 
agccattacg ctcgtcatca aaatcacbcg 9000 
gcgccbgagc gagacgaaat acgcgatcgc 9060 
aabgcaaccg gcgcaggaac acbgccagcg 9120 
atbcttcbaa bacctggaat gcbgttbtcc 9180 
cabcaggagt acggabaaaa bgcbtgabgg 9240 
tbagtcbgac cabcbcabcb gtaacabcat 9300 
acaacbctgg cgcabcgggc bbcccabaca 9360 
catbatcgcg agcccabbta tacccabata 9420 
gcctcgagca agacgtttcc cgttgaabat 9480 
bgbaagcaga cagtbttabt gttcabgabg 9540 
agagatbbbg agacacaacg tggcbbbccc 9600 
gbbabtgbcb cabgagcgga tacatabbbg 9660 
bbccgcgcac abbbccccga aaagbgccac 9720 
cabbaaccba baaaaabagg cgbabcacga 9780 

9792 



<210> 171 
<211> 9167 
<212> DNA 

<213> Arbificial Sequence 

* 

<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 171 

tcgcgcgttb cggtgatgac ggtgaaaacc bctgacacat gcagctcccg gagacggtca 60 
cagcbttjtcb gtaagcggat gccgggagca gacaagcccg bcagggcgcg tcagcgggbg 120 
btggcgggbg tcggggcbgg cbtaactabg cggcabcaga gcagabbgba ctgagagbgc 180 
accabatgcg gtgbgaaaba ccgcacagab gcgtaaggag aaaabaccgc atcagatbgg 240 
cbabbggcca tbgcatacgb tgbabccaba tcataabatg bacabttata ttggcbcabg 300 
tccaacatta qcgccatgbt gacabbgatb abtgactagt tattaatagb aatcaabbac 360 
ggggbcabba gbtcatagcc cabababgga gbbccgcgbt acataactta cggbaaatgg 420 
cccgccbggc bgaccgccca acgacccccg cccatbgacg bcaataabga cgbatgbbcc 480 
cabagbaacg ccaabaggga ctbtccattg acgtcaabgg gtggagbatb bacggbaaac 540 
bgcccacbtg gcagbacabc aagbgtatca babgccaagb acgcccccta btgacgtcaa 600 
bgacggbaaa bggcccgccb ggcabbatgc ccagtacabg acctbatggg actbbcctac 660 
btggcagbac atctacgtat bagbcabcgc tabbaccatg gtgatgcggb btbggcagta 720 
cabcaabggg cgtggabagc ggbbbgacbc acggggabbb ccaagbctcc accccabbga 780 
cgbcaatggg agtbtgbtbb ggcaccaaaa bcaacgggac bbtccaaaab gbcgtaacaa 840 
cbccgcccca bbgacgcaaa tgggcggbag gcgtgbacgg .bgggaggtct atabaagcag 900 
agcbcgbbta gbgaaccgtc agatcgccbg gagacgccab ccacgcbgtb ttgaccbcca 960 
tagaagacac cgggaccgat ccagccbccg cggccgggaa cggtgcabbg gaacgcggab .1020 
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tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tctfcabgcab gcbabacbgb bbbbggcbtg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tabagcbbag ccfcabaggtg tgggbtabtg accattattg accactcccc 1200 
batbggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactabcbc 1260 
tattggctat atgccaatac tcbgbccbtc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccabbbabba ttbacaaatb cacatataca acaacgccgt cccccgtgcc 13B0 
cgcagttttt attaaacata gcgbgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggcbcb tctccggtag cggcggagct tccacabccg agccctggbc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaabgagc gbggagabbg ggcbcgcacg gcbgacgcag abggaagact baaggcagcg 1680 
gcagaagaag abgcaggcag ctgagtbgtb gtabbctgat aagagbcaga ggtaacbccc 1740 
gbtgcggbgc bgbtaacggt ggagggcagt gbagbctgag cagbacbcgt bgcbgccgcg 1800 
cgcgccacca gacabaatag ctgacagact aacagacbgt tccbtbccab gggbctbtbc 1860 
bgcagtcacc gbcgbcgaca bggcccgcgc cagcgtgcbg agcggcggcg agcbggaccg 1920 
cbgggagaag abccgccbgc gccccggcgg caagaagaag tacaagctga agcacabcgt 1980 
gbgggccagc cgcgagcbgg agcgcbtcgc cgtgaacccc ggcctgctgg agaccagcga 2040 
gggctgccgc cagabcctgg gccagcbgca gcccagccbg cagaccggca gcgaggagcb 2100 
gcgcagcctg tacaacaccg tggccaccct gbactgcgtg caccagcgca tcgagatcaa 2160 
ggacaccaag gaggccctgg acaagatcga ggaggagcag aacaagagca agaagaaggc 2220 
ccagcaggcc gccgccgaca ccggccacag caaccaggtg agccagaact accccatcgb 2280 
gcagaacatc cagggccaga tggtgcacca ggccatcagc ccccgcaccc bgaacgccbg 234 0 
ggtgaaggtg gbggaggaga aggccbbcag ccccgaggbg abccccabgt tcagcgcccb 2400 
gagcgagggc gccacccccc aggaccbgaa caeca tgcbg aacaccgtgg gcggccacca 24 60 
ggccgccafcg cagatgebga aggagaccab caacgaggag gccgccgagb gggaccgcgb 2520 
gcaccccgtg cacgccggcc ccatcgcccc eggecagabg cgcgagcccc geggcagega 2580 
catcgccggc accaccagca ccctgcagga geagategge tggatgacca acaacccccc 2640 
catccccgbg ggcgagabcb acaagegctg gatcabecbg ggectgaaca agategbgeg 2700 
catgtacagc cccaccagca bectggacat ccgccagggc cccaaggagc ccbbccgcga 2760 
cbacgtggac cgcbtcbaca agacccbgcg cgccgagcag gccagccagg aggbgaagaa 2820 
ctggatgacc gagaccctgc fcggbgcagaa cgccaacccc gaetgeaaga ccatccbgaa 2 880 
ggccctgggc cccgccgcca ccctggagga gabgabgacc gcctgccagg gcgtgggcgg 294 0 
ccccggccac aaggcccgcg bgctggcega ggecatgage caggtgacca acagcgccac 3000 
catcatgatg cagcgcggca acbbccgcaa ccagcgcaag abegbgaagb getbeaacbg 3060 
eggcaaggag ggccacaccg cccgcaactg ccgcgccccc cgcaagaagg gctgctggaa 3120 
gtgcggcaag gagggccacc agatgaagga ctgcaccgag cgacaggcta abagggaaga 3180 
bctggccbtc ccacaaggga aggecaggga abtbbcbbca gagcagacca gagecaacag 3240 
ccccaccaga agagagctbc aggbbbgggg aagagacaac aactccctct cagaagcagg 3300 
agecgataga caaggaacbg tatccttbag cbtcccbcag abeacbebtb ggcagcgacc 3360 
cctcgtcaca abaaagabag ggggccagcb gaaggaggee ebbebagaca ccggcgccga 3420 
cgacaccgbg cbggaggaga bgaacctgcc cggccgctgg aageccaaga bgabeggegg 3480 
catcggcggc bbcabcaagg bgggccagba cgaccagabc cbgatcgaga tctgcggcca 354 0 
caaggccatc ggcaccgtgc bggbgggccc cacccccgbg aacatcatcg gccgcaacct 3600 
gcbgacccag ateggebgea cccbgaacbb ccccatcagc cccatcgaga ccgbgcccgt 3660 
gaagctgaag cccggcabgg acggccccaa ggbgaagcag tggcccctga ccgaggagaa 3720 
gabcaaggee cbggbggaga bcbgcaccga gabggagaag gagggcaaga bcagcaagat 3780 
cggccccgag aaccccbaca acacccccgb gbbcgccabc aagaagaagg acagcaccaa 3840 
gtggcgcaag cbggbggacb bccgcgagcfc gaacaagege acccaggact tcbgggaggb 3900 
gcagctgggc abcccccacc ccgccggccb gaagcagaag aagagcgbga ccgtgcbgga 3 960 
egbgggegae gccbacbbca gcgbgccccb ggacaaggac ttccgcaagt acaccgcctb 4 020 
caccatcccc agcabcaaca acgagacccc cggcatccgc baccagtaca acgbgcbgcc 4 080 
ccagggctgg aagggcagee ccgccabcbb ccagbgcagc atgaccaaga bccbggagcc 4140 
cbbccgcaag cagaaccccg acabegbgab cbaccagbac atggaccacc bgtacgbggg 4200 
cagcgacctg gagabeggee agcaccgcac caagabegag gagcbgcgcc agcaccbgcb 4260 
gcgcbggggc tbcaccaccc ccgacaagaa gcaccagaag gagccccccb bccbgbggab 4320 
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gggctacgag cbgcaccccg acaagtggac 
cagctggacc gtgaacgaca tccagaagct 
ctacgccggc atcaaggtgc gccagctgtg 
c 9 a ggtggtg cccctgaccg aggaggccga 
gaaggagccc gtgcacggcg bgtacbacga 
gaagcagggc cagggccagb ggacctacca 
gaccggcaag tacgcccgca bgaagggcgc 
ggccgtgcag aagatcgcca ccgagagcat 
gctgcccatc cagaaggaga cctgggaggc 
gabccccgag tgggagttcg tgaacacccc 
gaaggagccc atcatcggcg ccgagacctt 
caagctgggc aaggccggct acgbgaccga 
cgacaccacc aaccagaaga ccgagctgca 
ccbggaggbg aacabcgbga ccgacagcca 
cgacaagagc gagagcgagc tggtgagcca 
ggtgtacctg gcctgggtgc ccgcccacaa 
cctggbgagc gccggcatcc gcaaggtgct 
ggagcacgag aagtaccaca gcaactggcg 
cghggtggcc aaggagatcg tggccagctg 
gcacggccag gtggactgca gccccggcat 
caaggtgatc ctggtggccg tgcacgtggc 
cgccgagacc ggccaggaga ccgcctactt 
gaagaccgbg cacaccgaca acggcagcaa 
ctggtgggcc ggcatcaagc aggagtbcgg 
gatcgagagc atgaacaagg agctgaagaa 
gcacctgaag accgccgtgc agatggccgt 
catcggcggc bacagcgccg gcgagcgcat 
caaggagctg cagaagcaga tcaccaagat 
ccgcgacccc gbgtggaagg gccccgccaa 
gabccaggac aacagcgaca tcaaggtggt 
ctacggcaag cagabggccg gcgacgactg 
atbcbbgcbg tgccttctag bbgccagcca 
ttgacccbgg aaggtgccac tcccactgtc 
catbgtcbga gtaggtgtca ttctattctg 
gaggattggg aagacaatag caggcabgct 
gtgcbgaaga abbgacccgg bbccbccbgg 
bgbgacacac ccbgbccacg cccctggbbc 
tagcbcagga gggcbccgcc ttcaatccca 
bccctcatca gcccaccaaa ccaaaccbag 
ataggctatb aagbgcagag ggagagaaaa 
aaabcabaga abttctbccg cbbcctcgct 
gcggcgagcg gbabcagctc actcaaaggc 
taacgcagga aagaacabgb gagcaaaagg 
cgcgbtgcbg gcgbbttbcc abaggcbccg 
cbcaagbcag aggtggcgaa acccgacagg 
aagctcccbc gbgcgcbcbc cbgttccgac 
bcbcccbbcg ggaagcgbgg cgcbtbcbca 
gbaggbcgbb cgcbccaagc tgggcbgbgt 
cgccbbabcc ggtaacbabc gbcbbgagbc 
ggcagcagcc actggtaaca ggabbagcag 
cbtgaagbgg bggccbaact acggctacac 
gctgaagcca gbtaccbtcg gaaaaagagt 
cgcbggbagc ggbggbbtbt tbgbbbgcaa 
bcaagaagab ccbbbgatcb tbbcbacggg 
bbaagggabb ttggtcatga gabbabcaaa 



cgtgcagccc abcgbgcbgc ccgagaagga 43 80 
ggtgggcaag cbgaacbggg ccagccagab 4440 
caagcbgctg cgcggcacca aggcccbgac 4500 
gcbggagcbg gccgagaacc gcgagabccb 4550 
ccccagcaag gaccbgabcg ccgagabcca 4620 
gatcbaccag gagccctbca agaaccbgaa 4680 
ccacaccaac gacgbgaagc agcbgaccga 4740 
cgbgabcbgg ggcaagaccc ccaagbbcaa 4800 
cbggbggacc gagbacbggc aggccaccbg 4860 
cccccbggbg aagctgbggb accagcbgga 4920 
cbacgbggac ggcgccgcca accgcgagac 4980 
ccgcggccgc cagaaggbgg bgccccbgac 5040 
ggccatccac cbggcccbgc. aggacagcgg 5100 
gtacgcccbg ggcatcatcc aggcccagcc 5160 
gatcabcgag cagcbgafcca agaaggagaa 522 0 
gggcabcggc ggcaacgagc aggbggacgg 52 80 
gtbccbggac ggcatcgaca aggcccagga 5340 
cgccabggcc agcgacbbca accbgccccc 5400 
cgacaagtgc cagcbgaagg gcgaggccat 5460 
cbggcagcbg gcabgcaccc acctggaggg 5520 
cagcggcbac abcgaggccg aggtgabccc 5580 
ccbgcbgaag cbggccggcc gcbggcccgb 5640 
cbbcaccagc accaccgbga aggccgccbg 5700 
cabccccbac aacccccaga gccagggcgb 5760 
gabcabcggc caggtgcgcg accaggccga 5820 
gbbcabccac aactbcaagc gcaagggcgg 5880 
cgtggacabc atcgccaccg acabccagac 5940 
ccagaacbbc cgcgbgbacb accgcgacag 6000 
gctgcbgfcgg aagggcgagg gcgccgtggt 6060 
gccccgccgc aaggccaaga tcatccgcga 6120 
cgtggccagc cgccaggacg aggacbagga 6180 
bcbgttgtbb gccccbcccc cgtgccbbcc 6240 
cbtbccbaab aaaabgagga aattgcatcg 6300 
gggggtgggg bggggcagca cagcaagggg 6360 
ggggatgcgg bgggcbcbat gggbacccag 6420 
gccagaaaga agcaggcaca bccccbbcbc 64 80 
tbagbbccag ccccacbcab aggacactca 6540 
cccgctaaag bactbggagc ggtcbcbccc 6600 
ccbccaagag bgggaagaaa bbaaagcaag 6660 
bgccbccaac abgbgaggaa gbaatgagag 6720 
cactgacbcg cbgcgctcgg bcgtbcggcb 6780 
ggbaabacgg bbabccacag aabcagggga 6840 
ccagcaaaag gccaggaacc gbaaaaaggc 6900 
ccccccbgac gagcabcaca aaaatcgacg 6960 
acbabaaaga taccaggcgb tbcccccbgg 7020 
ccbgccgcbb accggabacc tgbccgccbt 7080 
abgcbcacgc tgtaggbabc tcagbbcggb 7140 
gcacgaaccc cccgtbcagc ccgaccgcbg 7200 
caacccggba agacacgact babcgccacb 7260 
agcgaggbat gbaggcggtg cbacagagbb 7320 
bagaaggaca gbabbbggba bcbgcgcbct 7380 
bggbagcbct bgabccggca aacaaaccac 7440 
gcagcagabt acgcgcagaa aaaaaggabc 7500 
gtcbgacgct cagbggaacg aaaacbcacg 7560 
aaggatcbbc acctagabcc bbbbaaabta 7620 
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aaaatgaagb tttaaatcaa tctaaagtat 
atgcttaatc agtgaggcac cbabcbcagc 
cbgacbccgg gggggggggg cgcbgaggtc 
accaggcctg aatcgcccca tcatccagcc 
ctttgttgta ggtggaccag ttggtgattt 
cgbbgtcggg aagatgcgtg atcbgabcct 
aaagccgccg tcccgtcaag bcagcgbaab 
ttctgattag aaaaactcat cgagcatcaa 
atcaatacca babbbbbgaa aaagccgttt 
gttccatagg atggcaagat cctggtatcg 
acaacctatt aabttccccb cgtcaaaaat 
gacgactgaa tccggtgaga atggcaaaag 
aggccagcca bbacgcbcgb catcaaaatc 
bgabbgcgcc tgagcgagac gaaatacgcg 
aatcgaatgc aaccggcgca ggaacactgc 
aggatattct tctaatacct ggaatgctgt 
tgcatcatca ggagtacgga taaaatgctt 
ccagtttagt ctgaccatct cabcbgbaac 
cagaaacaac tcbggcgcab cgggcttccc 
cccgacatta tcgcgagccc atttataccc 
tcgcggcctc gagcaagacg tfctcccgttg 
gbbbabgbaa gcagacagbb bbattgbtca 
acatcagaga btbtgagaca caacgtggct 
tcagggttat tgtctcatga gcggatacat 
aggggttccg cgcacatttc cccgaaaagt 
catgacatta acctataaaa ataggcgtat 



atatgagtaa acttggtctg acagttacca 7680 
gatctgtcta tttcgttcat ccatagttgc 7740 
bgccbcgbga agaaggtgtt gcbgacbcab 7800 
agaaagtgag ggagccacgg ttgatgagag 7860 
bgaacbbbbg cbbbgccacg gaacggtctg 7920 
tcaactcagc aaaagttcga bbtatbcaac 7980 
gctctgccag tgttacaacc aattaaccaa 8040 
atgaaactgc aahttattca tatcaggatt 8100 
ctgtaatgaa ggagaaaact caccgaggca 8160 
gtctgcgatt ccgactcgtc caacatcaat 8220 
aaggttatca agtgagaaat caccatgagt 8280 
cbbabgcabb tctttccaga cbbgbbcaac 8340 
actcgcatca accaaaccgb batbcabbcg 8400 
abcgcbgbba aaaggacaat tacaaacagg 8460 
cagcgcatca acaabatttb cacctgaatc 8520 
tttcccgggg atcgcagtgg tgagtaacca 8580 
gatggtcgga agaggcataa attccgtcag 8640 
atcattggca acgctacctt tgccatgttt 8700 
atacaatcga tagattgtcg caccbgabbg 8760 
atataaatca gcatccatgt tggaatbbaa 8820 
aababggcbc abaacacccc ttgbabbacb 8880 
bgabgataba tbtbtatctb gbgcaatgta 8940 
bbcccccccc ccccabbabb gaagcattta 9000 
abbbgaabgb abbbagaaaa ataaacaaab 9060 
gccacctgac gbctaagaaa ccabbabbab 9120 
cacgaggccc bbtcgbc 9167 



<210> 172 
<211> 9783 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 172 

bcgcgcgbbb cggtgabgac ggbgaaaacc bcbgacacab gcagcbcccg gagacggbca 60 
cagcbtgtcb gtaagcggab. gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
bbggcgggtg bcggggctgg cbbaacbatg cggcabcaga gcagabbgta ctgagagbgc 180 
accababgcg gtgbgaaaba ccgcacagat gcgbaaggag aaaabaccgc abcagattgg 24 0 
ctabbggcca bbgcatacgb tgbabccata tcabaababg bacabtbaba bbggcbcabg 300 
bccaacabba ccgccatgbb gacatbgabt abbgacbagb babbaabagb aabcaabbac 360 
ggggbcabba gbtcabagcc catababgga gbbccgcgbb acabaacbba cggbaaabgg 420 
cccgccbggc bgaccgccca acgacccccg cccabbgacg bcaabaabga cgbabgbbcc 480 
catagbaacg ccaabaggga cbbbccabbg acgbcaabgg gbggagbabb tacggbaaac 540 
tgcccacbbg gcagbacatc aagbgbabca babgccaagb acgcccccba tbgacgbcaa 600 
bgacggbaaa bggcccgccb ggcabbatgc ccagbacabg accbtabggg acbbbccbac 660 
bbggcagbac abcbacgbat bagbcabcgc batbaccabg gbgabgcggb bttggcagba 720 
cabcaatggg cgbggabagc ggbbtgacbc acggggatbb ccaagbcbcc accccabbga 780 
cgbcaatggg agbbbgbbbb ggcaccaaaa bcaacgggac btbccaaaab gbcgtaacaa 840 
cbccgcccca bbgacgcaaa bgggcggbag gcgbgbacgg bgggaggbcb ababaagcag 900 
agcbcgbbba gbgaaccgbc agabcgcctg gagacgccab ccacgcbgbb bbgacctcca 960 
bagaagacac cgggaccgab ccagccbccg cggccgggaa cggbgcabtg gaacgcggab 1020 
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tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttatgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattatfcg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctttgcca caactatcfcc 1260 
tattggctafc atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttt atfcaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 1440 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acfctaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccafc gggtcttthc 1860 
tgcagtcacc gtcgtcgaca tggcccgcgc cagcgtgctg agcggcggcg agctggaccg 1920 
ctgggagaag atccgcctgc gccccggcgg caagaagaag tacaagctga agcacatcgt 1980 
gtgggccagc cgcgagctgg agcgcttcgc cgtgaacccc ggcctgctgg agaccagcga 2 04 0 
gggctgccgc cagatcctgg gccagctgca gcccagcctg cagaccggca gcgaggagct 2100 
gcgcagcctg tacaacaccg tggccaccct gtactgcgtg caccagcgca tcgagatcaa 2160 
ggacaccaag gaggccctgg acaagatcga ggaggagcag aacaagagca agaagaaggc 2220 
ccagcaggcc gccgccgaca ccggccacag caaccaggtg agccagaact accccatcgt 2280 
gcagaacatc cagggccaga tggtgcacca ggccafccagc ccccgcaccc tgaacgcctg 2340 
ggtgaaggtg gtggaggaga aggccttcag ccccgaggtg atccccatgt tcagcgccct 24 00 
gagcgagggc gccacccccc aggacctgaa caccatgctg aacaccgtgg gcggccacca 2460 
ggccgccatg cagatgctga aggagaccat caacgaggag gccgccgagt gggaccgcgt 2520 
gcaccccgtg cacgccggcc ccatcgcccc cggccagatg cgcgagcccc gcggcagcga 2580 
catcgccggc accaccagca ccctgcagga gcagatcggc tggatgacca acaacccccc 2640 
catccccgtg ggcgagatct acaagcgctg gatcafccctg ggcctgaaca agafccgtgcg 2700 
catgtacagc cccaccagca tcctggacat ccgccagggc cccaaggagc ccttccgcga 2760 
ctacgtggac cgcttctaca agaccctgcg cgccgagcag gccagccagg aggtgaagaa 2 820 
ctggatgacc gagaccctgc tggtgcagaa cgccaacccc gactgcaaga ccatcctgaa 2880 
ggccctgggc cccgccgcca ccctggagga gatgatgacc gcctgccagg gcgtgggcgg 2940 
ccccggccac aaggcccgcg tgctggccga ggccatgagc caggtgacca acagcgccac 3000 
catcatgatg cagcgcggca acttccgcaa ccagcgcaag atcgtgaagt gcttcaactg 3060 
cggcaaggag ggccacaccg cccgcaactg ccgcgccccc cgcaagaagg gctgctggaa 3120 
gtgcggcaag gagggccacc agatgaagga ctgcaccgag cgacaggcta atagggaaga 3180 
tctggccttc ccacaaggga aggccaggga attttcttca gagcagacca gagccaacag 324 0 
ccccaccaga agagagcttc aggtttgggg aagagacaac aactccctct cagaagcagg 3300 
agccgataga caaggaactg tatcctttag ctfcccctcag atcactcttt ggcagcgacc 3360 
cctcgtcaca ataaagatag ggggccagct gaaggaggcc cttctagaca ccggcgccga 3420 
cgacaccgtg ctggaggaga tgaacctgcc cggccgctgg aagcccaaga tgatcggcgg 3480 
catcggcggc ttcatcaagg tgggccagta cgaccagatc ctgatcgaga tctgcggcca 3540 
caaggccatc ggcaccgtgc tggtgggccc cacccccgtg aacatcafccg gccgcaacct 3600 
gctgacccag atcggctgca ccctgaactt ccccatcagc cccatcgaga ccgtgcccgt 3660 
gaagctgaag cccggcatgg acggccccaa ggtgaagcag tggcccctga ccgaggagaa 3720 
gatcaaggcc ctggtggaga bctgcaccga gatggagaag gagggcaaga tcagcaagat 3780 
cggccccgag aacccctaca acacccccgt gttcgccatc aagaagaagg acagcaccaa 3840 
gtggcgcaag ctggtggact tccgcgagct gaacaagcgc acccaggact tctgggaggt 3900 
gcagctgggc atcccccacc ccgccggcct gaagcagaag aagagcgtga ccgtgctgga 3960 
cgtgggcgac gcctacttca gcgfcgcccch ggacaaggac ttccgcaagt acaccgcctt 4020 
caccatcccc agcatcaaca acgagacccc cggcatccgc taccagtaca acgtgctgcc 4080 
ccagggctgg aagggcagcc ccgccatctt ccagtgcagc atgaccaaga tcctggagcc 4140 
ctfcccgcaag cagaaccccg acafccgtgat ctaccagtac atggaccacc tgtacgtggg 4200 
cagcgacctg gagatcggcc agcaccgcac caagatcgag gagctgcgcc agcacctgct 4260 
gcgctggggc ttcaccaccc ccgacaagaa gcaccagaag gagcccccct tcctgtggat 4320 
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gggctacgag ctgcaccccg acaagtggac 
cagctggacc gbgaacgaca tccagaagct 
ctacgccggc atcaaggtgc gccagctgtg 
cgaggtggtg cccctgaccg aggaggccga 
gaaggagccc gbgcacggcg tgtactacga 
gaagcagggc cagggccagb ggacctacca 
gaccggcaag tacgcccgca bgaagggcgc 
ggccgtgcag aagabcgcca ccgagagcat 
gcbgcccabc cagaaggaga cctgggaggc 
gatccccgag bgggagttcg tgaacacccc 
gaaggagccc abcabcggcg ccgagacctt 
caagctgggc aaggccggcfc acgtgaccga 
cgacaccacc aaccagaaga ccgagctgca 
ccbggaggbg aacabcgbga ccgacagcca 
cgacaagagc gagagcgagc bggbgagcca 
ggtgtacctg gcctgggtgc ccgcccacaa 
cctggtgagc gccggcatcc gcaaggtgct 
ggagcacgag aagtaccaca gcaactggcg 
cgtggtggcc aaggagatcg tggccagctg 
gcacggccag gtggactgca gccccggcat 
caaggtgatc cbggbggccg tgcacgtggc 
cgccgagacc ggccaggaga ccgccbacbt 
gaagaccgbg cacaccgaca acggcagcaa 
cbggtgggcc ggcabcaagc aggagttcgg 
gatcgagagc abgaacaagg agctgaagaa 
gcacctgaag accgccgbgc agacggccgb 
cafccggcggc tacagcgccg gcgagcgcat 
caaggagctg cagaagcaga tcaccaagat 
ccgcgacccc gtgtggaagg gccccgccaa 
gatccaggac aacagcgaca tcaaggtggt 
ctacggcaag cagatggccg gcgacgacbg 
cggcaagbgg agcaagagca gcgtgatcgg 
cgccgagccc gccgccgacg gcgbgggcgc 
cabcaccagc agcaacaccg ccgccaacaa 

99 a 99 a 99 a 9 gtgggcttcc ccgtgacccc 
ggccgccgbg gaccbgagcc actbcctgaa 
cagccagcgc cgccaggaca bccbggaccb 
cgacbggcag aacbacaccc ccggccccgg 
cbacaagcbg gbgcccgbgg agcccgacaa 
cagccbgctg caccccgbga gccbgcacgg 
gbggcgcbtc gacagccgcc tggccbtcca 
cbtcaagaac tgcbgaggab caabbcbbgc 
tbgcccctcc cccgtgcctb cccbgacccb 
abaaaabgag gaaattgcab cgcabbgbcb 
99bggggcag cacagcaagg gggaggatbg 
ggbgggcbcb abgggtaccc aggbgcbgaa 
gaagcaggca cabccccttc tcbgbgacac 
agccccactc abaggacacb catagcbcag 
agbacbbgga gcggbctcbc ccbcccbcat 
agbgggaaga aatbaaagca agabaggcba 
acabgtgagg aagbaabgag agaaatcata 
cgcbgcgcbc ggbcgbbcgg ctgcggcgag 
ggtbatccac agaabcaggg gabaacgcag 
aggccaggaa ccgbaaaaag gccgcgbbgc 
acgagcabca caaaaabcga cgctcaagbc 



cgbgcagccc abcgbgcbgc ccgagaagga 4380 
ggtgggcaag cbgaacbggg ccagccagab 44 4 0 
caagctgcbg cgcggcacca aggcccbgac 4 500 
gcbggagcbg gccgagaacc gcgagabccb 4560 
ccccagcaag gacctgabcg ccgagabcca 4620 
gabcbaccag gagcccbbca agaaccbgaa 4680 
ccacaccaac gacgbgaagc agcbgaccga 4740 
cgbgabcbgg ggcaagaccc ccaagbbcaa 4800 
cbggbggacc gagbactggc aggccaccbg 4 860 
cccccbggbg aagcbgtggt accagcbgga 4920 
cbacgbggac ggcgccgcca accgcgagac 4980 
ccgcggccgc cagaaggtgg bgcccctgac 5 040 
ggccatccac ctggcccbgc aggacagcgg 5100 
gbacgcccbg ggcabcabcc aggcccagcc 5160 
gabcatcgag cagcbgabca agaaggagaa 5220 
gggcabcggc ggcaacgagc aggbggacgg 5280 
gbbccbggac ggcabcgaca aggcccagga 5340 
cgccabggcc agcgacbbca accbgccccc 5400 
cgacaagbgc cagcbgaagg gcgaggccab 5460 
cbggcagcbg gcabgcaccc accbggaggg 552 0 
cagcggcbac abcgaggccg aggbgatccc 5580 
ccbgctgaag ctggccggcc gcbggcccgb 564 0 
cbbcaccagc accaccgbga aggccgccbg 5700 
catccccbac aacccccaga gccagggcgt 5760 
gabcabcggc. caggtgcgcg accaggccga 582 0 
gbbcabccac aacbbcaagc gcaagggcgg 5880 
cgbggacabc abcgccaccg acatccagac 5940 
ccagaacbtc cgcgtgtacb accgcgacag 6000 
gctgcbgbgg aagggcgagg gcgccgbggb 6060 
gccccgccgc aaggccaaga bcabccgcga 6120 
cgtggccagc cgccaggacg aggacabggg 6180 
cbggcccgcc gbgcgcgagc gcabgcgccg 6240 
cgbgagccgc gacctggaga agcacggcgc 63 00 
cgccgcctgc gccbggcbgg aggcccagga 6360 
ccaggbgccc cbgcgcccca bgaccbacaa 6420 
ggagaagggc ggccbggagg gccbgabcca 6480 
gbggabcbac cacacccagg gcbacbbccc 6540 
cgbgcgcbac cccctgaccb bcggcbggbg 6600 
ggtggaggag gccaacaagg gcgagaacac 6660 
catggacgac cccgagcgcg aggtgcbgga 6720 
ccacgbggcc cgcgagcbgc accccgagba 6780 
bgbgccbbcb agbbgccagc cabcbgbbgb 6840 
ggaaggbgcc acbcccacbg bccbbtccta 6900 
gagbaggbgb cabbcbabbc bggggggbgg 6960 
ggaagacaab agcaggcabg cbggggabgc 7020 
gaabbgaccc ggbtccbcct gggccagaaa 7080 
acccbgbcca cgccccbggt bcbbagbbcc 7140 
gagggcbccg ccbbcaabcc cacccgcbaa 7200 
cagcccacca aaccaaacct agccbccaag 7260 
bbaagbgcag agggagagaa aatgccbcca 7320 
gaabbbcbbc cgcbbcctcg cbcacbgacb 7380 
cggbabcagc bcacbcaaag gcggbaabac 7440 
gaaagaacab gbgagcaaaa ggccagcaaa 7500 
bggcgbbbbb ccabaggcbc cgccccccbg 7560 
a 9 a 99bggcg aaacccgaca ggactabaaa 7620 
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gabaccaggc gtttccccct ggaagctccc 
ttaccggata cctgtccgcc tttctccctt 
gctgtaggta tctcagttcg gtgtaggtcg 
cccccgbtca gcccgaccgc tgcgccttat 
taagacacga cttatcgcca cbggcagcag 
abgbaggcgg bgcbacagag bbcbbgaagb 
cagtatttgg batctgcgcb ctgctgaagc 
cttgatccgg caaacaaacc accgcbggba 
gcagaaaaaa aggatctcaa gaagabccbb 
ggaacgaaaa ctcacgttaa gggabbbtgg 
agatcctbbb aaattaaaaa bgaagtttta 
ggbcbgacag btaccaatgc btaabcagtg 
gttcatccab agtbgcctga ctccgggggg 
ggtgttgcbg acbcabacca ggccbgaabc 
ccacggbtga bgagagcttt gbbgbaggbg 
gccacggaac ggbctgcgbt gbcgggaaga 
gttcgabbta btcaacaaag ccgccgbccc 
acaaccaatt aaccaattct gabbagaaaa 
battcatatc aggattatca abaccatabt 
aaaactcacc gaggcagtbc cabaggatgg 
cbcgbccaac abcaabacaa ccbatbaabt 
agaaatcacc atgagbgacg actgaabccg 
bccagactbg btcaacaggc cagccabtac 
aaccgtbabb catbcgtgat tgcgccbgag 
gacaatbaca aacaggaatc gaabgcaacc 
tatbtbcacc tgaabcagga battcbbcta 
cagtggtgag baaccatgca bcabcaggag 
gcabaaabbc cgtcagccag bbbagbctga 
baccbttgcc atgbtbcaga aacaactctg 
bbgbcgcacc bgabtgcccg acattabcgc 
ccatgtbgga attbaabcgc ggccbcgagc 
caccccbbgb abtacbgttt abgtaagcag 
tatcbtgtgc aatgtaacat cagagattbt 
abtabtgaag catttatcag ggttabbgbc 
agaaaaabaa acaaabaggg gbtccgcgca 
aagaaaccat babtabcabg acabbaacct 
gbc 



bcgbgcgcbc bccbgbtccg acccbgccgc 7680 

cgggaagcgb ggcgcbttcb caabgctcac 7740 

ttcgcbccaa gctgggctgb gbgcacgaac 7800 

ccggbaacta tcgbcbtgag tccaacccgg 7860 

ccacbggbaa caggattagc agagcgaggb 7920 

ggbggccbaa ctacggcbac actagaagga 7980 

cagbbacctb cggaaaaaga gbbggbagcb 8040 

gcggbggtgb ttgcaagcag cagabbacgc 8100 

tgabcbbbtc tacggggbct gacgcbcagb 8160 

bcabgagatb atcaaaaagg abcbtcacct 8220 

aabcaatcta aagbatabat gagbaaacbb 8280 

aggcacctat ctcagcgabc tgtcbabtbc 8340 

ggggggcgct gaggtcbgcc tcgbgaagaa 84.00 

gccccatcab ccagccagaa agtgagggag 84 60 

gaccagbbgg tgattbtgaa cbtttgctbb 8520 

bgcgbgabcb gatccbtcaa cbcagcaaaa 8580 

gbcaagtcag cgbaabgctc tgccagbgbb 8640 

acbcabcgag catcaaabga aacbgcaabb 8700 

bbbgaaaaag ccgbbbcbgt aatgaaggag 8760 

caagabccbg gbabcggbcb gcgabbcc'ga 8820 

bccccbcgbc aaaaabaagg bbabcaagbg B880 

gbgagaabgg caaaagcbta bgcatbbcbt 8940 

gcbcgbcabc aaaabcactc gcatcaacca 9000 

cgagacgaaa bacgcgabcg cbgbtaaaag 9060 

ggcgcaggaa cacbgccagc gcabcaacaa 9120 

abaccbggaa bgcbgbbbbc ccggggatcg 9180 

tacggabaaa abgcbbgabg gbcggaagag 9240 

ccatcbcabc tgtaacabca ttggcaacgc 9300 

gcgcabcggg cbtcccabac aabcgabaga 9360 

gagcccatbb abacccabat aaabcagcab 94 20 

aagacgbbbc ccgbbgaaba bggcbcabaa 94 80 

acagbbbbab bgtbcatgat gabababttb 9540 

gagacacaac gbggcbbbcc cccccccccc 9600 

bcatgagcgg abacababbt gaabgbatbb 9660 

catbbccccg aaaagbgcca ccbgacgbcb 9720 

abaaaaabag gcgbabcacg aggcccbtbc 9780 

9783 



<210> 173 
<211> 9782 
<212> DNA 

<213> Arbificial Sequence 
<220> 

<223> plasmid pVR1012x/s conbaining HIV genes 
<400> 173 

tcgcgcgbbb cggbgabgac ggbgaaaacc tcbgacacab gcagctcccg gagacggbca 60 
cagcbbgbcb gbaagcggat gccgggagca gacaagcccg bcagggcgcg bcagcgggbg 120 
ttggcgggbg bcggggcbgg cbbaacbabg cggcabcaga gcagabbgba cbgagagbgc 180 
accababgcg gbgbgaaaba ccgcacagab gcgbaaggag aaaataccgc abcagabtgg 24 0 
ctabbggcca bbgcabacgb bgbabccaba tcabaababg bacatbtaba bbggcbcabg 300 
tccaacabba ccgccabgtb gacattgabb abbgacbagt babtaabagb aatcaabbac 360 
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ggggbcabba gttcatagcc catafcatgga gttccgcgtt acabaacbba cggtaaatgg 42 0 

cccgcctggc tgaccgccca acgacccccg cccattgacg bcaabaabga cgtatgttcc 480 

catagtaacg ccaataggga cbbbccattg acgbcaabgg gbggagbabb tacggtaaac 540 

tgcccacttg gcagbacabc aagtgtatca batgccaagb acgcccccta ttgacgtcaa 600 

tgacggbaaa tggcccgcct ggcattatgc ccagtacatg acctbabggg acbtbccbac 660 

ttggcagtac abcbacgtat bagtcatcgc tattaccatg gtgatgcggt tbbggcagba 720 

catcaatggg cgtggatagc ggbbbgacbc acggggabtb ccaagtctcc accccabbga 780 

cgbcaatggg agbbbgbbbt ggcaccaaaa tcaacgggac bbbccaaaab gtcgtaacaa 840 

ctccgcccca ttgacgcaaa fcgggcggbag gcgbgbacgg tgggaggtct atataagcag 900 

agcbcgttba gtgaaccgbc agabcgcctg gagacgccat ccacgctgtt ttgacctcca 960 

tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 

tccccgtgcc aagagtgacg taagtaccgc cbatagacbc tataggcaca ccccbbtggc 1080 

tcttatgcat gcbabacbgb bbbbggcttg gggcctabac acccccgctt ccbtatgcba 1140 

taggtgatgg babagctbag cctataggtg bgggbbabbg accattattg accactcccc 1200 

tacbggtgac gatactttcc abtactaabc cataacatgg cbcbbbgcca caactatctc 1260 

tattggctat atgccaatac tcbgtccbtc agagactgac acggactctg tabbbbbaca 1320 

ggabggggtc ccabtbabta tttacaaatt cacatataca acaacgccgh cccccgtgcc 1380 

cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gbgttccgga 1440 

cabgggcbcb bcbccggbag cggcggagct bccacabccg agcccbggbc ccatgcctcc 1500 

agcggcbcat ggbcgcbcgg cagcbccbbg cbcctaacag bggaggccag acbtaggcac 1560 

agcacaabgc ccaccaccac cagtgtgccg cacaaggccg bggcggbagg gbatgtgbcb 1620 

gaaaabgagc gbggagatbg ggcbcgcacg gcbgacgcag atggaagacb baaggcagcg 1680 

gcagaagaag abgcaggcag ctgagbtgfct gbabtcbgab aagagtcaga ggtaacbccc 1740 

gbtgcggbgc bgbbaacggb ggagggcagb gbagbcbgag cagbacbcgb bgcbgccgcg 1800 

cgcgccacca gacataatag cbgacagact aacagacbgb bccbbtccab gggbctbbbc 1860 

bgcaghcacc gbcgtcgaca bggcaaagbg gagcaagagc agcgtgabcg gcbggcccgc 1920 

cgbgcgcgag cgcatgcgcc gcgccgagcc cgccgccgac ggcgtgggcg ccgtgagccg 1980 

cgacctggag aagcacggcg ccabcaccag cagcaacacc gccgccaaca acgccgcctg 2040 

cgccbggcbg gaggcccagg aggaggagga ggbgggcbbc cccgbgaccc cccaggtgcc 2100 

cctgcgcccc abgaccbaca aggccgccgb ggaccbgagc cacbbcctga aggagaaggg 2160 

cggccbggag ggccbgatcc acagccagcg ccgccaggac abccbggacc bgbggabcta 2220 

ccacacccag ggcbacbbcc ccgactggca gaacbacacc cccggccccg gcgbgcgcta 2280 

ccccctgacc btcggcbggb gctacaagcfc ggbgcccgtg gagcccgaca aggbggagga 2340 

ggccaacaag ggcgagaaca ccagcctgct gcaccccgbg agccbgcacg gcatggacga 24 00 

ccccgagcgc gaggbgcbgg agbggcgcbt cgacagccgc cbggccbbcc accacgtggc 2460 

ccgcgagctg caccccgagb acbbcaagaa chgcabggcc cgcgccagcg bgcbgagcgg 2 520 

cggcgagctg gaccgcbggg agaagabccg ccbgcgcccc ggcggcaaga agaagbacaa 25 80 

gcbgaagcac atcgbgbggg ccagccgcga gcbggagcgc bbcgccgtga accccggccb 2640 

gcbggagacc agcgagggcb gccgccagab ccbgggccag cbgcagccca gcctgcagac 2 700 

cggcagcgag gagctgcgca gccbgbacaa caccgbggcc acccbgtacb gcgtgcacca 2760 

gcgcabcgag abcaaggaca ccaaggaggc cctggacaag atcgaggagg agcagaacaa 2820 

gagcaagaag aaggcccagc aggccgccgc cgacaccggc cacagcaacc aggtgagcca 2880 

gaacbacccc atcgbgcaga acabccaggg ccagabggbg caccaggcca .bcagcccccg 294 0 

cacccbgaac gccbgggbga aggbggbgga ggagaaggcc btcagccccg aggbgabccc 3000 

catgtbcagc gcccbgagcg agggcgccac cccccaggac ctgaacacca bgcbgaacac 3 060 

cgbgggcggc caccaggccg ccabgcagat gctgaaggag accabcaacg aggaggccgc 3120 

cgagbgggac cgcgbgcacc ccgbgcacgc cggccccabc gcccccggcc agatgcgcga 3180 

gccccgcggc agcgacabcg ccggcaccac cagcacccbg caggagcaga bcggcbggab 3240 

gaccaacaac ccccccabcc ccgbgggcga gatcbacaag cgcbggabca bccbgggccb 33 00 

gaacaagabc gtgcgcabgt acagccccac cagcafcccbg gacabccgcc agggccccaa 3360 

ggagcccbbc cgcgacbacg bggaccgctb cbacaagacc ctgcgcgccg agcaggccag 3420 

ccaggaggbg aagaacbgga bgaccgagac ccbgcbggbg cagaacgcca accccgacbg 3480 

caagaccabc cbgaaggccc bgggccccgc cgccacccbg gaggagabga tgaccgccbg 3 54 0 

ccagggcgbg ggcggccccg gccacaaggc ccgcgbgcbg gccgaggcca tgagccaggt 3 600 

gaccaacagc gccaccabca tgabgcagcg cggcaacbbc cgcaaccagc gcaagatcgb 3660 
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gaagtgcttc aactgcggca aggagggcca caccgcccgc aacbgccgcg ccccccgcaa 3720 

gaagggctgc bggaagbgcg gcaaggaggg ccaccagabg aaggactgca ccgagcgaca 3780 

ggctaatagg gaagatctgg ccttcccaca agggaaggcc agggaabtbt cttcagagca 3840 

gaccagagcc aacagcccca ccagaagaga gcttcaggtt tggggaagag acaacaactc 3900 

cctctcagaa gcaggagccg atagacaagg aactgtatcc bbtagcbbcc ctcagatcac 3960 

tctttggcag cgacccctcg tcacaataaa gatagggggc cagctgaagg aggcccttct 4020 

agacaccggc gccgacgaca ccgbgcbgga ggagatgaac ctgcccggcc gctggaagcc 4080 

caagatgatc ggcggcabcg gcggcttcat caaggtgggc cagtacgacc agatcctgat 4140 

cgagatctgc ggccacaagg ccatcggcac cgtgctggtg ggccccaccc ccgtgaacat 4200 

catcggccgc aacctgctga cccagatcgg cbgcaccctg aacttcccca tcagccccat 4260 

cgagaccgtg cccgtgaagc tgaagcccgg catggacggc cccaaggtga agcagbggcc 432 0 

ccbgaccgag gagaagabca aggccctggt ggagabctgc accgagabgg agaaggaggg 43 80 

caagabcagc aagatcggcc ccgagaaccc ctacaacacc cccgtgttcg ccatcaagaa 4440 

gaaggacagc accaagtggc gcaagctggt ggacttccgc gagcbgaaca agcgcaccca 4500 

ggacttctgg gaggtgcagc bgggcatccc ccaccccgcc ggccbgaagc agaagaagag 4560 

cgtgaccgbg cbggacgbgg gcgacgccba cttcagcgbg ccccbggaca aggacbbccg 4620 

caagtacacc gccbtcacca tccccagcat caacaacgag acccccggca tccgcbacca 4680 

gbacaacgtg cbgccccagg gctggaaggg cagccccgcc atcbtccagt gcagcabgac 4740 

caagatcctg gagcccbbcc gcaagcagaa ccccgacabc gtgatcbacc agfcacabgga 4800 

ccaccbgtac gbgggcagcg accbggagat cggccagcac cgcaccaaga tcgaggagct 48 60 

gcgccagcac cbgcbgcgct ggggctbcac cacccccgac aagaagcacc agaaggagcc 4920 

cccctbcctg tggatgggcb acgagcbgca ccccgacaag tggaccgbgc agcccatcgb 4 980 

gctgcccgag aaggacagcb ggaccgbgaa cgacatccag aagcbggtgg gcaagctgaa 5040 

ctgggccagc cagabcbacg ccggcatcaa ggbgcgccag cbgtgcaagc tgcbgcgcgg 5100 

caccaaggcc ctgaccgagg bggtgcccct gaccgaggag gccgagcbgg agctggccga 5160 

gaaccgcgag abccbgaagg agcccgtgca cggcgbgbac bacgacccca gcaaggacct 5220 

gatcgccgag abccagaagc agggccaggg ccagbggacc baccagatct accaggagcc 5280 

cttcaagaac cbgaagaccg gcaagbacgc ccgcabgaag ggcgcccaca ccaacgacgb 534 0 

gaagcagcbg accgaggccg bgcagaagat cgccaccgag agcatcgbga bctggggcaa 54 00 

gacccccaag bbcaagctgc ccabccagaa ggagacctgg gaggcctggb ggaccgagta 54 60 

cbggcaggcc accbggatcc ccgagbggga gbtcgbgaac accccccccc tggtgaagcb 5520 

gtggbaccag ctggagaagg agcccatcab cggcgccgag accbbcbacg tggacggcgc 55 80 

cgccaaccgc gagaccaagc tgggcaaggc cggcbacgtg accgaccgcg gccgccagaa 564 0 

ggtggbgccc ctgaccgaca ccaccaacca gaagaccgag cbgcaggcca tccaccbggc 5700 

ccbgcaggac agcggccbgg aggbgaacab cgbgaccgac agccagbacg ccctgggcab 576 0 

catccaggcc cagcccgaca agagcgagag cgagctggtg agccagabca bcgagcagcb 5820 

gabcaagaag gagaaggbgt accbggccbg ggtgcccgcc cacaagggca bcggcggcaa 5880 

cgagcaggbg gacggccbgg bgagcgccgg catccgcaag gbgctgbbcc bggacggcab 5940 

cgacaaggcc caggaggagc acgagaagba ccacagcaac tggcgcgcca tggccagcga 6000 

cttcaacctg ccccccgbgg bggccaagga gabcgbggcc agctgcgaca agtgccagcb 6060 

gaagggcgag gccabgcacg gccaggtgga ctgcagcccc ggcatcbggc agctggcatg 6120 

cacccaccbg gagggcaagg tgabccbggb ggccgbgcac gbggccagcg gcbacatcga 6180 

ggccgaggtg abccccgccg agaccggcca ggagaccgcc bacbtccbgc tgaagctggc 6240 

cggccgcbgg cccgbgaaga ccgbgcacac cgacaacggc agcaacbbca ccagcaccac 63 00 

cgtgaaggcc gcctgctggt gggccggcat caagcaggag bbcggcabcc ccbacaaccc 63 60 

ccagagccag ggcgbgabcg agagcabgaa caaggagctg aagaagabca bcggccaggt 6420 

gcgcgaccag gccgagcacc bgaagaccgc cgbgcagabg gccgtgbbca bccacaactt 64 80 

caagcgcaag ggcggcabcg gcggcbacag cgccggcgag cgcabcgbgg acabcatcgc 6540 

caccgacabc cagaccaagg agcbgcagaa gcagabcacc aagabccaga acbbccgcgb 6600 

gbactaccgc gacagccgcg accccgbgtg gaagggcccc gccaagcbgc bgbggaaggg 6660 

cgagggcgcc gbggbgabcc aggacaacag cgacabcaag gbggtgcccc gccgcaaggc 6720 

caagatcatc cgcgactacg gcaagcagat ggccggcgac gacbgcgbgg ccagccgcca 6780 

ggacgaggac baggaabbcb tgctgtgcct bcbagtbgcc agccabctgb bgbtbgcccc 6840 

bcccccgtgc cbbccbbgac ccbggaaggt gccactccca cbgtccbtbc cbaabaaaab 6900 

gaggaaatbg catcgcabbg tcbgagbagg bgbcabbcba bbcbgggggg tggggbgggg 6960 
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cagcacagca agggggagga bbgggaagac 
bcbabgggta cccaggtgct gaagaattga 
gcacatcccc bbcbcbgbga cacaccctgt 
ctcataggac actcatagct caggagggct 
ggagcggtct cfcccctccct catcagccca 
agaaattaaa gcaagatagg ctattaagtg 
aggaagtaat gagagaaatc atagaatttc 
ctcggtcgtt cggctgcggc gagcggtatc 
cacagaatca ggggataacg caggaaagaa 
gaaccgtaaa aaggccgcgt tgctggcgtt 
tcacaaaaat cgacgctcaa gtcagaggtg 
ggcgtttccc cctggaagct ccctcgtgcg 
atacctgtcc gcctttctcc cttcgggaag 
gtatctcagt tcggtgtagg tcgttcgctc 
tcagcccgac cgctgcgcct tatccggtaa 
cgacttatcg ccactggcag cagccacbgg 
cggtgcbaca gagttcttga agtggtggcc 
tggtatctgc gcbctgcbga agccagttac 
cggcaaacaa accaccgctg gbagcggtgg 
cagaaaaaaa ggabcbcaag aagatccttt 
gaacgaaaac tcacgttaag ggattttggt 
gatcctttta aattaaaaat gaagtbbbaa 
gtctgacagt taccaatgct taatcagtga 
bbcatccaba gttgcctgac bccggggggg 
gtgttgctga ctcataccag gccbgaabcg 
cacggbbgab gagagctttg bbgbaggbgg 
ccacggaacg gtctgcgttg tcgggaagat 
ttcgatttafc tcaacaaagc cgccgtcccg 
caaccaabba accaattctg attagaaaaa 
abbcatabca ggattafccaa baccababbb 
aaactcaccg aggcagttcc ataggatggc 
tcgtccaaca tcaatacaac ctattaattt 
gaaatcacca tgagtgacga ctgaatccgg 
ccagacttgt tcaacaggcc agccattacg 
accgttattc attcgtgatt gcgcctgagc 
acaabbacaa acaggaatcg aatgcaaccg 
attttcacct gaatcaggat attcttctaa 
agtggtgagt aaccatgcat catcaggagt 
cataaattcc gtcagccagb ttagtctgac 
acctttgcca tgtttcagaa acaactctgg 
tgtcgcacct gattgcccga cabbabcgcg 
catgttggaa tttaatcgcg gcctcgagca 
accccttgta bbacbgbbba bgtaagcaga 
abcbbgtgca abgtaacatc agagabbbbg 
ttabbgaagc abttahcagg gfctabbgtcb 
gaaaaabaaa caaabagggg btccgcgcac 
agaaaccabt abbabcabga cabbaaccba 
be 
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aatagcaggc abgctgggga tgcggbgggc 7020 
cccggtbccb ccbgggccag aaagaagcag 7 080 
ccacgccccb ggbbcbbagt tccagcccca 714 0 
ccgcctbcaa bcccacccgc taaagbactb 7200 
ccaaaccaaa ccbagccbcc aagagbggga 7260 
cagagggaga gaaaabgeeb ccaacabgbg 7320 
bbccgcbbcc bcgcbcacbg acbcgcbgcg 7380 
agcbcactca aaggeggbaa baeggbbabe 7440 
cabgbgagca aaaggecage aaaaggccag 7500 
tttccatagg ctccgccccc cbgacgagca 7560 
gcgaaacccg acaggactab aaagabacca 7620 
cbctccbgbb ccgaccctgc cgcbbaccgg 7680 
egbggegebb bebcaabgeb caegctgbag 774 0 
caagebggge bgbgbgcacg aaccccccgt 7800 
cbategbebb gagtccaacc eggbaagaca 7860 
taacaggabt ageagagega ggbatgbagg 7920 
taacbaegge bacacbagaa ggacagtabt 7980 
ebbeggaaaa agagtbggba gebebbgabe 8040 
ttbbtbbgbb bgcaagcagc agabtacgeg 810O 
gabebbbbeb aeggggbebg aegebcagtg 8160 
catgagabba bcaaaaagga bcbbcaccba 8220 
atcaabctaa agbabababg agbaaacttg 8280 
ggcacctabc beagegatcb gbebabbbeg 8340 
999g9 c 9cbg aggbebgeeb egbgaagaag 84 00 
ccccabcabc cagecagaaa gbgagggagc 8460 
accagbbggb gatbtbgaac btbbgctbbg 8520 
gcgtgabctg abccbtcaac tcagcaaaag 8580 
bcaagbcagc gtaatgcbcb gecagbgbba 8640 
ctcabcgagc abcaaabgaa acbgcaatbt 8700 
tbgaaaaagc cgbbbcbgba abgaaggaga 8760 
aagabccbgg tabeggbebg cgatbccgac 8820 
ccccbcgbca aaaabaaggb batcaagbga 8880 
bgagaabggc aaaagebbab gcatbbcbtt 8940 
ctcgbcabca aaabcacbeg cabcaaccaa 9000 
gagacgaaab acgcgabcgc bgbbaaaagg 9060 
gegcaggaac acbgccagcg cabcaacaab 9120 
taccbggaat gcbgbbbbcc eggggabege 9180 
aeggabaaaa tgctbgatgg beggaagagg 924 0 
cabcbcatcb gbaacabcab bggcaacgcb 9300 
cgcabcgggc btcccabaca atcgabagab 9360 
ageccabbba bacccababa aabcagcabc 9420 
agaegbbbec egbbgaabab ggebcataac 9480 
cagbbbbabb gbbcabgabg atababtbbb 9540 
agacacaacg bggcbtbccc ccccccccca 9600 
cabgagegga bacababbbg aabgbatbta 9660 
abtbccccga aaagtgccac cbgaegbeba 9720 
baaaaabagg egbatcaega ggccctbbcg 9780 

9782 



<210> 174 
<211> 9788 
<212> DNA 

<213> Arbificial Sequence 
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<220> 

<223> plasmid pVRI012x/s containing HIV genes 
<400> 174 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcbbgbct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
bbggcgggbg tcggggctgg ctbaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagab gcgtaaggag aaaabaccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacabbbaba ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt hahtaatagt aatcaattac 360 
ggggbcabba gttcatagcc catatatgga gttccgcgtt acataactba cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaabaabga cgtatgttcc 480 
catagtaacg ccaataggga ctttccabbg acgbcaatgg gbggagbatb tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta bbgacgbcaa 600 
tgacggtaaa tggcccgcct ggcabtabgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac abcbacgbab tagtcatcgc babtaccabg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggbttgacbc acggggabtb ccaagtctcc accccattga 780 
cgtcaatggg agbbbgbbbb ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca bbgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agahcgcctg gagacgccat ccacgctgbb bbgaccbcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagbgacg baagbaccgc ctatagactc bataggcaca ccccbbbggc 1080 
tcttatgcat gctatactgt bbbbggcbbg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tabagcbtag cctataggbg bgggbbabtg accabbatbg accacbcccc 1200 
tabbggbgac gabactttcc atbacbaabc cataacabgg cbcttbgcca caactatcbc 1260 
battggctat atgccaabac tcbgbccbbc agagactgac acggactctg tattbtbaca 1320 
ggabggggbc ccabbbabba tbbacaaatb cacababaca acaacgccgb cccccgbgcc 1380 
cgcagbbbbb abbaaacaba gcgtgggabc bccacgcgaa bcbcgggbac gbgbbccgga 1440 
catgggctcb bctccggbag cggcggagcb bccacatccg agcccbggbc ccabgccbcc 1500 
agcggctcat ggbcgcbcgg cagcbccbbg cbccbaacag bggaggccag acbbaggcac 1560 
agcacaabgc ccaccaccac cagbgbgccg cacaaggccg bggcggbagg gbabgbgbcb 1620 
gaaaatgagc gbggagabtg ggcbcgcacg gcbgacgcag abggaagacb baaggcagcg 1680 
gcagaagaag abgcaggcag cbgagbbgbt gbatbcbgab aagagbcaga ggbaactccc 1740 
gbbgcggbgc tgbbaacggb ggagggcagb gbagtctgag cagtacbcgt bgcbgccgcg 1800 
cgcgccacca gacabaabag cbgacagacb aacagactgt bccbtbccab gggbcttbbc 1860 
tgcagbcacc gtcgtcgaca bgggcgcccg cgccagcabc cbgcgcggcg gcaagcbgga 1920 
cgccbgggag cgcabcaagc bgaagcccgg cggcaagaag cacbacabga bgaagcaccb 1980 
ggbgtgggcc agccgcgagc bggagcgcbb cgcccbggac cccggccbgc bggagaccag 2 04 0 
cgagggcbgc aagcagabca tgaagcagct gcagcccgcc ctgcagaccg gcaccaagga 2100 
gcbgatcagc cbgcacaaca ccgbggccac cctgbacbgc gtgcacgaga agatcgacgb 2160 
gcgcgacacc aaggaggccc bggacaagat caaggaggag cagaacaaga gccagcagaa 2220 
gacccagcag gccgaggccg ccgacaaggg caaggbcjagc cagaactacc ccabcgbgca 2280 
gaacctgcag ggccagabgg bgcaccagcc cabcagcgcc cgcacccbga acgccbgggb 2340 
gaaggbggbg gaggagaagg ccbbcagccc cgaggbgabc cccabgbbca ccgccctgag 24 00 
cgagggcgcc accccccagg accbgaacac cabgcbgaac accgtgggcg gccaccaggc 2460 
cgccabgcag abgcbgaagg acaccatcaa cgaggaggcc gccgagbggg accgccbgca 2520 
ccccgbgcac gccggccccg bggcccccgg ccagabgcgc gagccccgcg gcagcgacab 2580 
cgccggcacc accagcaccc bgcaggagca gabcaccbgg atgaccaaca acccccccgt 2640 
gcccgbgggc gacatcbaca agcgcbggab cabccbgggc ctgaacaaga tcgbgcgcat 2700 
gtacagcccc gbgagcabcc bggacabcaa gcagggcccc aaggagcccb tccgcgacba 2760 
cgtggaccgc tbcbbcaaga cccbgcgcgc cgagcaggcc acccaggacg tgaagaacbg 2820 
gatgaccgac accctgcbgg bgcagaacgc caaccccgac bgcaagacca tcctgcgcgc 2880 
cctgggcccc ggcgccagcc bggaggagab gatgaccgcc bgccagggcg bgggcggccc 2940 
cggccacaag gcccgcgtgc bggccgaggc catgagcaag gtgaacaaca ccaacabcat 3000 
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gatgcagcgc agcaactgca agggccccaa 
ggagggccac ctggcccgca actgccgcgc 
caaggagggc caccaggtga aggactgcac 
cbbcccacaa gggaaggcca gggaattttc 
cagaagagag cttcaggttt ggggaagaga 
tagacaagga actgtatcct ttagcttccc 
cacaabaaag abagggggcc agctgaagga 
cgbgcfcggag gagatgaacc tgcccggccg 
cggcbbcabc aaggtgggcc agtacgacca 
catcggcacc gtgctggtgg gccccacccc 
ccagatcggc tgcaccctga actbccccab 
gaagcccggc atggacggcc ccaaggbgaa 
ggccctggtg gagatctgca ccgagatgga 
cgagaacccc bacaacaccc ccgtgttcgc 
caagctggtg gacttccgcg agctgaacaa 
gggcatcccc caccccgccg gcctgaagca 
cgacgcctac ttcagcgtgc ccctggacaa 
ccccagcatc aacaacgaga cccccggcat 
ctggaagggc agccccgcca tcttccagtg 
caagcagaac cccgacatcg tgatctacca 
cctggagatc ggccagcacc gcaccaagat 
gggcttcacc acccccgaca agaagcacca 
cgagctgcac cccgacaagt ggaccgtgca 
gaccgtgaac gacatccaga agctggtggg 
cggcabcaag gtgcgccagc tgtgcaagct 
ggtgcccctg accgaggagg ccgagctgga 
gcccgtgcac ggcgtgtact acgaccccag 
gggccagggc cagtggacch accagafccta 
caagtacgcc cgcatgaagg gcgcccacac 
gcagaagatc gccaccgaga gcatcgtgat 
catccagaag gagacctggg aggcctggtg 
cgagtgggag ttcgbgaaca ccccccccct 
gcccatcatc ggcgccgaga ccbtctacgb 
gggcaaggcc ggcbacgbga ccgaccgcgg 
caccaaccag aagaccgagc tgcaggccat 
ggtgaacabc gtgaccgaca gccagtacgc 
gagcgagagc gagctggtga gccagabcab 
cctggcctgg gtgcccgccc acaagggcat 
gagcgccggc atccgcaagg tgctgttcct 
cgagaagtac cacagcaact ggcgcgccat 
ggccaaggag abcgtggcca gctgcgacaa 
ccaggtggac tgcagccccg gcabctggca 
gatcctggtg gccgtgcacg tggccagcgg 
gaccggccag gagaccgcct acttcctgct 
cgbgcacacc gacaacggca gcaacttcac 
ggccggcabc aagcaggagb tcggcatccc 
gagcabgaac aaggagctga agaagatcat 
gaagaccgcc gtgcagatgg ccgtgttcat 
cggctacagc gccggcgagc gcabcgbgga 
gptgcagaag cagatcacca agatccagaa 
ccccgtgtgg aagggccccg ccaagctgct 
ggacaacagc gacatcaagg tggtgccccg 
caagcagatg gccggcgacg actgcgtggc 
abggagcaag tgtagcactg tgggccgccc 
cgcagccgaa ggagbgggbc cagcttcaca 



gcgcaccabc aagbgcbtca actgcggcaa 3060 
cccccgcaag aagggctgch ggaagbgcgg 312 0 
cgagcgacag gcbaabaggg aagatctggc 3180 
ttcagagcag accagagcca acagccccac 3240 
caacaactcc ctctcagaag caggagccga 3300 
tcagabcacb ctttggcagc gaccccbcgt 3360 
ggcccbbcta gacaccggcg ccgacgacac 3420 
ctggaagccc aagatgatcg gcggcatcgg 3480 
gatcctgatc gagabcbgcg gccacaaggc 3540 
cgtgaacatc atcggccgca acctgctgac 3600 
cagccccatc gagaccgtgc ccgtgaagct 3 660 
gcagtggccc ctgaccgagg agaagatcaa 3720 
gaaggagggc aagatcagca agatcggccc 3780 
cabcaagaag aaggacagca ccaagtggcg 3840 
gcgcacccag gacttctggg aggtgcagct 3900 
gaagaagagc gtgaccgtgc tggacgtggg 3960 
ggacttccgc aagfcacaccg ccttcaccafc 4020 
ccgctaccag tacaacgtgc tgccccaggg 4 080 
cagcatgacc aagatcctgg agcccttccg 4140 
gbacabggac cacctgtacg tgggcagcga 4200 
cgaggagctg cgccagcacc tgctgcgctg 4260 
gaaggagccc cccttcctgt ggatgggcta 4320 
gcccatcgtg ctgcccgaga aggacagctg 43 80 
caagctgaac tgggccagcc agatctacgc 4440 
gctgcgcggc accaaggccc tgaccgaggt 45 00 
gctggccgag aaccgcgaga tcctgaagga 4560 
caaggacctg atcgccgaga tccagaagca 4620 
ccaggagccc ttcaagaacc tgaagaccgg 4 6 80 
caacgacgtg aagcagctga ccgaggccgt 474 0 
ctggggcaag acccccaagt tcaagctgcc 4800 
gaccgagtac tggcaggcca cctggatccc 4 8 60 
ggtgaagchg bggtaccagc tggagaagga 4 920 
ggacggcgcc gccaaccgcg agaccaagct 4980 
ccgccagaag gtggtgcccc tgaccgacac 5040 
ccacctggcc ctgcaggaca gcggcctgga 5100 
cctgggcatc atccaggccc agcccgacaa 5160 
cgagcagctg atcaagaagg agaaggtgta 5220 
cggcggcaac gagcaggtgg acggcctggt 52 80 
ggacggcatc gacaaggccc aggaggagca 5340 
ggccagcgac ttcaacctgc cccccgtggt 5400 
gtgccagctg aagggcgagg ccatgcacgg 54 60 
gctggcatgc acccacctgg agggcaaggt 5520 
ctacatcgag gccgaggtga tccccgccga 5580 
gaagctggcc ggccgctggc ccgtaaagac 5640 
cagcaccacc gtgaaggccg cctgcbggtg 5700 
ctacaacccc cagagccagg gcgtgatcga 5760 
cggccaggbg cgcgaccagg ccgagcaccb 5820 
ccacaacbbc aagcgcaagg gcggcabcgg 5880 
cabcabcgcc accgacabcc agaccaagga 5940 
cbbccgcgbg bacbaccgcg acagccgcga 6000 
gtggaagggc gagggcgccg bggbgatcca 6060 
ccgcaaggcc aagabcabcc gcgacbacgg 6120 
cagccgccag gacgaggaca bgggbaabaa 6180 
bgccatcagg gagagaabgc gccgcgcccc 6240 
ggabagbgac aaabacggcg cbcbgacabc 63 00 
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cagcagbacc cccgctaaca acgccgactg tgcatggchc gaagcccagc aggaagaaga 6360 
ggaggbcgga bbcccbgbga ggcctcaggt tcctctcaga cchatgactt acaaggccgt 6420 
ggbcgacttg tctthtttcc ttgaagagaa gggaggcttg gaggggctga tttacagcaa 6480 
gaagcgccag gatattttgg attfcgtgggt ctacaacaca cagggttact tccccgattg 6540 
gcagaactac acccccggcc caggcgtgcg gbbtccbctb accttbggtb ggtgcbbbaa 6600 
gctcgtgcct gbcgabccba gagaagtgga ggaagcaaac acaggcgaga acaacagtct 6660 
gctgcatcct atgagccttc abggcabgga ggabbctcab cgggaagtgc bbcagbggaa 6720 
attcgatagc ttgctcgcta gaagacacat ggcbcgggag tbgcabccbg agtatbabaa 6780 
ggabbgbbga acacgtggcg aattcttgct gtgccttcta gbbgccagcc atctgttgtt 6840 
tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa 6900 
taaaabgagg aaabbgcabc gcattgbcbg agbaggbgbc abbcbabbcfc ggggggbggg 6960 
gtggggcagc acagcaaggg ggaggatbgg gaagacaata gcaggcabgc bggggatgcg 7020 
gtgggcbcba tgggtaccca ggtgcbgaag aattgacccg gbbcctccbg ggccagaaag 7080 
aagcaggcac -abccccttct ctgtgacaca ccctgtccac gcccctggtt cfctagbbcca 7140 
gccccactca taggacactc atagcbcagg agggcbccgc cttcaatccc acccgctaaa 7200 
gtacttggag cggtcbctcc ctccctcatc agcccaccaa accaaaccta gcctccaaga 7260 
gtgggaagaa atbaaagcaa gataggcbab baagbgcaga gggagagaaa atgccbccaa 7320 
cabgbgagga agtaabgaga gaaatcabag aatbbcbtcc gcbbccbcgc tcactgactc 7380 
gctgcgctcg gtcgttcggc tgcggcgagc ggbatcagcb cactcaaagg cggtaatacg 7440 
gtbatccaca gaatcagggg ataacgcagg aaagaacabg tgagcaaaag gccagcaaaa 7500 
ggccaggaac cgtaaaaagg ccgcgttgcb ggcgtttttc cataggctcc gcccccctga 7560 
cgagcatcac aaaaabcgac gctcaagbca gaggbggcga aacccgacag gacbataaag 7620 
ataccaggcg tttcccccbg gaagctcccb cgtgcgcbct ccbgbbccga cccbgccgct 7680 
taccggabac ctgtccgcct tbcbcccbbc gggaagcgbg gcgctbbcbc aabgcbcacg 7740 
ctgtaggtab cbcagtbcgg bgtaggtcgb tcgctccaag ctgggctgbg tgcacgaacc 7800 
ccccgbtcag cccgaccgct gcgccbtatc cggtaactab cgtcttgagt ccaacccggt 7860 
aagacacgac btatcgccac bggcagcagc cactggbaac aggabbagca gagcgaggba 7920 
tgbaggcggb gcbacagagb bcbbgaagtg gbggccbaac bacggctaca ctagaaggac 7980 
agbabbbggb abcbgcgcbc bgcbgaagcc agbbaccbbc ggaaaaagag tbggbagcbc 8040 
ttgatccggc aaacaaacca ccgcbggbag cggbggbttb btbgttbgca agcagcagab 8100 
tacgcgcaga aaaaaaggat ctcaagaaga tccbttgabc ttbbcbacgg ggtcbgacgc 816 0 
tcagtggaac gaaaactcac gttaagggab tbtggbcabg agattabcaa aaaggatcbb 8220 
cacctagatc ctbttaaatt aaaaatgaag ttbtaaatca atcbaaagta tatabgagta 8280 
aacttggbct gacagtbacc aabgcbbaab cagbgaggca cctatcbcag cgatcbgbcb 8340 
abttcgbtca bccatagbtg ccbgactccg gggggggggg gcgcbgaggt ctgccbcgbg 84 00 
aagaaggbgt tgctgacbca taccaggccb gaabcgcccc abcabccagc cagaaagbga 8460 
gggagccacg gbbgabgaga gcbbtgttgt aggbggacca gbbggtgatb bbgaacbbbt 8520 
gcbttgccac ggaacggtcb gcgbbgbcgg gaagatgcgb gabcbgatcc ttcaacbcag 8580 
caaaagbbcg abbtabtcaa caaagccgcc gtcccgbcaa gtcagcgbaa tgctctgcca 8640 
gtgbbacaac caabbaacca atbctgatta gaaaaacbca bcgagcabca aatgaaactg 8700 
caabbtattc ababcaggab babcaatacc atabbbbbga aaaagccgtt tctgbaabga 8760 
aggagaaaac tcaccgaggc agbbccabag gabggcaaga bcctggbatc ggtctgcgab 8820 
tccgacbcgb ccaacabcaa tacaacctab taabtbcccc tcgbcaaaaa baaggttatc 8880 
aagtgagaaa tcaccabgag bgacgacbga abccggtgag aabggcaaaa gcttabgcat 894 0 
ttctbtccag acttgttcaa caggccagcc atbacgctcg tcabcaaaab cactcgcabc 9000 
aaccaaaccg tbabbcabbc gtgabbgcgc cbgagcgaga cgaaabacgc gatcgcbgbt 9060 
aaaaggacaa ttacaaacag gaabcgaatg caaccggcgc aggaacactg ccagcgcabc 9120 
aacaatabbt tcaccbgaab caggababbc tbcbaabacc bggaabgcbg bbttcccggg 9180 
gabcgcagtg gfcgagbaacc abgcabcabc aggagbacgg abaaaatgcb bgatggbcgg 9240 
aagaggcata aabbccgtca gccagbbbag tctgaccabc bcatcbgbaa catcatbggc 9300 
aacgcbaccb bbgccabgbb bcagaaacaa cbcbggcgca tcgggcbtcc catacaabcg 9360 
atagatbgtc gcaccbgabb gcccgacatt abcgcgagcc cabbbabacc cababaaabc 9420 
agcabccabg btggaabtta atcgcggccb cgagcaagac gbbbcccgbb gaatabggcb 94 80 
cabaacaccc ctbgbattac bgtttabgba agcagacagb bbbatbgbbc abgatgatat 9540 
atbttbabcb tgtgcaabgb aacabcagag abbbbgagac acaacgbggc bttccccccc 9600 
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cccccattat bgaagcatbt atcagggtta ttgtctcatg agcggataca tabbtgaabg 9660 

tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 9720 

cgbctaagaa . accattabta tcatgacabb aacctataaa aataggcgta tcacgaggcc 9780 
ctttcgtc 9788 



<210> 175 
<211> 9407 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 175 

bcgcgcgbtt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagctbgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatabgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240 
cbabtggcca bbgcatacgt tgtatccata tcataatatg tacattbata btggcbcatg 300 
bccaacabba ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgbt acataactba cggbaaabgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgbtcc 480 
cabagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta btgacgbcaa 600 
tgacggtaaa tggcccgccfc ggcabbabgc ccagtacatg accbbabggg acbttccbac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgbggatagc ggtttgactc acggggahtt ccaagtctcc accccattga 780 
cgtcaatggg agbbtgtbbt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa bgggcggbag gcgtgtacgg tgggaggtct atataagcag 900 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgcbgtt btgacctcca 960 
tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca ccccbttggc 1080 
tcttabgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgafcgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tafctggtgac gahactttcc attactaatc cacaacacgg ctctttgcca caactatctc 1260 
tattggcfcab atgccaatac tctgtccttc agagactgac acggactcfcg tatttttaca 1320 
ggatggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 1380 
cgcagttttb attaaacaba gcgtgggabc tccacgcgaa bctcgggtac gbgbtccgga 144 0 
cabgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccabgccbcc 150 0 
agcggcbcat ggtcgctcgg cagcbccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaabgc ccaccaccac cagtgtgccg cacaaggccg tggcggbagg gbabgbgtct 1620 
gaaaabgagc gtggagabtg ggctcgcacg gctgacgcag atggaagact baaggcagcg 1680 
gcagaagaag abgcaggcag ctgagttgbb gtattcbgat aagagbcaga ggbaactccc 174 0 
gttgcggtgc tgttaacggb ggagggcagb gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataabag ctgacagacb aacagactgt tcctttccat gggbcbbtbc 1860 
bgcagtcacc gtcgtcgaca bggcccgcgc cagcgbgcbg agcggcggcg agctggaccg 192 0 
ctgggagaag atccgcctgc gccccggcgg caagaagaag tacaagctga agcacatcgt 1980 
gtgggccagc cgcgagcbgg agcgcttcgc cgtgaacccc ggcctgctgg agaccagcga 204 0 
gggctgccgc cagatccbgg gccagctgca gcccagcctg cagaccggca gcgaggagcb 2100 
gcgcagcctg tacaacaccg tggccaccct gtactgcgtg caccagcgca. tcgagatcaa 2160 
ggacaccaag gaggccctgg acaagatcga ggaggagcag aacaagagca agaagaaggc 2220 
ccagcaggcc gccgccgaca ccggccacag caaccaggbg agccagaacb accccabcgt 22 80 
gcagaacabc cagggccaga bggtgcacca ggccabcagc ccccgcaccc bgaacgccbg 2340 
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ggtgaaggtg gtggaggaga aggccttcag ccccgaggtg atccccatgt tcagcgccct 2400 
gagcgagggc gccacccccc aggacctgaa caeca tgctg aacaccgtgg gcggccacca 24 60 
ggccgccatg cagatgetga aggagaccat caacgaggag gccgccgagt gggaccgcgt 2 520 
gcaccccgtg cacgccggcc ccatcgcccc eggecagatg cgcgagcccc geggcagega 25 80 
catcgccggc accaccagca ccctgcagga geagategge tggatgacca acaacccccc 2640 
catccccgtg ggegagatet acaagegctg gatcatcctg ggectgaaca agatcgtgcg 2700 
catgtacagc cccaccagca tcctggacat ccgccagggc cccaaggagc ccttccgcga 2760 
ctacgtggac cgcttctaca agaccctgcg cgccgagcag gccagccagg aggtgaagaa 2 820 
ctggatgacc gagaccctgc tggtgcagaa cgccaacccc gaetgeaaga ccatcctgaa 2880 
ggccctgggc cccgccgcca ccctggagga gatgatgacc gcctgccagg gcgtgggcgg 2 94 0 
ccccggccac aaggcccgcg tgctggccga ggecatgage caggtgacca acagcgccac 3000 
catcatgatg cagcgcggca acttccgcaa ccagcgcaag atcgtgaagt gcttcaactg 3060 
eggcaaggag ggccacaccg cccgcaactg ccgcgccccc cgcaagaagg gctgctggaa 3120 
gtgcggcaag gagggccacc agatgaagga ctgcaccgag cgacaggcta ataacgccgc 3180 
ctgcgcctgg ctggaggccc aggaggagga ggaggtgggc ttccccgtga ccccccaggt 324 0 
gcccctgcgc cccatgacct acaaggccgc cgtggacctg agccacttcc tgaaggagaa 3300 
gggcggcctg gagggectga tccacagcca gcgccgccag gacatcctgg acctgtggat 3360 
ctaccacacc cagggctact tccccgactg gcagaactac acccccggcc ccggcgtgcg 3420 
ctaccccctg accttcggct ggtgctacaa gctggtgccc gtggagcccg acaaggtgga 3480 
ggaggecaac aagggegaga acaccagcct gctgcacccc gtgagcctgc acggcatgga 3540 
cgaccccgag cgcgaggtgc tggagtggcg cttcgacagc cgcctggcct tccaccacgt 3 600 
ggcccgcgag ctgcaccccg agtacttcaa gaactgcgcc cttctagaca ccggcgccga 3660 
cgacaccgtg ctggaggaga tgaacctgcc cggccgctgg aageccaaga tgateggegg 3720 
catcggcggc ttcatcaagg tgggccagta cgaccagatc ctgatcgaga tctgcggcca 37 80 
caaggccabc ggcaccgtgc tggtgggccc cacccccgtg aacatcatcg gccgcaacct 3840 
gctgacccag ateggctgea ccctgaactt ccccatcagc cccatcgaga ccgtgcccgt 3900 
gaagctgaag cccggcatgg acggccccaa ggtgaagcag tggcccctga ccgaggagaa 3960 
gatcaaggee cfcggtggaga tctgcaccga gatggagaag gagggcaaga tcagcaagat 4020 
cggccccgag aacccctaca acacccccgt gttcgccatc aagaagaagg acagcaccaa 4080 
gtggcgcaag ctggtggact tccgcgagct gaacaagege acccaggact tctgggaggt 4140 
gcagctgggc atcccccacc ccgccggcct gaagcagaag aagagcgtga ccgtgctgga 42 00 
cgtgggcgac gcctacttca gcgtgcccct ggacaaggac ttccgcaagt acaccgcctt 4260 
caccatcccc agcatcaaca acgagacccc cggcatccgc taccagtaca acgtgctgcc 4320 
ccagggctgg aagggcagee ccgccatctt ccagtgcagc atgaccaaga tcctggagcc 43 80 
cttccgcaag cagaaccccg acategtgat ctaccagtac atggaccacc tgtacgtggg 4440 
cagcgacctg gagateggee agcaccgcac caagatcgag. gagctgcgcc agcacctgct 4 500 
gcgctggggc ttcaccaccc ccgacaagaa gcaccagaag gagcccccct hcctgtggat 4560 
gggctacgag ctgcaccccg acaagtggac cgtgcagccc atcgtgctgc ccgagaagga 4 620 
cagctggacc gtgaacgaca tccagaagct ggtgggcaag ctgaacbggg ccagccagat 4 680 
cbacgccggc atcaaggtgc gccagctgtg caagctgetg cgcggcacca aggccctgac 474 0 
c 9 a 99bggtg cccctgaccg aggaggcega gctggagctg gccgagaacc gegagatect 4 800 
gaaggagece gtgcacggcg tgtactacga ccccagcaag gacctgatcg ccgagatcca 4 860 
gaagcagggc cagggecagt ggacctacca gatctaccag gagcccttca agaacctgaa 492 0 
gaeeggcaag bacgcccgca tgaagggege ccacaccaac gaegtgaage agctgaccga 4 980 
ggccgtgcag aagatcgeca ccgagagcat cgtgatctgg ggcaagaccc ccaagttcaa 5 040 
gcbgcccatc cagaaggaga cctgggaggc ctggtggacc gagtactggc aggccacctg 5100 
gatccccgag tgggagttcg tgaacacccc ccccctggbg aagctgbggt accagctgga 5160 
9 aa 99 a 9 c cc ateateggeg ccgagacctt ctacgtggac ggcgccgcca accgcgagac 5220 
caagctgggc aaggcegget aegtgacega ccgcggccgc cagaaggtgg tgcccctgac 5280 
cgacaccacc aaccagaaga ccgagctgca ggccatccac ctggccctgc aggacagegg 5340 
cctggaggtg aacatcgtga ccgacagcca gtacgccctg ggcatcatcc aggcccagcc 5400 
cgacaagagc gagagegage tggtgagcca gatcatcgag cagctgatca agaaggagaa 5460 
ggtgtacctg gcctgggtgc ccgcccacaa gggcategge ggcaacgagc aggtggacgg 5520 
cctggtgagc gccggcatcc geaaggtget gttcctggac ggcatcgaca aggeccagga 55 80 
ggagcacgag aagtaccaca gcaactggcg cgccatggcc agegacttea acctgccccc 564 0 
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cgtggtggcc aaggagatcg bggccagcbg cgacaagtgc cagctgaagg gcgaggccat 570 0 
gcacggccag gtggactgca gccccggcat ctggcagctg gcatgcaccc acctggaggg 5760 
caaggbgabc ctggtggccg tgcacgtggc cagcggchac atcgaggccg aggtgatccc 5820 
cgccgagacc ggccaggaga ccgcctactt cctgctgaag ctggccggcc gctggcccgt 5880 
gaagaccgtg cacaccgaca acggcagcaa cttcaccagc accaccgtga aggccgccbg 594 0 
cbggbgggcc ggcatcaagc aggagttcgg catcccctac. aacccccaga gccagggcgt 6000 
gatcgagagc atgaacaagg agcbgaagaa gatcatcggc caggtgcgcg accaggccga 6060 
gcacctgaag accgccgtgc agatggccgt gttcatccac aacbbcaagc gcaagggcgg 6120 
catcggcggc tacagcgccg gcgagcgcat cgbggacabc atcgccaccg acatccagac 6180 
caaggagctg cagaagcaga tcaccaagab ccagaacttc cgcgtgtact accgcgacag 6240 
ccgcgacccc gtgtggaagg gccccgccaa gcbgctgbgg aagggcgagg gcgccgtggt 63 00 
gatccaggac aacagcgaca tcaaggtggt gccccgccgc aaggccaaga tcatccgcga 6360 
ctacggcaag cagatggccg gcgacgactg cgbggccagc cgccaggacg aggactagga 64 2 0 
attcttgctg tgccttcbag bbgccagcca tcbgbbgtbb gcccctcccc cgtgccttcc 6480 
ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga aabtgcabcg 6540 
cabbgbctga gtaggtgtca bbcbatbcbg gggggtgggg tggggcagca cagcaagggg 6600 
gaggattggg aagacaatag caggcabgcb ggggatgcgg tgggctctah gggtacccag 6660 
gtgctgaaga attgacccgg bbccbccbgg gccagaaaga agcaggcaca tccccttctc 6720 
tgtgacacac cctgtccacg cccctggttc ttagttccag ccccactcat aggacactca 6780 
tagctcagga gggctccgcc ttcaatccca cccgctaaag tacttggagc ggbctctccc 6840 
tccctcatca gcccaccaaa ccaaacctag ccfcccaagag tgggaagaaa bbaaagcaag 6900 
ataggctatt aagtgcagag ggagagaaaa bgcctccaac atgtgaggaa gtaatgagag 6960 
aaatcataga atttcttccg cttcctcgct cacbgacbcg ctgcgctcgg bcgbbcggcb 7020 
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 7080 
baacgcagga aagaacabgb gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 7140 
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 7200 
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 7260 
aagctccctc gbgcgcbctc ctgttccgac cctgccgcbb accggabacc tgtccgcctt 7320 
tctcccbtcg ggaagcgtgg cgctttctca atgctcacgc bgbaggbabc tcagttcggt 7380 
gbaggbcgbb cgctccaagc bgggcbgbgb gcacgaaccc cccgbbcagc ccgaccgctg 7440 
cgccbbatcc ggbaacbatc gbcbtgagbc caacccggba agacacgacb batcgccacb 7500 
ggcagcagcc actggbaaca ggabtagcag agcgaggbab gbaggcggtg cbacagagbb 7560 
cttgaagtgg bggccbaacb acggcbacac bagaaggaca gbabbbggba bctgcgcbcb 7620 
gcbgaagcca gbtaccbbcg gaaaaagagb bggbagcbcb tgabccggca aacaaaccac 7680 
cgctggbagc ggtggtbbbb bbgbbbgcaa gcagcagabb acgcgcagaa aaaaaggabc 7740 
bcaagaagat ccbbtgabcb bbbcbacggg gbcbgacgct cagbggaacg aaaacbcacg 7800 
bbaagggabb bbggbcabga gatbatcaaa aaggabcttc accbagabcc btbbaaabta 7860 
aaaabgaagb bbbaaatcaa bcbaaagbab atatgagbaa acbtggbcbg acagbbacca 7920 
abgcbbaabc agtgaggcac cbatctcagc gabctgbcba bbtcgbbcat ccatagbbgc 7980 
cbgacbccgg gggggggggg cgcbgaggbc bgcctcgbga agaaggbgtt gcbgacbcab 8040 
accaggccbg aabcgcccca bcabccagcc agaaagbgag ggagccacgg btgatgagag 8100 
cbbbgbtgta ggbggaccag bbggbgabtb bgaacbbbbg cbbbgccacg gaacggbctg 8160 
cgbbgbcggg aagabgcgbg abcbgabccb bcaactcagc aaaagbbcga ttbatbcaac 8220 
aaagcegccg bcccgbcaag bcagcgbaab gcbctgccag bgbtacaacc aabbaaccaa 8280 
bbcbgabbag aaaaacbcab cgagcabcaa abgaaacbgc aatbbabbca batcaggatt 8340 
abcaabacca tatbbbbgaa aaagccgbtb cbgbaabgaa ggagaaaact caccgaggca 8400 
gtbccabagg abggcaagab cctggbabcg gtcbgcgabb ccgacbcgbc caacabcaab 8460 
acaaccbatb aabtbccccb cgtcaaaaab aaggbbatca agbgagaaat caccabgagb 8520 
gacgacbgaa bccggbgaga abggcaaaag cbtabgcatb bctbtccaga cbbgbbcaac 85B0 
aggccagcca bbacgcbcgb catcaaaabc acbcgcatca accaaaccgt babtcabbcg 8640 
bgabbgcgcc bgagcgagac gaaabacgcg abcgcbgbba aaaggacaab bacaaacagg 87 00 
aabcgaabgc aaccggcgca ggaacacbgc cagcgcabca acaatabbbb caccbgaabc 8760 
aggababbcb bcbaataccb ggaabgcbgb btbcccgggg abcgcagbgg bgagbaacca 8820 
bgcabcabca ggagbacgga baaaabgcbb gabggbcgga agaggcabaa abbccgbcag 8880 
ccagbbbagb cbgaccabcb cabcbgbaac atcabbggca acgcbaccbb bgccatgbbb 8940 
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cagaaacaac tctggcgcat cgggcttccc atacaatcga tagattgtcg cacctgattg 9000 

cccgacafcta tcgcgagccc atttataccc atataaatca gcatccatgt tggaatttaa 9060 

tcgcggcctc gagcaagacg tttcccgttg aatatggctc ataacacccc ttgtattact 9120 

gtttatgtaa gcagacagtt ttattgttca tgatgatata tbtttatctt gtgcaatgta 9180 

acatcagaga tttbgagaca caacgtggct ttcccccccc ccccattatt gaagcattta 9240 

tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 9300 

aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat 9360 

catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtc 9407 



<210> 17.6 
<211> 9785 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid pVR1012x/s containing HIV genes 
<400> 176 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 18 0 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 24 0 
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300 
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360 
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420 
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480 
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540 
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600 
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660 
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720 
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780 
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840 
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 90 0 
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960 
fcagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat 1020 
tccccgtgcc aagagtgacg taagtaccgc ctatagactc tataggcaca cccctttggc 1080 
tcttafcgcat gctatactgt ttttggcttg gggcctatac acccccgctt ccttatgcta 1140 
taggtgatgg tatagcttag cctataggtg tgggttattg accattattg accactcccc 1200 
tattggtgac gatactttcc attactaatc cataacatgg ctctctgcca caactatctc 1260 
tattggctat atgccaatac tctgtccttc agagactgac acggactctg tatttttaca 1320 
ggacggggtc ccatttatta tttacaaatt cacatataca acaacgccgt cccccgtgcc 13 80 
cgcagttttt attaaacata gcgtgggatc tccacgcgaa tctcgggtac gtgttccgga 144 0 
catgggctct tctccggtag cggcggagct tccacatccg agccctggtc ccatgcctcc 1500 
agcggctcat ggtcgctcgg cagctccttg ctcctaacag tggaggccag acttaggcac 1560 
agcacaatgc ccaccaccac cagtgtgccg cacaaggccg tggcggtagg gtatgtgtct 1620 
gaaaatgagc gtggagattg ggctcgcacg gctgacgcag atggaagact taaggcagcg 1680 
gcagaagaag atgcaggcag ctgagttgtt gtattctgat aagagtcaga ggtaactccc 1740 
gttgcggtgc tgttaacggt ggagggcagt gtagtctgag cagtactcgt tgctgccgcg 1800 
cgcgccacca gacataatag ctgacagact aacagactgt tcctttccat gggtcttttc 1860 
tgcagtcacc gtcgtcgaca tgggcgcccg cgccagcgtg ctgagcggcg gcaagttcga 1920 
cgcctgggag aagatccgcc tgcgccccgg cggcaagaag aagtaccgca tgaagcacct 198 0 
gatctgggcc agccgcgagc tggaccgctt cgccctgaac cccagcctgc tggagaccgc 2040 
cgacggctgc cagcagatca tggagaagct gcagcccgcc ctgaagaccg gcaccgagga 2100 
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gatcaagagc ctgttcaaca ccgtggccac 
gaaggacacc aaggaggccc bggacaagab 
gacccagcag gccgccgccg acaccggcaa 
cgtgcagaac gcccagggcc agatgatcca 
ctgggtgaag gbgabcgagg agaaggccbb 
ccbgagcgag ggcgccaccc cccaggacct 
ccaggccgcc atgcagatgc tgaaggacac 
ccbgcacccc gbgcacgccg gccccatccc 
cgacatcgcc ggcaccacca gcacccccca 
ccccatcccc gtgggcgaca tctacaagcg 
gcgcatgtac agccccgtga gcabcctgga 
cgacbacgbg gaccgcbbcb tcaagacctt 
gaactggatg accgacaccc tgctggtgca 
gcgcgccctg ggccccggcg ccaccctgga 

4 

cggccccggc cacaaggccc gcgtgctggc 
cabcabgabg cagcgcggca acttcaaggg 
caaggagggc cacctggccc gcaacbgccg 
cggcaaggag ggccaccaga bgaaggacbg 
ggccttccca caagggaagg ccagggaatt 
caccagaaga gagcttcagg tttggggaag 
cgatagacaa ggaactgtat ccbbbagctt 
cgtcacaaba aagatagggg gccagctgaa 
caccgtgctg gaggagatga acctgcccgg 
cggcggcttc abcaaggtgg gccagtacga 
ggccabcggc accgtgctgg tgggccccac 
gacccagatc ggctgcaccc tgaacttccc 
gctgaagccc ggcatggacg gccccaaggh 
caaggcccbg gbggagabcb gcaccgagat 
ccccgagaac ccctacaaca cccccgbgbb 
gcgcaagctg gtggacttcc gcgagctgaa 
gctgggcatc ccccaccccg ccggcctgaa 
gggcgacgcc tacbtcagcg tgcccctgga 
catccccagc abcaacaacg agacccccgg 
gggctggaag ggcagccccg ccatcttcca 
ccgcaagcag aaccccgaca tcgtgatcta 
cgacctggag atcggccagc accgcaccaa 
cbggggcbbc accacccccg acaagaagca 
ctacgagctg caccccgaca agtggaccgt 
ctggaccgtg aacgacabcc agaagctggt 
cgccggcabc aaggtgcgcc agcbgbgcaa 
ggbggbgccc ctgaccgagg aggccgagct 
ggagcccgbg cacggcgbgb acbacgaccc 
gcagggccag ggccagbgga ccbaccagab 
cggcaagbac gcccgcabga agggcgccca 
cgbgcagaag abcgccaccg agagcabcgt 
gcccabccag aaggagaccb gggaggccbg 
ccccgagbgg gagbtcgtga acaccccccc 
ggagcccatc abcggcgccg agaccbbcba 
gcbgggcaag gccggcbacg tgaccgaccg 
caccaccaac cagaagaccg agcbgcaggc 
ggaggbgaac abcgbgaccg acagccagba 
caagagcgag agcgagcbgg bgagccagab 
gtaccbggcc bgggbgcccg cccacaaggg 
ggbgagcgcc ggcatccgca aggbgcbgbb 
gcacgagaag baccacagca actggcgcgc 



ccbgbacbgc gbgcaccagc gcabcgacgb 2160 
cgaggagatc ' aagaacaaga gcaagcagaa 2220 
cagcagcaac gbgagccaga actaccccab 2280 
ccagagccbg agcccccgca ccctgaacgc 2340 
cagccccgag gtgatcccca bgbbcagcgc 2400 
gaacabgabg cbgaacabcg bgggcggcca 24 60 
cabcaacgag gaggccgccg agbgggaccg 2520 
ccccggccag abgcgcgagc cccgcggcag 25 8 0 
ggagcagabc ggcbggabga ccggcaaccc 2640 
cbggabcabc cbgggcctga acaagabcgb 2700 
cabcaagcag ggccccaagg agcccbbccg 2760 
ccgcgccgag caggccaccc aggacgbgaa 2820 
gaacgccaac cccgacbgca aggccabccb 2880 
ggagabgabg accgccbgcc agggcgbggg 2340 
cgaggccabg agccaggbgc agcacaccaa 3 000 
ccagaagcgc atcaagbgcb bcaacbgcgg 3 060 
cgcccbgcgc aagaagggcb gctggaagbg 3120 
caccgagcga caggcbaata gggaagabcb 3180 
bbcbtcagag cagaccagag ccaacagccc 3240 
agacaacaac bccctctcag aagcaggagc 3300 
cccbcagabc acbctbbggc agcgaccccb 3360 
ggaggcccbb cfcagacaccg gcgccgacga 3420 
ccgcbggaag cccaagabga bcggcggcab 3480 
ccagabccbg abcgagabcb gcggccacaa 354 0 
ccccgbgaac abcabcggcc gcaaccbgcb 3600 
cabcagcccc atcgagaccg bgcccgbgaa 3660 
gaagcagtgg ccccbgaccg aggagaagab 3720 
ggagaaggag ggcaagabca gcaagabcgg 3780 
cgccatcaag aagaaggaca gcaccaagbg 3840 
caagcgcacc caggacbbcb gggaggtgca 3 900 
gcagaagaag agcgbgaccg bgcbggacgb 3 960 
caaggacbbc cgcaagbaca ccgccbbcac 4020 
catccgcbac cagbacaacg bgcbgcccca 4080. 
gbgcagcabg accaagabcc bggagcccbt 414 0 
ccagbacabg gaccaccbgb acgtgggcag 4200 
gabcgaggag cbgcgccagc accbgcbgcg 4260 
ccagaaggag ccccccttcc bgbggabggg 4320 
gcagcccabc gbgcbgcccg agaaggacag 43 80 
gggcaagcbg aacbgggcca gccagabcba 4440 
gcbgcbgcgc ggcaccaagg cccbgaccga 4500 
ggagcbggcc gagaaccgcg agabccbgaa 4560 
cagcaaggac cbgabcgccg agabccagaa 4620 
cbaccaggag cccbbcaaga accbgaagac 4680 
caccaacgac gtgaagcagc bgaccgaggc 4740 
gabctggggc aagaccccca agbbcaagcb 4800 
gtggaccgag tacbggcagg ccaccbggab 4860 
ccbggbgaag cbgbggbacc agcbggagaa 4920 
cgbggacggc gccgccaacc gcgagaccaa 4980 
cggccgccag aaggbggbgc cccbgaccga 5040 
cabccaccbg gcccbgcagg acagcggccb 5100 
cgcccbgggc abcabccagg cccagcccga 5160 
cabcgagcag cbgabcaaga aggagaaggt 5220 
cabcggcggc aacgagcagg bggacggccb 5280 
cctggacggc atcgacaagg cccaggagga 5340 
catggccagc gacbbcaacc bgccccccgb 5400 
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ggtggccaag gagatcgtgg ccagctgcga caagtgccag ctgaagggcg aggccatgca 5460 
cggccaggtg gactgcagcc ccggcatctg gcagctggca tgcacccacc tggagggcaa 5520 
ggtgatcctg gtggccgtgc acgtggccag cggctacatc gaggccgagg tgatccccgc 5580 
cgagaccggc caggagaccg cctacttcct gctgaagctg gccggccgct ggcccgtgaa 5640 
gaccgtgcac accgacaacg gcagcaactt caccagcacc accgtgaagg ccgcctgctg 5700 
gtgggccggc atcaagcagg agbbcggcab ccccbacaac ccccagagcc agggcgtgat 5760 
cgagagcatg aacaaggagc tgaagaagat catcggccag gtgcgcgacc aggccgagca 5820 
ccbgaagacc gccgbgcaga tggccgtgtt catccacaac fctcaagcgca agggcggcat 5880 
cggcggctac agcgccggcg agcgcatcgt ggacatcatc gccaccgaca tccagaccaa 5940 
ggagctgcag aagcagatca ccaagatcca gaacbbccgc gbgtacbacc gcgacagccg 6000 
cgaccccgtg tggaagggcc ccgccaagct gctgtggaag ggcgagggcg ccgtggtgat 6060 
ccaggacaac agcgacatca aggtggbgcc ccgccgcaag gccaagatca tccgcgacta 6120 
cggcaagcag atggccggcg acgacbgcgb ggccagccgc caggacgagg acatgggcag 6180 
caagtggagc aagagcagca tcgtgggctg gagggaggtg agggagaggc tgaggcagac 6240 
cctggccgcc gccagaggcg tgggcgccgb gagccaggac ctggagaagt acggcgccgb 6300 
gaccagcagc aacgtgaacc accctagctg cgcctggctg gaggcccagg aggaggagga 6360 
ggtgggcttc cctgtgaggc ctcacgtgcc tctgaggcct atgacctaca agggcgcccb 6420 
ggacctgagc cactbccbga aggagaaggg cggcctggac ggccbgabct acagcaggag 6480 
gaggcaggag atcctggacc tgtgggtgta caacacccag ggctacttcc ctgactggca 6540 
gaactacaca cccggccctg gcgbgaggta ccctctgacc ttcggctggt gcttcaagct 6600 
ggtgcctgtg gaccctgacg aggbggagca ggccaacgag ggcgagaaca acagcctgct 6660 
gcaccctatg tgccagcacg gcabggacga cgaggagaag gagaccctga ggtggaggtb 6720 
cgacagcagg ctggccctga ggcacagggc ccaggagabg caccctgagb bctacaagga 6780 
ctgctgaaca cgtggcgaab tcttgctgtg cctbctagtt gccagccatc tgttgtttgc 6840 . 
cccbcccccg tgccbtccbb gaccctggaa ggtgccacbc ccacbgbccb ttcctaataa 6900 
aabgaggaaa ttgcatcgca tbgtctgagt aggbgbcatt ctattctggg gggtggggtg 6960 
gggcagcaca gcaaggggga ggabbgggaa gacaatagca ggcatgctgg ggatgcggtg 7020 
ggcbcbabgg gbacccaggt gctgaagaat bgacccggtb cctccbgggc cagaaagaag 7080 
caggcacatc cccbtctcbg tgacacaccc bgtccacgcc cctggbbcbb agttccagcc 7140 
ccactcabag gacacbcaba gctcaggagg gcbccgcctt caabcccacc cgctaaagba 7200 
ctbggagcgg tcbcbccctc cctcatcagc ccaccaaacc aaacctagcc tccaagagbg 7260 
ggaagaaabb aaagcaagab aggcbabbaa gbgcagaggg agagaaaatg cctccaacab 7320 
gbgaggaagb aabgagagaa atcatagaab bbctbccgct bccbcgctca ctgactcgct 7380 
gcgctcggbc gbbcggctgc ggcgagcggb abcagcbcac bcaaaggcgg baatacggbt 7440 
abccacagaa tcaggggata acgcaggaaa gaacabgbga gcaaaaggcc agcaaaaggc 7500 
caggaaccgb aaaaaggccg cgbbgcbggc gtbtbbccab aggcbccgcc ccccbgacga 7560 
gcabcacaaa aatcgacgcb caagtcagag gbggcgaaac ccgacaggac babaaagaba 7620 
ccaggcgbbb cccccbggaa gcbcccbcgb gcgcbctcct gtbccgaccc bgccgcbbac 7680 
cggatacctg tccgccbtbc bcccbtcggg aagcgbggcg ctbtcbcaab gcbcacgctg 7740 
baggtabcbc agtbcggbgb aggbcgtbcg cbccaagctg ggcbgbgbgc acgaaccccc 7800 
cgbbcagccc gaccgctgcg cctbatccgg baacbabcgt cbtgagbcca acccggtaag 7860 
acacgacbba bcgccactgg cagcagccac tggbaacagg abbagcagag cgaggbabgb 7920 
aggcggbgct acagagbtcb bgaagtggbg gccbaactac ggcbacacta gaaggacagb 7980 
atbbggbabc bgcgcbcbgc bgaagccagb fcaccbbcgga aaaagagfctg gtagcbcbbg 8040 
atccggcaaa caaaccaccg cbggbagcgg bggbbbbtfcb gbbbgcaagc agcagabbac 8100 
gcgcagaaaa aaaggatcbc aagaagabcc bbtgafcctbb bcbacggggb cbgacgcbca 8160 
gtggaacgaa aacbcacgbb aagggabbbb ggtcabgaga bbatcaaaaa ggatcbbcac 8220 
cbagabccbb bbaaabtaaa aabgaagtbb baaatcaabc baaagtabab abgagbaaac 8280 
bbggbctgac agbbaccaab gcbbaabcag bgaggcacct abctcagcga bcbgtcbabb 8340 
bcgbbcatcc abagtbgccb gacbccgggg ggggggggcg ctgaggbcfcg ccbcgbgaag 84 00 
aaggbgtbgc bgactcatac caggccbgaa bcgccccabc abccagccag aaagbgaggg 8460 
agccacggbt gabgagagct bbgbbgbagg bggaccagtt ggbgabbbtg aacbbbbgcb 8520 
bbgccacgga acggtcbgcg bbgbcgggaa gabgcgbgat cbgabccttc aacbcagcaa 8580 
aagbbcgabb babbcaacaa agccgccgbc ccgbcaagbc agcgbaatgc bcbgccagbg 8640 
bbacaaccaa bbaaccaatb cbgabbagaa aaacbcabcg agcatcaaat gaaactgcaa 8700 
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bbbabbcaba tcaggattat caabaccaba 
agaaaacbca ccgaggcagt tccataggat 
gacbcgbcca acatcaahac aacctattaa 
tgagaaatca ccatgagbga cgactgaatc 
ttbccagact tgttcaacag gccagccabb 
caaaccgtta btcabtcgbg attgcgcctg 
aggacaatta caaacaggaa tcgaatgcaa 
aatattttca cctgaatcag gatattctbc 
cgcagtggtg aghaaccatg cabcabcagg 
aggcabaaab bccgbcagcc agbbbagtcb 
gcbaccbbbg ccabgtbbca gaaacaacbc 
gabbgbcgca ccbgatbgcc cgacabbabc 
abccabgbbg gaatbtaabc gcggccbcga 
aacaccccbb gbatbacbgb bbabgbaagc 
bbbabctbgt gcaabgbaac abcagagatb 
ccabbabtga agcatbtabc agggbbabbg 
bbagaaaaab aaacaaabag gggbbccgcg 
cbaagaaacc abbabbatca bgacabb'aac 
bcgbc 



bbttbgaaaa agccgbbtcb gbaatgaagg 8760 
ggcaagabcc bggbabcggb cbgcgabbcc 8820 
bbbcccctcg bcaaaaataa ggtbabcaag 8880 
cggbgagaab ggcaaaagcb batgcattbc 8940 
acgcbcgtca bcaaaabcac bcgcabcaac 9000 
agcgagacga aabacgcgab cgcbgbbaaa 9060 
ccggcgcagg aacacbgcca gcgcabcaac 9120 
baabacctgg aabgcbgbbb bcccggggab 9180 
agtacggaba aaabgcbbga bggbcggaag 924 0 
gaccatcbca bcbgtaacab cabbggcaac 9300 
bggcgcabcg ggcttcccab acaatcgaba 9360 
gcgagcccab bbabacccab abaaabcagc .9420 
gcaagacgbb bcccgbbgaa batggcbcab 9480 
agacagbbbb abbgbbcabg abgababafcb 9540 
tbgagacaca acgtggctbt cccccccccc 9600 
tcbcatgagc ggatacabab tbgaabgtat 9660 
cacabbbccc cgaaaagbgc caccbgacgt 9720 
cbabaaaaab aggcgbabca cgaggcccbb 9780 

9785 
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